MAPPO fails to converge on VMAS Balance task with default configuration

**Description**
I recently cloned the repository and attempted to run the default example following the README instructions. However, the training process fails to converge. Actually,  it does not converge when using QMix.

**Modifications**
I have not modified any core logic.
The only change made was adding TensorBoard logging for visualization.

**Steps to Reproduce**
Clone the repository.
Install dependencies.
Run the following command: python benchmarl/run.py algorithm=mappo task=vmas/balance

**Expected Behavior**
The model should show learning progress and converge on the vmas/balance task, distance to goal should become shorter and shorter.
**Actual Behavior**
The training does not converge.Please see the attached TensorBoard screenshots below:

<img width="1420" height="1985" alt="Image" src="https://github.com/user-attachments/assets/6d211c47-8231-4a8c-b324-d977a6ad1a73" />

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MAPPO fails to converge on VMAS Balance task with default configuration #249

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

MAPPO fails to converge on VMAS Balance task with default configuration #249

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions