Dataset Cards - OMIGA
2c_vs_64zg - Download
Metadata
Environment name | Version | Agents | Action type | Observation size | Reward type |
---|---|---|---|---|---|
SMAC (v1) | Modified version of SMAC v1, popularised by MAPPO | 2 | Discrete | [478] | Dense |
Generation procedure for each dataset
Converted from omiga format to a Vault.
Summary statistics
Uid | Episode return mean | Min return | Max return | Transitions | Trajectories | Joint SACo |
---|---|---|---|---|---|---|
Poor | 8.91 ± 1.01 | 2.53 | 10.00 | 10830 | 348 | 1.00 |
Medium | 13.00 ± 1.39 | 10.01 | 15.00 | 37940 | 1001 | 1.00 |
Good | 19.94 ± 1.26 | 15.18 | 21.61 | 59215 | 1001 | 1.00 |
6h_vs_8z - Download
Metadata
Environment name | Version | Agents | Action type | Observation size | Reward type |
---|---|---|---|---|---|
SMAC (v1) | Modified version of SMAC v1, popularised by MAPPO | 6 | Discrete | [172] | Dense |
Generation procedure for each dataset
Converted from omiga format to a Vault.
Summary statistics
Uid | Episode return mean | Min return | Max return | Transitions | Trajectories | Joint SACo |
---|---|---|---|---|---|---|
Poor | 9.12 ± 0.81 | 4.80 | 9.99 | 24255 | 1001 | 1.00 |
Medium | 11.97 ± 1.26 | 10.00 | 14.99 | 29511 | 1001 | 1.00 |
Good | 17.84 ± 2.15 | 15.01 | 20.02 | 38040 | 1001 | 1.00 |
5m_vs_6m - Download
Metadata
Environment name | Version | Agents | Action type | Observation size | Reward type |
---|---|---|---|---|---|
SMAC (v1) | Modified version of SMAC v1, popularised by MAPPO | 5 | Discrete | [124] | Dense |
Generation procedure for each dataset
Converted from omiga format to a Vault.
Summary statistics
Uid | Episode return mean | Min return | Max return | Transitions | Trajectories | Joint SACo |
---|---|---|---|---|---|---|
Poor | 8.50 ± 1.19 | 1.81 | 9.89 | 22747 | 1001 | 0.96 |
Medium | 11.03 ± 0.58 | 10.08 | 11.96 | 27717 | 1001 | 0.95 |
Good | 20.00 ± 0.00 | 20.00 | 20.00 | 27734 | 1001 | 0.96 |
corridor - Download
Metadata
Environment name | Version | Agents | Action type | Observation size | Reward type |
---|---|---|---|---|---|
SMAC (v1) | Modified version of SMAC v1, popularised by MAPPO | 6 | Discrete | [346] | Dense |
Generation procedure for each dataset
Converted from omiga format to a Vault.
Summary statistics
Uid | Episode return mean | Min return | Max return | Transitions | Trajectories | Joint SACo |
---|---|---|---|---|---|---|
Poor | 4.93 ± 1.71 | 0.00 | 9.99 | 51268 | 1001 | 1.00 |
Medium | 13.07 ± 1.27 | 10.02 | 14.99 | 126012 | 1001 | 1.00 |
Good | 19.88 ± 1.01 | 15.01 | 20.49 | 100170 | 1001 | 1.00 |
6halfcheetah - Download
Metadata
Environment name | Version | Agents | Action type | Observation size | Reward type |
---|---|---|---|---|---|
MAMuJoCo | V1.0, Mujoco v200 | 6 | Continuous | [23] | Dense |
Generation procedure for each dataset
Converted from omiga format to a Vault.
Summary statistics
Uid | Episode return mean | Min return | Max return | Transitions | Trajectories | Joint SACo |
---|---|---|---|---|---|---|
Medium-Replay | 655.76 ± 590.40 | -198.77 | 2132.60 | 1001000 | 1000 | 1.00 |
Medium-Expert | 2105.38 ± 1073.24 | 251.94 | 3866.09 | 2002000 | 2000 | 1.00 |
Medium | 1425.66 ± 520.12 | 251.94 | 2113.52 | 1001000 | 1000 | 1.00 |
Expert | 2785.10 ± 1053.14 | 317.94 | 3866.09 | 1001000 | 1000 | 1.00 |
2ant - Download
Metadata
Environment name | Version | Agents | Action type | Observation size | Reward type |
---|---|---|---|---|---|
MAMuJoCo | V1.0, Mujoco v200 | 2 | Continuous | [113] | Dense |
Generation procedure for each dataset
Converted from omiga format to a Vault.
Summary statistics
Uid | Episode return mean | Min return | Max return | Transitions | Trajectories | Joint SACo |
---|---|---|---|---|---|---|
Medium-Replay | 1029.51 ± 141.27 | 895.37 | 1517.06 | 1751750 | 1750 | 0.66 |
Medium-Expert | 1736.88 ± 319.64 | 840.77 | 2124.15 | 2002000 | 2000 | 1.00 |
Medium | 1418.70 ± 37.04 | 840.77 | 1473.86 | 1001000 | 1000 | 1.00 |
Expert | 2055.07 ± 22.07 | 1994.03 | 2124.15 | 1001000 | 1000 | 1.00 |
3hopper - Download
Metadata
Environment name | Version | Agents | Action type | Observation size | Reward type |
---|---|---|---|---|---|
MAMuJoCo | V1.0, Mujoco v200 | 3 | Continuous | [14] | Dense |
Generation procedure for each dataset
Converted from omiga format to a Vault.
Summary statistics
Uid | Episode return mean | Min return | Max return | Transitions | Trajectories | Joint SACo |
---|---|---|---|---|---|---|
Medium-Replay | 746.42 ± 671.89 | 70.76 | 2801.15 | 1314826 | 4160 | 1.00 |
Medium-Expert | 1190.61 ± 973.40 | 95.27 | 3762.69 | 1919782 | 5481 | 1.00 |
Medium | 723.57 ± 211.66 | 128.38 | 2776.49 | 919391 | 4000 | 1.00 |
Expert | 2452.02 ± 1097.86 | 95.27 | 3762.69 | 1000391 | 1481 | 1.00 |