Mario Bros#
This environment is part of the Atari environments. Please read that page first for general information.
Import |
|
---|---|
Actions |
Discrete |
Parallel API |
Yes |
Manual Control |
No |
Agents |
|
Agents |
2 |
Action Shape |
(1,) |
Action Values |
[0,17] |
Observation Shape |
(210, 160, 3) |
Observation Values |
(0,255) |
A mixed-sum game of planning and control.
The main goal is to kick a pest off the floor. This requires 2 steps:
Hit the floor below the pest, flipping it over. This knocks the pest on its back.
You to move up onto the floor where the pest is and you can kick it off. This earns +800 reward
Note that since this process has two steps there are opportunities for the two agents to either collaborate by helping each other knock pests over and collect them (potentially allowing both to collect reward more quickly), or for agents to steal the other’s work.
If you run into an active pest or a fireball, you lose a life. If you lose all your lives, you are done, and the other player keeps playing. You can gain a new life after earning 20000 points.
There are other ways of earning points, by collecting bonus coins or wafers, earning 800 points each.
Environment parameters#
Environment parameters are common to all Atari environments and are described in the base Atari documentation .
Action Space#
In any given turn, an agent can choose from one of 18 actions.
Action |
Behavior |
---|---|
0 |
No operation |
1 |
Fire |
2 |
Move up |
3 |
Move right |
4 |
Move left |
5 |
Move down |
6 |
Move upright |
7 |
Move upleft |
8 |
Move downright |
9 |
Move downleft |
10 |
Fire up |
11 |
Fire right |
12 |
Fire left |
13 |
Fire down |
14 |
Fire upright |
15 |
Fire upleft |
16 |
Fire downright |
17 |
Fire downleft |
Version History#
v3: Minimal Action Space (1.18.0)
v2: Breaking changes to entire API (1.4.0)
v1: Fixes to how all environments handle premature death (1.3.0)
v0: Initial versions release (1.0.0)
Usage#
AEC#
from pettingzoo.atari import mario_bros_v3
env = mario_bros_v3.env(render_mode="human")
env.reset(seed=42)
for agent in env.agent_iter():
observation, reward, termination, truncation, info = env.last()
if termination or truncation:
action = None
else:
# this is where you would insert your policy
action = env.action_space(agent).sample()
env.step(action)
env.close()
Parallel#
from pettingzoo.atari import mario_bros_v3
env = mario_bros_v3.parallel_env(render_mode="human")
observations, infos = env.reset()
while env.agents:
# this is where you would insert your policy
actions = {agent: env.action_space(agent).sample() for agent in env.agents}
observations, rewards, terminations, truncations, infos = env.step(actions)
env.close()