Mario Bros

../../../_images/atari_mario_bros.gif

This environment is part of the Atari environments. Please read that page first for general information.

Import

from pettingzoo.atari import mario_bros_v3

Actions

Discrete

Parallel API

Yes

Manual Control

No

Agents

agents= ['first_0', 'second_0']

Agents

2

Action Shape

(1,)

Action Values

[0,17]

Observation Shape

(210, 160, 3)

Observation Values

(0,255)

A mixed-sum game of planning and control.

The main goal is to kick a pest off the floor. This requires 2 steps:

  1. Hit the floor below the pest, flipping it over. This knocks the pest on its back.

  2. You to move up onto the floor where the pest is and you can kick it off. This earns +800 reward

Note that since this process has two steps there are opportunities for the two agents to either collaborate by helping each other knock pests over and collect them (potentially allowing both to collect reward more quickly), or for agents to steal the other’s work.

If you run into an active pest or a fireball, you lose a life. If you lose all your lives, you are done, and the other player keeps playing. You can gain a new life after earning 20000 points.

There are other ways of earning points, by collecting bonus coins or wafers, earning 800 points each.

Official mario bros manual

Environment parameters

Environment parameters are common to all Atari environments and are described in the base Atari documentation .

Action Space

In any given turn, an agent can choose from one of 18 actions.

Action

Behavior

0

No operation

1

Fire

2

Move up

3

Move right

4

Move left

5

Move down

6

Move upright

7

Move upleft

8

Move downright

9

Move downleft

10

Fire up

11

Fire right

12

Fire left

13

Fire down

14

Fire upright

15

Fire upleft

16

Fire downright

17

Fire downleft

Version History

  • v3: Minimal Action Space (1.18.0)

  • v2: Breaking changes to entire API (1.4.0)

  • v1: Fixes to how all environments handle premature death (1.3.0)

  • v0: Initial versions release (1.0.0)

Usage

AEC

from pettingzoo.atari import mario_bros_v3

env = mario_bros_v3.env(render_mode="human")
env.reset(seed=42)

for agent in env.agent_iter():
    observation, reward, termination, truncation, info = env.last()

    if termination or truncation:
        action = None
    else:
        # this is where you would insert your policy
        action = env.action_space(agent).sample()

    env.step(action)
env.close()

Parallel

from pettingzoo.atari import mario_bros_v3

env = mario_bros_v3.parallel_env(render_mode="human")
observations, infos = env.reset()

while env.agents:
    # this is where you would insert your policy
    actions = {agent: env.action_space(agent).sample() for agent in env.agents}

    observations, rewards, terminations, truncations, infos = env.step(actions)
env.close()

API

class pettingzoo.atari.mario_bros.mario_bros.raw_env(**kwargs)[source]