Shimmy Compatibility Wrappers#

The Shimmy package (pip install shimmy) allows commonly used external reinforcement learning environments to be used with PettingZoo and Gymnasium.

Supported multi-agent environments:#

OpenSpiel #

70+ implementations of various board games

DeepMind Control Soccer #

Multi-agent robotics environment where teams of agents compete in soccer.

DeepMind Melting Pot #

Suite of test scenarios for multi-agent reinforcement learning
Assesses generalization to novel social situations:
- familiar and unfamiliar individuals
- social interactions: cooperation, competition, deception, reciprocation, trust, stubbornness
50+ substrates and 250+ test scenarios

Usage#

To load a DeepMind Control multi-agent soccer game:

from shimmy import DmControlMultiAgentCompatibilityV0
from dm_control.locomotion import soccer as dm_soccer

env = dm_soccer.load(team_size=2)
env = DmControlMultiAgentCompatibilityV0(env, render_mode="human")

observations = env.reset()
while env.agents:
    actions = {agent: env.action_space(agent).sample() for agent in env.agents}  # this is where you would insert your policy
    observations, rewards, terminations, truncations, infos = env.step(actions)

For more information, see Shimmy DM Control Multi-Agent documentation

To load an OpenSpiel game of backgammon:

from shimmy import OpenspielCompatibilityV0
import pyspiel

env = pyspiel.load_game("backgammon")
env = OpenspielCompatibilityV0(game=env, render_mode=None)

env.reset()
for agent in env.agent_iter():
    observation, reward, termination, truncation, info = env.last()
    if termination or truncation:
        action = None
    else:
        action = env.action_space(agent).sample(info["action_mask"])  # this is where you would insert your policy
    env.step(action)
    env.render()

For more information, see Shimmy OpenSpiel documentation

To load a Melting Pot prisoner’s dilemma in the matrix substrate:

from shimmy import MeltingPotCompatibilityV0
env = MeltingPotCompatibilityV0(substrate_name="prisoners_dilemma_in_the_matrix__arena", render_mode="human")
observations = env.reset()
while env.agents:
    actions = {agent: env.action_space(agent).sample() for agent in env.agents}
    observations, rewards, terminations, truncations, infos = env.step(actions)
    env.step(actions)
env.close()

For more information, see Shimmy Melting Pot documentation

Multi-Agent Compatibility Wrappers:#

shimmy.dm_control_multiagent_compatibility.DmControlMultiAgentCompatibilityV0
shimmy.openspiel_compatibility.OpenspielCompatibilityV0
shimmy.meltingpot_compatibility.MeltingPotCompatibilityV0

Citation#

If you use this in your research, please cite:

@software{shimmy2022github,
  author = {{Jun Jet Tai, Mark Towers} and Elliot Tower and Jordan Terry},
  title = {Shimmy: Gymnasium and Pettingzoo Wrappers for Commonly Used Environments},
  url = {http://github.com/Farama-Foundation/Shimmy},
  version = {0.2.0},
  year = {2022},
}```