This code formalizes the problem of learning norms from observing other agents' actions in multi-agent Markov games, via approximate Bayesian rule induction over obligative and prohibitive norms. The environment is based on DeepMind's Melting Pot project, combining the games Commons Harvest, Clean Up, and Territory so that agents can learn and sustain a range of norms handling dilemmas such as the tragedy of the commons, social-role-conditioned labor, and territorial norms.
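To make the two norm types concrete, here is a minimal sketch of how prohibitive and obligative rules over observable state features might be represented. The class and field names are illustrative assumptions, not the repository's actual API:

```python
from dataclasses import dataclass
from typing import Callable, Dict

# Illustrative sketch only: a norm is a rule over observable state features.
# A prohibition forbids an action whenever its precondition holds; an
# obligation demands that a goal condition be brought about once triggered.

@dataclass
class ProhibitionNorm:
    precondition: Callable[[Dict], bool]  # e.g. lambda obs: obs["num_apples"] < 2
    forbidden_action: str                 # e.g. "EAT_APPLE"

@dataclass
class ObligationNorm:
    trigger: Callable[[Dict], bool]         # e.g. lambda obs: obs["dirt_level"] > 0.5
    obligated_goal: Callable[[Dict], bool]  # e.g. lambda obs: obs["cleaned_dirt"]

def violates(norm: ProhibitionNorm, obs: Dict, action: str) -> bool:
    """True if the precondition holds and the forbidden action is taken anyway."""
    return norm.precondition(obs) and action == norm.forbidden_action
```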
You can use this code to run your own experiments. The main test scenarios are (1) pure norm learning from a predefined set of norms, (2) intergenerational norm transmission, and (3) spontaneous norm emergence.
For more information, see our paper:
Ninell Oldenburg & Tan Zhi-Xuan. 2024. Learning and Sustaining Shared Normative Systems via Bayesian Rule Induction in Markov Games. In Proc. of the 23rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2024).
This code requires Python 3.10 or higher. Follow these steps:
- Go to the Melting Pot repository and follow their installation steps. It's worth skimming their repository and docs first to get an idea of the terminology used and of how and why things are structured the way they are. From their docs: if you get a `ModuleNotFoundError: No module named 'meltingpot.python'` error, you can solve it by exporting the `meltingpot` home directory as `PYTHONPATH` (e.g. by calling `export PYTHONPATH=$(pwd)`).
- Download the `meltingpot` directory from this repo and replace the `meltingpot` directory of your newly installed Melting Pot instance with it. Note: make sure the to-be-replaced directory is your local version of this one -- their repository structure is something like `meltingpot/meltingpot`, so take care not to confuse the two.
- Done! Now you can run the files.
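If you want to double-check that the installation and `PYTHONPATH` are set up correctly, a quick sanity check is to import the package from Python (the module path below is the one from the error message above):

```python
# Sanity check: should succeed once the meltingpot home directory is on
# PYTHONPATH (e.g. via `export PYTHONPATH=$(pwd)` from the repository root).
import meltingpot.python  # noqa: F401

print("Melting Pot import succeeded")
```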
You can play around in three different modes that differ in how many agents believe in a set of norms at the onset of the episode and in whether agents are mortal. The potential norms are created at agent creation time from a set of observable environment features (see `meltingpot/python/utils/rule_generation.py`; a sketch of the idea follows below).
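At a high level, this generation step can be thought of as a combinatorial enumeration of candidate rules from environment predicates. The following sketch illustrates the idea with made-up feature and action names; it is not the actual feature set used in `rule_generation.py`:

```python
import itertools

# Illustrative only: enumerate candidate prohibitions as
# (precondition, forbidden action) pairs over observable features.
PRECONDITIONS = ["num_apples < 2", "in_other_agents_territory", "dirt_level > 0.5"]
ACTIONS = ["EAT_APPLE", "MOVE_FORWARD", "CLAIM_TERRITORY"]

candidate_rules = [
    {"if": precondition, "forbid": action}
    for precondition, action in itertools.product(PRECONDITIONS, ACTIONS)
]
print(f"{len(candidate_rules)} candidate prohibitions")  # 3 x 3 = 9 here
```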
- With `learning` you can see how well and how fast norms are picked up by one learning agent, i.e. an agent that does not believe in any norms at the beginning. We detail this process of approximate Bayesian rule induction in our paper; the rough idea (sketched after this list) is that some experienced agents believe in a predefined set of norms and the learning agent learns those norms by observation.
- With `intergen` the idea is similar to `learning`, only that now agents die at regular intervals (every episode length / # agents steps) and resurrect without any norm beliefs. Here you can test which norms can be transmitted over generations.
- Lastly, with `emergence` you can see which norms evolve when no agent believes in any norm. Agents will naturally exhibit some behavior to survive; will they accidentally follow any norms?
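As rough intuition for the `learning` mode, the learner can be thought of as maintaining a posterior over each candidate norm and updating it from observed (non-)compliance. The sketch below uses an assumed likelihood model (norm believers comply with probability 1 - eps, otherwise compliance is chance); see the paper and `rule_learning_policy` for the actual computation:

```python
# Minimal sketch of Bayesian rule induction for a single candidate norm.
# The likelihood model is an illustrative assumption, not the paper's exact one.

def update_belief(prior: float, complied: bool, eps: float = 0.05) -> float:
    """Posterior probability that the norm is in force after one observation.

    If the norm is in force, agents comply with probability 1 - eps; if it
    is not, compliance happens by chance with probability 0.5.
    """
    like_norm = (1 - eps) if complied else eps
    like_no_norm = 0.5
    evidence = like_norm * prior + like_no_norm * (1 - prior)
    return like_norm * prior / evidence

belief = 0.5  # uninformed prior over, say, "don't eat the last apple"
for complied in [True, True, True, False, True]:
    belief = update_belief(belief, complied)
print(f"posterior that the norm is in force: {belief:.3f}")
```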
Don't be discouraged by the slightly overwhelming number of files in this folder! There are two important directories:
- `meltingpot` for all substrates (`meltingpot/python/configs/substrates`), agent planning and learning policies (`meltingpot/python/utils/policies/rule_{adjusting|learning}_policy`), and the Lua backend (`meltingpot/lua/levels/rule_obeying_harvest/components.lua`)
- `examples` for all code that can invoke the backend (`examples/eval_scripts/view_{learning|intergen|emergence}_model.py`), experiments (`examples/eval_scripts/eval_{learning|intergen|emergence}_model.py`), and our results (`examples/results_{learning|intergen|emerge}`), each referring to one of the three modes detailed above.
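To launch one of the three modes, you would run the corresponding view script. Here is one (assumed) way to do so from Python, equivalent to calling the script directly from the command line; whether the scripts take additional arguments is not documented here, so none are passed:

```python
# Hypothetical launcher for the three view scripts; adjust `mode` as needed.
import subprocess
import sys

mode = "learning"  # or "intergen" / "emergence"
subprocess.run(
    [sys.executable, f"examples/eval_scripts/view_{mode}_model.py"],
    check=True,
)
```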