You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
1Carnegie Mellon University 2University of Pennsylvania 3Google Research, Brain Team 4University of Toronto
Official implementation of the Lexa agent from the paper Discovering and Achieving Goals via World Models.
Setup
Create the conda environment by running :
conda env create -f environment.yml
Clone the lexa-benchmark repo, and modify the python path export PYTHONPATH=<path to lexa-training>/lexa:<path to lexa-benchmark>
Export the following variables for rendering export MUJOCO_RENDERER=egl; export MUJOCO_GL=egl
WARNING! Make sure to use the right python and mujoco version. The robobin environment code is known to break with other versions. Other environments might or might not work.
Training
First source the environment : source activate lexa
where method can be lexa_temporal, lexa_cosine, ddl, diayn or gcsl
Supported tasks are dmc_walker_walk, dmc_quadruped_run, robobin, kitchen, joint. The time limit should be 1000 for DMC and default otherwise.
To view the graphs and gifs during training, run tensorboard --logdir <log path>
Bibtex
If you find this code useful, please cite:
@misc{lexa2021,
title={Discovering and Achieving Goals via World Models},
author={Mendonca, Russell and Rybkin, Oleh and
Daniilidis, Kostas and Hafner, Danijar and Pathak, Deepak},
year={2021},
Booktitle={NeurIPS}
}