You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is the official repository for Conservative Distributional Offline Reinforcement Learning.
We provide the commands to run the Risky PointMass, Risky Ant, and D4RL experiments included in the paper. This repository is made minimal for ease of experimentation.
Installations
This repository requires Python (>3.7), Pytorch (version 1.6.0 or above), and installation of the D4RL dataset. Mujoco license
is also required in order to run the D4RL experiments. Packages gym, numpy, and wandb (optionally) are also needed (any version should work). To get started,
run the following commands to create a conda environment (assuming CUDA10.1):
If you find this repository useful for your research, please cite:
@article{ma2021conservative,
title={Conservative Offline Distributional Reinforcement Learning},
author={Yecheng Jason Ma and Dinesh Jayaraman and Osbert Bastani},
year={2021},
url={https://arxiv.org/abs/2107.06106}
}
Contact
If you have any questions regarding the code or paper, feel free to contact me at jasonyma@seas.upenn.edu or open an issue on this repository.