ReQueST — (Re)ward (Que)ry (S)ynthesis via (T)rajectory Optimization
ReQueST is a reward modeling algorithm that asks the user for feedback on hypothetical trajectories synthesized using a pretrained model of the environment dynamics, instead of real trajectories generated by rolling out a partially-trained agent in the environment. Compared to previous approaches, this enables training more robust reward models that work off-policy, learning about unsafe states without visiting them, and better query efficiency through the use of active learning.
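To make the query-synthesis idea concrete, here is a minimal, self-contained Python sketch of the loop. It is not the rqst API: the toy dynamics, feature map, random-shooting optimizer, and ensemble-disagreement acquisition below are all illustrative stand-ins for the learned models used in the paper. The structural point is that every query shown to the user comes from the dynamics model, so no real environment rollouts are needed.

```python
import numpy as np

rng = np.random.default_rng(0)

def step(state, action):
    # Stand-in for a pretrained dynamics model: damped 1-D dynamics.
    return 0.9 * state + 0.1 * action

def rollout(actions, state=0.0):
    # Roll a candidate action sequence through the dynamics model.
    states = [state]
    for a in actions:
        states.append(step(states[-1], a))
    return np.asarray(states)

def features(traj):
    # Toy trajectory features scored by the linear reward models.
    return np.array([traj.mean(), np.abs(traj).max()])

def synthesize_query(ensemble, horizon=10, n_candidates=256):
    # Trajectory optimization by random shooting: return the candidate
    # whose predicted reward the ensemble disagrees on most (a simple
    # active-learning acquisition).
    candidates = [rollout(rng.normal(size=horizon)) for _ in range(n_candidates)]
    scores = [np.var(ensemble @ features(t)) for t in candidates]
    return candidates[int(np.argmax(scores))]

def user_label(traj):
    # Stand-in for the human: prefers trajectories that stay near zero.
    return -np.abs(traj).max()

# The ReQueST loop: synthesize a hypothetical query with the dynamics
# model, get the user's label, refit an ensemble of linear reward models.
ensemble = rng.normal(size=(8, 2))  # 8 reward models over 2 features
data = []
for _ in range(20):
    traj = synthesize_query(ensemble)
    data.append((features(traj), user_label(traj)))
    X = np.array([x for x, _ in data])
    y = np.array([r for _, r in data])
    for i in range(len(ensemble)):
        # Bootstrap resampling keeps the ensemble members diverse.
        idx = rng.integers(len(data), size=len(data))
        ensemble[i] = np.linalg.lstsq(X[idx], y[idx], rcond=None)[0]

print("ensemble-mean reward weights:", ensemble.mean(axis=0))
```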
This codebase implements ReQueST in three domains.

Setup
1. Set wm_dir, mnist_dir, and home_dir in ReQueST/utils.py.
2. Install the rqst package with python setup.py install.
3. Download data.zip, then unzip it into ReQueST/data.
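For reference, step 1 might look like the following in ReQueST/utils.py. Only the three variable names come from the instructions above; the values and the layout under ReQueST/data are assumptions that depend on your machine.

```python
import os

# Hypothetical values; adjust to wherever you cloned the repo and
# unzipped data.zip. Only the variable names are taken from the README.
home_dir = os.path.expanduser('~/ReQueST')
wm_dir = os.path.join(home_dir, 'data', 'wm')        # pretrained world model
mnist_dir = os.path.join(home_dir, 'data', 'mnist')  # MNIST data
```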
Jupyter notebooks in ReQueST/notebooks provide an entry point to the codebase, where you can play around with the environments, visualize synthesized queries, and reproduce the figures from the paper.
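One way to launch them, assuming Jupyter is installed; any of the standard launch methods works equally well:

```python
import subprocess

# Start a notebook server rooted at the notebooks directory.
subprocess.run(['jupyter', 'notebook', 'ReQueST/notebooks'], check=True)
```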
Citation
If you find this software useful in your work, we kindly request that you cite the following
paper:
@article{ReQueST,
title={Learning Human Objectives by Evaluating Hypothetical Behavior},
author={Reddy, Siddharth and Dragan, Anca D. and Levine, Sergey and Legg, Shane and Leike, Jan},
journal={arXiv preprint arXiv:1912.05652},
year={2019}
}
Disclaimer
This is not an officially supported Google product.