Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classiﬁcation

Currently only supports the training of env door-human-v0. The support of the training of other environments will come out subsequently.

Requirements

pip install -e .
git clone https://github.com/rail-berkeley/d4rl.git
cd d4rl
pip install -e .

All the arguments can be found in argments.py.

python trainer.py

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
imgs		imgs
README.md		README.md
agent.py		agent.py
argments.py		argments.py
models.py		models.py
replaybuffer.py		replaybuffer.py
requirements.txt		requirements.txt
run.sh		run.sh
trainer.py		trainer.py