Skip to content
/ RCE Public

pytorch implementation of paper - "Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification"

Notifications You must be signed in to change notification settings

Ricky-Zhu/RCE

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification

MIT License
This is the pytorch implementation of "Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification". The original tensorflow version could be found here.

Currently only supports the training of env door-human-v0. The support of the training of other environments will come out subsequently.

Requirements

  • python 3.7
  • register wandb account
  • mujoco
  • other packages can be found in requirements.txt

Setup the environment

pip install -e .
git clone https://github.com/rail-berkeley/d4rl.git
cd d4rl
pip install -e .

TODO List

  • support the training of other envs in the metaworld.

Reproduce experiments

All the arguments can be found in argments.py.

python trainer.py

Results

Door-human-v0

door-human

About

pytorch implementation of paper - "Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published