- Conda environment with Python 3.8
- `pip install vector-quantize-pytorch==1.8.1`
- `pip install positional-encodings==6.0.3`
- `pip install einops==0.3.2`
- `pip install hydra-core==1.2.0`
- Other dependencies such as torch, torchvision, transformers, etc.
- LIBERO-specific dependencies are in the `requirements.txt` file.
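The steps above can be collected into a single environment setup; a minimal sketch, assuming an environment named `skillvae` (the name is an assumption; any name works) and that it is run from the repository root:

```shell
# Sketch of the setup steps above; the env name "skillvae" is an assumption.
conda create -n skillvae python=3.8 -y
conda activate skillvae
pip install vector-quantize-pytorch==1.8.1 \
            positional-encodings==6.0.3 \
            einops==0.3.2 \
            hydra-core==1.2.0
pip install -r requirements.txt  # LIBERO-specific dependencies
```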
Model files are in the `libero/lifelong/models` directory. Description:
- `skill_vae.py`: stage 1 policy file; uses the base_policy class, imports the stage 1 model and the observation encoder.
- `modules/skill_vae_modules.py`: SkillVAE model (stage 1).
- `skill_GPT.py`: stage 2 policy file; uses the base_policy class, imports both the stage 1 model and the stage 2 GPT model.
- `modules/skill_utils.py`: contains the stage 2 GPT model and other stage 1 modules.
- `modules/rgb_modules.py`: contains the visual observation encoder.
Config files are in the `libero/configs` directory. Description:
- `pretrain.yaml`: stage 1 training.
- `skillGPT.yaml`: stage 2 training.
- `few_shot.yaml`: few-shot training on LIBERO-10.

All of these files use the default files in the `libero/configs` directory.

Model-specific config files:
- `policy/skill_vae.yaml`: stage 1 model.
- `policy/skill_GPT.yaml`: stage 2 model.

Read the comments in the config files for more details.
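Because training is configured through Hydra (`hydra-core` is pinned in the dependencies), any value in these YAML files can be overridden on the command line as dotted `key=value` pairs, and a config group can be swapped with `group=option`. A hedged sketch of the syntax — the override keys shown are hypothetical illustrations, not taken from the repo's configs:

```shell
# Hypothetical Hydra overrides (key names are assumptions; check the YAML files
# in libero/configs for the real ones):
python libero/lifelong/pretrain.py policy=skill_vae train.n_epochs=100
```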
Main scripts for training are in the `libero/lifelong` directory. These are LIBERO-specific scripts that load the model, the data, and the environment (for evals), and train the model. The main scripts are:
- `pretrain.py`: pretrains the SkillVAE model.
- `skill_policy.py`: trains the skill policy.

Both scripts load the `algo/multitask.py` algo, which loads multitask data and trains the model (LIBERO-specific).
- Command to train stage 1:
  `export CUDA_VISIBLE_DEVICES=0 && export MUJOCO_EGL_DEVICE_ID=0 && python libero/lifelong/pretrain.py`
- Command to train stage 2:
  `export CUDA_VISIBLE_DEVICES=0 && export MUJOCO_EGL_DEVICE_ID=0 && python libero/lifelong/skill_policy.py`
- Command to finetune:
  `export CUDA_VISIBLE_DEVICES=0 && export MUJOCO_EGL_DEVICE_ID=0 && python libero/lifelong/few_shot.py`
- Command to evaluate:
  `export CUDA_VISIBLE_DEVICES=0 && export MUJOCO_EGL_DEVICE_ID=0 && python libero/lifelong/skill_policy_eval.py`
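All four commands share the same two exports: `CUDA_VISIBLE_DEVICES` restricts PyTorch to a single GPU, and `MUJOCO_EGL_DEVICE_ID` points MuJoCo's offscreen EGL renderer at the same device. A standalone sketch (device index 0 is an assumption; match it to your machine):

```shell
# Pin both PyTorch and MuJoCo's EGL renderer to the same GPU (index assumed 0).
export CUDA_VISIBLE_DEVICES=0
export MUJOCO_EGL_DEVICE_ID=0
echo "GPU=${CUDA_VISIBLE_DEVICES} EGL=${MUJOCO_EGL_DEVICE_ID}"
# prints: GPU=0 EGL=0
```

Keeping the two indices in sync matters on multi-GPU nodes: otherwise training runs on one card while environment rendering occupies another.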
- Start a free interactive session with an RTX 6000 GPU:
  `salloc -A gts-agarg35 -N1 --mem-per-gpu=12G -q embers -t8:00:00 --gres=gpu:RTX_6000:1`
- Load Conda:
  `module load anaconda3/2022.05.0.1`
- Submit a training run as a batch job:
  `sbatch slurm/run.sbatch python libero/lifelong/pretrain.py`