This is the codebase accompanying the IJCAI2022 paper "Online Planning in POMDPs with Self-Improving Simulators" by Jinke He, Miguel Suau, Hendrik Baier, Michael Kaisers, Frans A. Oliehoek.
- Singularity
- libtorch (C++ version of PyTorch cpu):
download and unzip libtorch (version 1.10) into third-party/libtorch
- yaml-cpp:
git clone https://github.com/jbeder/yaml-cpp third-party/yaml-cpp
We implemented our online planning experiments in C++ and provided a Singularity definition file (singularity/FADMEN.def) to resolve the dependencies.
To run the code, first build the singularity container with the command: sudo singularity build singularity/FADMEN.sif singularity/FADMEN.def
.
To execute a command under the singularity container, use ./run
+ command.
This codebase does not support the use of GPUs. All inference is done in CPUs.
./run bash scripts/build.sh
./run ./scripts/run_benchmark
+ path to config file
We repeat this experiment for
for lambda ./run scripts/run_benchmark configs/GAC/simulation_controlled/mec_0.3_lam_{lambda}.yaml
We repeat this experiment for
./run scripts/run_benchmark configs/GTC/time_controlled/global.yaml
for lambda ./run scripts/run_benchmark configs/GTC/time_controlled/0.015625sec_mec_0.3_lam_{lambda}.yaml
We repeat this experiment for
./run scripts/run_benchmark configs/GTC/time_controlled/global.yaml
for lambda ./run scripts/run_benchmark configs/GTC/time_controlled/ep1500_gru_8_mean_mec_0.1_lam_{lambda}_0.0625sec.yaml
Results are smoothed with gaussian_filter1d
from scipy.ndimage
where sigma
is set to
Feel free to contact us if you are interested in this work!
[email protected] / [email protected]
This project had received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 758824 — INFLUENCE).