Skip to content

Learning RL by implementing and analysing different RL methods.

Notifications You must be signed in to change notification settings

mvanaltvorst/reinf_learn

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Reinforcement learning experiments

Learning RL by implementing and analysing different RL methods from scratch.

RL Snake game visualisation

Directory Game Number of agents RL method
nim-dqn Nim-21 2 Deep Q-network
nim-a2c Nim-21 2 Advantage Actor Critic
matching-pennies-a2c Matching Pennies 2 Advantage Actor Critic
snake-a2c Snake 1 Advantage Actor Critic
snake-ppo Snake 1 Proximal Policy Optimisation

MLFlow

I'm also using this project to learn more about MLFlow. Some of the train scripts depend on an actively running tracking server. Please check MLFlow documentation on how to start a tracking server and set the MLFLOW_URI environment variable to the correct tracking server URL.

MLflow performance metrics

About

Learning RL by implementing and analysing different RL methods.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published