COMP-579-Reinforcement-Learning

Independently completed all assignments for McGill's reinforcement learning course taught by Prof. Doina Precup.

Assignment 1

Experimented with the k-armed bandit problem: implemented the epsilon-greedy, UCB, and Thompson sampling algorithms, tested them across a range of hyperparameter values, and computed and plotted the regret.
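
As a rough illustration of the kind of experiment described above, here is a minimal sketch of epsilon-greedy on a Gaussian k-armed bandit that tracks cumulative regret. It is not the notebook's code: the function name `run_epsilon_greedy`, the arm means, the unit reward variance, and the default hyperparameters are all assumptions chosen for the example.

```python
import random

def run_epsilon_greedy(true_means, epsilon=0.1, steps=1000, seed=0):
    """Run epsilon-greedy on a Gaussian bandit; return cumulative regret per step.

    All names and defaults here are illustrative assumptions.
    """
    rng = random.Random(seed)
    k = len(true_means)
    estimates = [0.0] * k   # sample-average value estimate per arm
    counts = [0] * k        # number of pulls per arm
    best_mean = max(true_means)
    regret, cumulative = 0.0, []
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(k)                            # explore
        else:
            arm = max(range(k), key=lambda a: estimates[a])   # exploit
        reward = rng.gauss(true_means[arm], 1.0)              # assumed unit-variance noise
        counts[arm] += 1
        # Incremental update of the sample average for the pulled arm.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        # Expected (pseudo-)regret: gap between the best arm's mean and the chosen arm's mean.
        regret += best_mean - true_means[arm]
        cumulative.append(regret)
    return cumulative

curve = run_epsilon_greedy([0.1, 0.5, 0.9], epsilon=0.1)
```

Plotting `curve` for several values of `epsilon` gives one simple way to compare hyperparameter settings; UCB and Thompson sampling would differ only in how `arm` is selected.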

Assignment 2

Implemented and compared the performance of SARSA and expected SARSA on the Frozen Lake domain from OpenAI Gym; implemented and compared the performance of Q-learning and actor-critic with linear function approximation on the cart-pole problem.
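
The difference between the two SARSA variants comes down to the bootstrap target. The sketch below, which assumes an epsilon-greedy behaviour policy and uses hypothetical helper names, shows both targets side by side: SARSA bootstraps from the next action actually sampled, while expected SARSA averages over the policy's action probabilities.

```python
def epsilon_greedy_probs(q_values, epsilon):
    """Action probabilities of an epsilon-greedy policy (ties go to the first maximum)."""
    n = len(q_values)
    probs = [epsilon / n] * n
    best = max(range(n), key=lambda a: q_values[a])
    probs[best] += 1.0 - epsilon
    return probs

def sarsa_target(reward, next_q, next_action, gamma):
    """SARSA: bootstrap from the value of the sampled next action."""
    return reward + gamma * next_q[next_action]

def expected_sarsa_target(reward, next_q, epsilon, gamma):
    """Expected SARSA: bootstrap from the policy-weighted average of next-state values."""
    probs = epsilon_greedy_probs(next_q, epsilon)
    return reward + gamma * sum(p * q for p, q in zip(probs, next_q))

# Illustrative values for a state with four actions, as in Frozen Lake.
next_q = [1.0, 2.0, 0.0, 1.0]
t_sarsa = sarsa_target(0.0, next_q, next_action=0, gamma=0.9)
t_expected = expected_sarsa_target(0.0, next_q, epsilon=0.2, gamma=0.9)
```

In either case the update would be `Q[s][a] += alpha * (target - Q[s][a])`. The expected SARSA target does not depend on which next action happened to be sampled, which removes that source of variance from the update.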

Assignment 3

Experimented with offline RL: ran the Q-learning agent from Assignment 2 (the expert) and a random agent on the cart-pole problem and gathered 500 behavioral episodes from each; trained an imitation learning agent and a fitted Q-learning agent on each dataset and compared the results.
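
To make the fitted Q-learning idea concrete, here is a minimal sketch of fitted Q iteration on a fixed batch of transitions. It is a simplification, not the notebook's method: it assumes small discrete states and uses a per-(state, action) average as the "regressor", whereas cart-pole would need a function approximator over continuous observations. The function name and the toy dataset are hypothetical.

```python
from collections import defaultdict

def fitted_q_iteration(transitions, n_actions, gamma=0.99, iterations=50):
    """Fitted Q iteration over a fixed dataset of (s, a, r, s_next, done) tuples.

    Tabular stand-in: the regression step is the mean target per (state, action).
    Pairs never seen in the dataset keep a default value of 0.
    """
    q = defaultdict(float)
    for _ in range(iterations):
        targets = defaultdict(list)
        for s, a, r, s_next, done in transitions:
            # Bootstrap from the frozen Q of the previous iteration; no bootstrap at terminals.
            bootstrap = 0.0 if done else max(q[(s_next, b)] for b in range(n_actions))
            targets[(s, a)].append(r + gamma * bootstrap)
        # "Fit" the new Q to the targets (here: average them per state-action pair).
        new_q = defaultdict(float)
        for key, values in targets.items():
            new_q[key] = sum(values) / len(values)
        q = new_q
    return q

# Toy two-state dataset: action 1 moves 0 -> 1 and then terminates with reward 1.
dataset = [
    (0, 1, 0.0, 1, False),
    (1, 1, 1.0, 1, True),
    (0, 0, 0.0, 0, False),
]
q = fitted_q_iteration(dataset, n_actions=2, gamma=0.9)
```

No environment interaction happens inside the loop, which is the defining constraint of the offline setting: the quality of the learned values depends entirely on what the behavioral dataset covers.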

ycYiwei/COMP-579-Reinforcement-Learning
