Easy21

Assignment from David Silver's Reinforcement Learning course. Coded for clarity, not efficiency.

Requires Torch7 with the Moses package.

Run monte-carlo.lua first to generate Q* and the plot of V (below), then sarsa-lambda.lua and lin-func-approx.lua to generate their plots.
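
The run order above could be scripted roughly as follows. This is a minimal sketch, not part of the repository: it assumes Torch7's `th` launcher is on the PATH, that Moses has already been installed (typically with `luarocks install moses`), and that the linear function approximation script is named lin-func-approx.lua. The function name `run_easy21` is hypothetical.

```sh
# Hypothetical helper: run the three scripts in the order the README gives.
# Assumes the Torch7 launcher `th` is available and Moses is installed.
run_easy21() {
  for script in monte-carlo.lua sarsa-lambda.lua lin-func-approx.lua; do
    # monte-carlo.lua must come first because it generates Q*;
    # stop at the first script that fails.
    th "$script" || return 1
  done
}
```

Calling `run_easy21` from the repository root runs the scripts one after another and stops at the first failure.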

Includes an additional method without value functions - policy-gradient.lua - that uses a simple neural network.

