Reinforcement Learning Course

First Class:
Handouts:
1. course description (postscript,html)
2. template for scribe notes (postscript, latex, html)
and explanation about latex (postscript, latex, html)
3. Slides of first class (power point,postscript,html).

Second Class:
Finished the overview.

Third and Fourth Class:
Model of Markov decision Processes (MDP) and
Finite Horizon Problems.

Fifth and Six Class:
Infinite Horizon Discounted Problems.

Seven, Eight and Nine Class:
Learning with unknown model.

Lecture 7 (postscript, latex, html)    Monte-carlo Algorithms
Lecture 8 (postscript, latex, html)    Temporal Diffrence (TD) Algorithms
Lecture 9 (postscript, latex, html)    Q-Learning (and SARSA) Algorithms

Lecture Ten and Eleven:
Learning with large state space.

Lecture 10 (postscript, latex, html) TD-Gammon

Lecture 11 (postscript, latex, html) Large state space

Lecture Twelve:
Partially Observable MDP.

Lecture Thirteen:
Generator model and sparse sampling in Large MDPs.