| Authored By: | Satinder Singh, Tommi Jaakkola, Michael L. Littman and Csaba Szepesvári |
| Paper Title: | Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms |
| In: | Machine Learning |
| Number 3 Vol. 38 | |
| Publisher: | Kluwer Academic Publishers, Boston |
| Publication Date: | 2000 |
| Pages: | 287-308 |