 | Jaakkola, Tommi |
 |
 | Jaakkola, Tommi (Satinder Singh, Tommi Jaakkola, Michael L. Littman and Csaba Szepesvári) -- Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms - 2000 |
 | Jaakkola, Tommi (Satinder P. Singh, Tommi Jaakkola and Michael I. Jordan) -- Learning without state-estimation in partially observable Markovian decision processes - 1994 |
|