 | Mahadevan, Sridhar |
 |
 | Mahadevan, Sridhar (Sridhar Mahadevan, Nicholas Marchalleck, Tapas K. Das and Abhijit Gosavi) -- Self-improving factory simulation using continuous-time average-reward reinforcement learning - 1997 |
 | Mahadevan, Sridhar (Sridhar Mahadevan) -- Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results - 1996 |
 | Mahadevan, Sridhar (Sridhar Mahadevan and Prasad Tadepalli) -- Quantifying Prior Determination Knowledge Using the PAC Learning Model - 1994 |
 | Mahadevan, Sridhar (Sridhar Mahadevan) -- To discount or not to discount in reinforcement learning: a case study comparing R learning and Q learning - 1994 |
 | Mahadevan, Sridhar (Sridhar Mahadevan) -- Sensitive discount optimality: unifying discounted and average reward reinforcement learning - 1996 |
 | Mahadevan, Sridhar (Mohammad Ghavamzadeh and Sridhar Mahadevan) -- Continuous-Time Hierarchial Reinforcement Learning - 2001 |
 | Mahadevan, Sridhar (Gang Wang and Sridhar Mahadevan) -- Hierarchical optimization of policy-coupled semi-Markov decision processes - 1999 |
|