 Mahadevan, Sridhar 

 Mahadevan, Sridhar (Sridhar Mahadevan, Nicholas Marchalleck, Tapas K. Das and Abhijit Gosavi)  Selfimproving factory simulation using continuoustime averagereward reinforcement learning  1997 
 Mahadevan, Sridhar (Sridhar Mahadevan)  Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results  1996 
 Mahadevan, Sridhar (Sridhar Mahadevan and Prasad Tadepalli)  Quantifying Prior Determination Knowledge Using the PAC Learning Model  1994 
 Mahadevan, Sridhar (Sridhar Mahadevan)  To discount or not to discount in reinforcement learning: a case study comparing R learning and Q learning  1994 
 Mahadevan, Sridhar (Sridhar Mahadevan)  Sensitive discount optimality: unifying discounted and average reward reinforcement learning  1996 
 Mahadevan, Sridhar (Mohammad Ghavamzadeh and Sridhar Mahadevan)  ContinuousTime Hierarchial Reinforcement Learning  2001 
 Mahadevan, Sridhar (Gang Wang and Sridhar Mahadevan)  Hierarchical optimization of policycoupled semiMarkov decision processes  1999 
