Andrew Ng
Papers
- Policy Search by Dynamic Programming (2003)
  Dynamic programming techniques can make direct policy search computationally and sample efficient.
- Solving Uncertain Markov Decision Problems (2001)
  Finding good policies in uncertain models.
- Applying Online Search Techniques to Continuous-State Reinforcement Learning (1998)
  Using a specialized version of A-star to boost the performance of approximate value functions.