CSTalks - On machine learning - 2 Mar
Learning From Demonstration Atkeson and Schaal Dang, RLAB Feb 28th, 2007.
7. Experiments 6. Theoretical Guarantees Let the local policy improvement algorithm be policy gradient. Notes: These assumptions are insufficient to give.
Using Inaccurate Models in Reinforcement Learning