Train Your Dog
Batch mode reinforcement learning based on the synthesis of artificial trajectories
Value and Planning in MDPs. Administrivia Reading 3 assigned today Mahdevan, S., “Representation Policy Iteration”. In Proc. of 21st Conference on Uncertainty.
Learning from how dogs learn Prof. Bruce Blumberg The Media Lab, MIT [email protected] Prof. Bruce Blumberg The Media Lab, MIT.