Naureen Karachiwalla, University of Oxford; Albert Park, HKUST.
International Climate Change Agreements: An Overview. Ann Chou, April 14, 2010. Professor Nordhaus, ECON 331b.
Reinforcement Learning. Basic idea: receive feedback in the form of rewards; the agent's utility is defined by the reward function; the agent must learn to act.
91.420/543: Artificial Intelligence UMass Lowell CS – Fall 2010
Reinforcement Learning
CS 188: Artificial Intelligence Spring 2007