Reinforcement Learning : A Beginners Tutorial
Between MDPs and Semi-MDPs: Learning, Planning and Representing Knowledge at Multiple Temporal Scales Richard S. Sutton Doina Precup University of Massachusetts.
Richard S. Sutton Doina Precup University of Massachusetts Satinder Singh University of Colorado
Artificial Intelligence