Reinforcement Learning. Overview Introduction Q-learning Exploration Exploitation Evaluating RL algorithms On-Policy learning: SARSA Model-based.