Lecture 1 - What is Reinforcement Learning? / N-arm Bandit Problems