Reinforcement Learning. Instructor: Prof. Balaraman Ravindran, Department of Computer Science and Engineering, IIT Madras. Reinforcement learning is a paradigm that aims to model the trial-and-error learning process that is needed in many problem situations where explicit instructive signals are not available. It has roots in operations research, behavioral psychology and AI. The goal of the course is to introduce the basic mathematical foundations of reinforcement learning, as well as highlight some of the recent directions of research. (from nptel.ac.in)
|Lecture 07 - Introduction to Immediate RL|
In this lecture we discuss a sub-problem of reinforcement learning known as immediate RL or multi-arm bandits. We talk about the exploration-exploitation dilemma in the context of bandit problems and also discuss some practical issues with regards to sampling which will be useful in simulating bandit problems.
Go to the Course Home or watch other lectures: