InfoCoBuild

A Short Course on Reinforcement Learning

A Short Course on Reinforcement Learning by Satinder Singh Baveja - Machine Learning Summer School at Purdue, 2011. This short course will be a three-part tutorial on reinforcement learning (RL) interpreted broadly to include related methods from decision theoretic planning, optimal control and operations research.

The first part of the course will cover the basics of RL. Topics covered will include Bandit problems and algorithms for solving them, Markov decision problems (MDPs) and foundational algorithms for solving them in planning and learning settings, as well as Partially observable MDPs (POMDPs) and foundational algorithms for solving them in the planning and learning settings. The second part of the course will cover advanced methods for solving MDPs and POMDPs, the use of function approximation in RL, a case study of a couple of applications of RL, and narrower topics such as inverse RL and apprenticeship learning. Time permitting I might cover some results from RL in multiagent settings. The third part of the course will cover cutting-edge topics including approaches to state estimation such as predictive state representations (PSRs), the use and learning of structured probabilistic models in controlled dynamical systems, and the recently defined optimal reward problem. I will conclude with some open challenge problems in RL.

Lecture 1 - What is Reinforcement Learning? / N-arm Bandit Problems
Lecture 2 - Small MDPs: Planning, Model-Free Learning
Lecture 3 - Small MDPs: Model-Free Learning, Model-Based Learning
Lecture 4 - Between MDPs and Semi-MDPs
Lecture 5 - On the Optimal Reward Problem
Lecture 6 - Predictive State Representations


Machine Learning Summer School at Purdue, 2011
A Machine Learning Approach for Complex Information Retrieval Applications
A Short Course on Reinforcement Learning
Classic and Modern Data Clustering
Divide and Recombine for the Analysis of Big Data
Graphical Models for the Internet
Introduction to Machine Learning
Large-Scale Machine Learning and Stochastic Algorithms
Machine Learning for a Rainy Day
Machine Learning for Discovery in Legal Cases
Machine Learning for Statistical Genetics
Mining Heterogeneous Information Networks
Modeling Complex Social Networks
Optimization for Machine Learning
Privacy Issues with Machine Learning: Fears, Facts, and Opportunities
Survey of Boosting from an Optimization Perspective
The MASH Project