The goal of this course is to provide an introduction to the theory of Markov chains, with an emphasis on their applications to algorithms, learning and control. The main tool we will focus on is the mixing properties of finite-state, discrete-time, reversible Markov chains, i.e., how “fast” a given chain converges to its stationary distribution from any starting state. This may seem like a narrow question; however, it is central to the analysis of Markov Chain Monte Carlo, with connections to classical questions in statistical physics, numerical integration, randomized algorithms and combinatorial optimization, and increasingly, applications in distributed computing, large-scale graph mining, reinforcement learning and high-dimensional optimization.
Lectures: MF 8.40am-11.15am, Phillips 213
Instructor: Sid Banerjee, 229 Rhodes Hall, email
Scribing: Each student should scribe 1-2 lectures (depending on class size), in groups of 2.
There are two excellent sources for a lot of the techniques we will cover
The first is a remarkable unfinished monograph, which has been online since the 1990s, and inspired a lot of the work in this field. The second is a more recent book which gives a nice and accessible introduction to many of the topics we will study.
Set 0: Wilson’s algorithm for sampling spanning trees (Demo)
Set 1: Markov chains, ergodic theorem and convergence [notes]
Set 2: Coupling and mixing times [notes]
Set 3: Convergence of random variables [notes]
Set 4: Path coupling [notes]
Set 5: Perfect sampling [notes]
Set 6: Intro to spectral techniques [notes]
Set 7: Cheeger’s inequality and Jerrum-Sinclair [notes]
Set 8: Canonical flows [notes]
Set 9: Mixing and approximate counting [notes]