Schedule
The schedule is tentative and subject to change. We will have guest lectures and may adjust the schedule to accommodate them.
| Date | Topic | Files and Reading |
|---|---|---|
| Mar 26 | Course overview | Slides, Book Ch. 1 |
| Mar 28 | Intro to Markov Decision Processes | Slides, Book Ch. 2,3 |
| Apr 2 | Planning with a known model in the tabular case | Slides, Book Ch. 4 |
| Apr 4 | Policy Iteration (cont'd) and MuJoCo setup | Slides, Book Ch. 4 |
| Apr 9 | Policy gradient methods - I | Slides, policy gradient |
| Apr 11 | Policy gradient methods - II | Slides, policy gradient |
| Apr 16 | Off-policy learning - I | |
| Apr 18 | Off-policy learning - II | |
| Apr 23 | Imitation learning | |
| Apr 25 | MCTS and UC Trees | Slides |
| Apr 30 | Trajectory Optimization | Slides |
| May 2 | Combining Trajectories and Policies | Slides |
| May 7 | Guest lecture - Prof. Emanuel Todorov (UW) | Slides |
| May 9 | Guest lecture - Dr. Igor Mordatch (OpenAI) | |
| May 14 | Guest lecture - Dr. Vikash Kumar (Google Brain) | Slides |
| May 16 | No class | |
| May 21 | Learning to learn, meta learning | |
| May 23 | General duality between control & inference, compositionality in LMDPs | |
| May 28 | No class | |
| May 30 | Hierarchical RL | Slides |