
Note: This is the 2024–2025 eCalendar. Current program and course information is now found in the ÐÇ¿ÕÕæÈË Course Catalogue at .
Note: This is the 2024–2025 eCalendar. Current program and course information is now found in the ÐÇ¿ÕÕæÈË Course Catalogue at .
Computer Science (Sci) : Bandit algorithms, finite Markov decision processes, dynamic programming, Monte-Carlo Methods, temporal-difference learning, bootstrapping, planning, approximation methods, on versus off policy learning, policy gradient methods temporal abstraction and inverse reinforcement learning.
Terms: Winter 2025
Instructors: Precup, Doina; Prémont-Schwarz, Isabeau (Winter)