COM SCI XLC 260R

CS 260R: Reinforcement Learning explores how autonomous agents learn to make decisions by maximizing reward in complex, uncertain environments. Topics include Markov decision processes, model-free and model-based RL, policy optimization, distributed RL systems, and applications such as game playing (e.g., AlphaGo), traffic simulation, autonomous driving, multi-agent RL, human-in-the-loop methods, and imitation learning.

View Course Options

Duration

As few as 10 weeks

Units

4.0

Cost

Starting at $4,700.00

Get More Info

About This Course

CS 260R. Reinforcement Learning. (Instructor: Zhou, B.) Lecture, four hours; discussion, two hours; outside study, six hours. Fundamentals and advanced topics of reinforcement learning (RL), computational learning approach where agent tries to maximize total amount of reward it receives while interacting with complex and uncertain environments. Includes introduction of Markov decision processes, model-free RL and model-based RL methods, policy optimization, RL distributed system design, as well as case studies of RL in game playing such as AlphaGo, traffic simulation, autonomous driving, and other machine autonomy applications. Advanced topics of RL such as multi-agent RL, human-in-loop method, and imitation learning. Letter grading.

Spring 2026 Schedule

Date

Details

Format

Invested In Your Success

Have questions or need guidance?

Visit the Help Center

Reinforcement Learning