Computer Science Department

DS551/CS525 - Reinforcement Learning - Fall 2024

Version: June 24th, 2024

Home	Class Info	Schedule	Projects
Grading	Reviews	Presentation	Resources

Tentative Schedule:

Slides will be uploaded on Canvas before each lecture.

-1. Week 1 (8/27 T):

Topic:

Readings:

-2. Week 2 (9/3 T):

Topic:

Optional readings:

Reinforcement Learning: An Introduction, Sutton and Barto, 2nd Edition

Note: Project 1 starts.

-3. Week 3 (9/10 T):

Topic:

Optional readings:

Reinforcement Learning: An Introduction, Sutton and Barto, 2nd Edition

-4. Week 4 (9/17 T):

Topic:

Note: Quiz 1 on Markov Decision Process and Model-based Control (30mins).

Optional readings:

Reinforcement Learning: An Introduction, Sutton and Barto, 2nd Edition

Note: Project 1 due.

-5. Week 5 (9/24 T):

Topic:

Optional readings:

Reinforcement Learning: An Introduction, Sutton and Barto, 2nd Edition

Note: Project 2 starts.

-6. Week 6 (10/1 T):

Topic:

Optional readings:

Playing Atari with Deep Reinforcement Learning

Note: Quiz 2 on Model-free Policy Evaluation.

-7. Week 7 (10/8 T):

Topic:

Optional Reading #1:

Optional Reading #2:

Optional Reading #3:

Optional Reading #4:

Note: Quiz 3 on Model-free Control.

Note: Project 2 due.

Note: Project 3 starts.

-8. Week 8 (10/15 T): No Class; Fall Break

Fall Break

-9. Week 9 (10/22 T): .

Topic:

Note: Quiz 4 on linear function approximation for policy evaluation and Control.

Note: We will have an inclass selfintroduction session, so you can start forming a team for project 4.

-10. Week 10 (10/29 T):

Topic:

Optional readings:

Reinforcement Learning: An Introduction, Sutton and Barto, 2nd Edition

optional Reading:

Policy Gradient RL algorithms (a good and comprehensive blog)

Note: Project 4 starts.

Note: Project 3 due.

-11. Week 11 (11/5 T): No Class!

this link

Note: Project 4 Proposal due.

-12. Week 12 (11/12 T):

Topic:

Optional Reading #1:

Optional Reading #2:

Optional Reading #3:

[Actor-critic RL algorithms]

Optional Reading #2:

-13. Week 13 (11/19 T):

Topic:

Optional Readings:

DDPG

MA-DDPG

AlphaTensor

Note: Quiz 5 on policy gradient (including Basic PG, REINFORCE PG, and Vanilla PG).

-14. Week 14 (11/26 T):

Topic:

Optional Readings:

GAIL

MA-GAIL

Optional Readings: A Beginner's Guide to Generative Adversarial Networks (GANs) (link).
Optional Readings: Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S.,Bengio, Y. (2014). Generative adversarial nets. In Advances in neural information processing systems (pp. 2672-2680). (paper)
Note: Project #4 Progressive Report is due. Please submit it to Canvas discussion board in teams.

-15. Week 15 (12/3 T):
Topic:Meta-RL, and Class Review..
Optional Readings: Meta-RL.

-16. Week 16 (12/10 T):
Topic: Project #4 Presentations.
Note: Project 4 due.

--> To be updated.

yli15 at wpi.edu