WPI Worcester Polytechnic Institute

Computer Science Department
------------------------------------------

DS551/CS525 - Reinforcement Learning - Fall 2024

Version: June 24th, 2024

------------------------------------------

Home Class Info Schedule Projects
Grading Reviews Presentation Resources

------------------------------------------

Tentative Schedule:

Slides will be uploaded on Canvas before each lecture.

-1. Week 1 (8/27 T):

    Topic: Overview of Reinforcement Learning and Class Logistics
    Readings: N/A

-2. Week 2 (9/3 T):

-3. Week 3 (9/10 T):

-4. Week 4 (9/17 T):

-5. Week 5 (9/24 T):

-6. Week 6 (10/1 T):

    Topic: Review of Deep Learning.
    Topic: Deep Reinforcement Learning.
    Optional readings: Mnih, Volodymyr, et al., Playing Atari with Deep Reinforcement Learning, arXiv preprint arXiv:1312.5602 (2013).
    Note: Quiz 2 on Model-free Policy Evaluation.

-7. Week 7 (10/8 T):

    Topic: Advanced Deep Reinforcement Learning by Prof Li, and Deep Learning Implementation in Pytorch (by TA).
    Optional Reading #1: [AAAI 2016, Double DQN] Deep Reinforcement Learning with Double Q-learning, Hado van Hasselt and Arthur Guez and David Silver Google DeepMind https://arxiv.org/pdf/1509.06461.pdf.
    Optional Reading #2: [ICLR 2016] PRIORITIZED EXPERIENCE REPLAY, Tom Schaul, John Quan, Ioannis Antonoglou and David Silver Google DeepMind https://arxiv.org/pdf/1511.05952.pdf.
    Optional Reading #3: [ICML 2016, Dueling DQN] Dueling Network Architectures for Deep Reinforcement Learning, Ziyu Wang, Tom Schaul, Matteo Hessel, Hado van Hasselt, Marc Lanctot, Nando de Freitas https://arxiv.org/pdf/1511.06581.pdf.
    Optional Reading #4: [AAAI 2018, Rainbow] Rainbow: Combining Improvements in Deep Reinforcement Learning, Matteo Hessel, Joseph Modayil, Hado van Hasselt, Tom Schaul, Georg Ostrovski, Will Dabney, Dan Horgan, Bilal Piot, Mohammad Azar, David Silver, AAAI 2018, https://arxiv.org/pdf/1710.02298.pdf.
    Note: Quiz 3 on Model-free Control.
    Note: Project 2 due.
    Note: Project 3 starts.

-8. Week 8 (10/15 T): No Class; Fall Break

-9. Week 9 (10/22 T): .

    Topic: Advanced DQNs (Continued) and Inverse Reinforcement Learning and Imitation learning..
    Note: Quiz 4 on linear function approximation for policy evaluation and Control.
    Note: We will have an inclass selfintroduction session, so you can start forming a team for project 4.

-10. Week 10 (10/29 T):

-11. Week 11 (11/5 T): No Class!

    Wellness day. No Class. See this link.
    Note: Project 4 Proposal due.

-12. Week 12 (11/12 T):
    Topic: Policy Gradient RL (continued) (See the slides from last week.)
    Topic: Advanced Policy Gradient (PPO, TRPO, PPO2) (continued), Actor-Critic Approaches (A2C, A3C, Pathwise Derivative PG), Sparse Reward, Hierarchical RL..
    Optional Reading #1: [TRPO] https://arxiv.org/pdf/1502.05477.pdf
    Optional Reading #2: [PPO] https://arxiv.org/pdf/1707.06347.pdf
    Optional Reading #3: [Actor-critic RL algorithms]
    Optional Reading #2: [DDPG] https://spinningup.openai.com/en/latest/algorithms/ddpg.html

-13. Week 13 (11/19 T):

    Topic: Advanced RL techniques (Sparse Reward, Hierarchical RL), Multi-agent RL, DeepMind AlphaTensor..
    Optional Readings: DDPG, MA-DDPG, AlphaTensor.
    Note: Quiz 5 on policy gradient (including Basic PG, REINFORCE PG, and Vanilla PG).

-14. Week 14 (11/26 T):

    Topic: DeepMind AlphaTensor (Continued), Review of Generative Adversarial Networks (GANs), and Deep Inverse Reinforcement Learning (IRL), Multi-Agent IRL.
    Optional Readings: GAIL, MA-GAIL.
    Optional Readings: A Beginner's Guide to Generative Adversarial Networks (GANs) (link).
    Optional Readings: Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S.,Bengio, Y. (2014). Generative adversarial nets. In Advances in neural information processing systems (pp. 2672-2680). (paper)
    Note: Project #4 Progressive Report is due. Please submit it to Canvas discussion board in teams.

-15. Week 15 (12/3 T):

    Topic:Meta-RL, and Class Review..
    Optional Readings: Meta-RL.

-16. Week 16 (12/10 T):

    Topic: Project #4 Presentations.
    Note: Project 4 due.

--> To be updated.



yli15 at wpi.edu