Reinforcement Learning: ARENA
I spent some time as both a student and a teaching assistant at the second edition of ARENA, an AI Alignment upskilling workshop held in London in June 2023.
I gave some talks on Reinforcement Learning, covering the basics (bandits, MDPs, tabular learning, SARSA, Q-Learning) and exploring some other RL methods such as DQN and VPG. Here are the slides (handout and presentation versions) and the corresponding TeX source.
NB: don’t trust the PPO material. I didn’t end up using it; it’s included for posterity and will likely change in future iterations.