ARENA learning experience
I found ARENA quite helpful for self-study AI safety related topics and it can work well in together with Deep Learning Curriculum. It offers colab choice for the exercise part, which is great since I don’t have much GPU support for my own computer.
Here is how I spent my time on various topics in ARENA and hope it can work as a reference for someone also interested in self-studying this material.
Total Hours: 42.7. Split into the following:
- 8.7h Chapter 0: Fundamentals
- skipped exercise 0.1 Ray Tracing since I found it too advanced to be necessary.
- 2.3h exercise 0.2 CNN. I learnt about
torch.as_strided
related stuff. - 2.4h exercise 0.3 Resnet. I learnt a more detailed view about resnet, pytorch_lighting, batchnorm.
- 2.0h exercise 0.4 Optimization. I learnt about details of various optimizer,
Weight & Bias
related usage. - 2.0h exercise 0.5 Backprop (skipped part 3 & 4 & 5). I learnt about details of back propagation & Autograd.
- 8.0h Chapter 1: Transformers & Mech Interp
- 3.0h exercise 1.1 transformer (I skipped part of sampling.). I learnt about details of transformer, sampling, training and inference sampling.
- 5.0h exercise 1.2 mechanistic interpretability. I learnt about induction circuits, transformerLens, induction heads, hooks, reverse-engineering induction circuits. These material opened a new view for me about how to understand LLM. I find some part hard to understand though and skipped some of the exercise as I don’t want to spend too much time on this topic for now.
- 16.0h Chapter 2: Reinforcement Learning
- 2.5h exercise 2.1 Introduction to RL. it works like a memory refresher about some RL concepts. It is nice to check the detail of some RL environments.
- 7.0h exercise 2.2 Deep Q Learning
- 4.0h exercise 2.3 PPO. The Atari Breakout game result is shown here.
- 2.5h exercise 2.4 RLHF.
- 10.0h Chapter 3 Training at Scale
- I spent most of my time on doing the exercise in Data Parallelism part and skipped most of the exercise in other sections.