Blog Posts
(Things I Wish I Knew About) Fine-Tuning
Lessons from getting my hands dirty fine-tuning GPT-2
Choice of Loss in Proximal Policy Optimization
Why does PPO use a different surrogate loss from VPG, and does it matter?
An Invisible Disability
An essay about stuttering
