aaball94
-
(Things I Wish I Knew About) Fine-Tuning
Lessons from getting my hands dirty fine-tuning GPT-2
-
Choice of Loss in Proximal Policy Optimization
Why does PPO use a different surrogate loss from VPG, and does it matter?
-
An Invisible Disability
An essay about stuttering