Louis recently has been founding a new startup focused on synthetic data for alignment,
Synth Labs, and is a researcher at Eleuether AI. This interview should speak for itself, and it’ll need re-listens, even for myself.
The list of topics we cover touches on pretty much every major and minor issue facing model fine-tuning. Please reach out or comment if there’s a paper we mention that I didn’t link before. Happy to dig it up for you. This post is very technical. If you’re having a hard time with it, I suggest you listen to my
RLHF 201 post on
Latent Space first.
Full transcript available here: https://www.interconnects.ai/p/rlhf-interview-1-louis