Episodes

Latest Episode
Llama 3.1 405b, Meta's AI strategy, and the new open frontier model ecosystem

Llama 3.1 405b, Meta's AI strategy, and the new open frontier model ecosystem

Episode 44 · · 15:22

Defining the future of the AI economy and regulation. Is Meta's AI play equivalent to the Unix stack for open-source software?This is AI generated audio with Python and 11Labs.Source...

SB 1047, AI regulation, and unlikely allies for open models

SB 1047, AI regulation, and unlikely allies for open models

Episode 43 · · 14:20

SB 1047, AI regulation, and unlikely allies for open modelsThe rallying of the open-source community against CA SB 1047 can represent a turning point for AI regulation.This is AI gen...

Switched to Claude 3.5

Switched to Claude 3.5

Episode 42 · · 06:40

I Switched to Claude 3.5Speculations on the role of RLHF and why I love the model for people who pay attention.This is AI generated audio with Python and 11Labs.Source code: https://...

Interviewing Dean Ball on AI policy

Interviewing Dean Ball on AI policy

Episode 41 · · 56:31

I’m really excited to resume the Interconnects Interviews with Dean W. Ball from the Hyperdimensional Substack. We cover the whole stack of recent happenings in AI policy, focusing o...

RLHF Roundup: Trying to get good at PPO, charting RLHF's impact, RewardBench retrospective, and a reward model competition

RLHF Roundup: Trying to get good at PPO, charting RLHF's impact, RewardBench retrospective, and a reward model competition

Episode 40 · · 11:52

Things to be aware of if you work on language model fine-tuning.This is AI generated audio with Python and 11Labs.Source code: https://github.com/natolambert/interconnects-toolsOrigi...

Frontiers in synthetic data

Frontiers in synthetic data

Episode 39 · · 11:27

Synthetic data is known to be a super powerful tool for every level of the language modeling stack. It's documented as being used for expanding vanilla pretraining data and creating ...

Text-to-video AI is already abundant

Text-to-video AI is already abundant

Episode 38 · · 08:18

Signs point to a general-use Sora-like model coming very soon, maybe even with open-weights.This is AI generated audio with Python and 11Labs.Source code: https://github.com/natolamb...

AI for the rest of us

AI for the rest of us

Episode 37 · · 12:35

Apple Intelligence makes a lot of sense when you get out of the AI bubble.This is AI generated audio with Python and 11Labs.Source code: https://github.com/natolambert/interconnects-...

A realistic path to robotic foundation models

A realistic path to robotic foundation models

Episode 36 · · 07:49

A realistic path to robotic foundation modelsNot "agents" and not "AGI." Some thoughts and excitement after revisiting the industry thanks to Physical Intelligence founders Sergey Le...

We aren't running out of training data, we are running out of open training data

We aren't running out of training data, we are running out of open training data

Episode 35 · · 08:29

Data licensing deals, scaling, human inputs, and repeating trends in open vs. closed.This is AI generated audio with Python and 11Labs.Source code: https://github.com/natolambert/int...

Name, image, and AI's likeness

Name, image, and AI's likeness

Episode 34 · · 09:03

Celebrity's power will only grow in the era of infinite content.This is AI generated audio with Python and 11Labs.Source code: https://github.com/natolambert/interconnects-toolsOrigi...

OpenAI chases Her

OpenAI chases Her

Episode 33 · · 12:28

ChatGPT leaves the textbox, and Google is building the same, and more, as practical tools.This is AI generated audio with Python and 11Labs.Source code: https://github.com/natolamber...

OpenAI's Model (behavior) Spec, RLHF transparency, and personalization questions

OpenAI's Model (behavior) Spec, RLHF transparency, and personalization questions

Episode 32 · · 14:05

Now we will have some grounding for when weird ChatGPT behaviors are intended or side-effects -- shrinking the Overton window of RLHF bugs.This is AI generated audio with Python and ...

RLHF: A thin line between useful and lobotomized

RLHF: A thin line between useful and lobotomized

Episode 31 · · 13:08

Many, many signs of life for preference fine-tuning beyond spoofing chat evaluation tools.This is AI generated audio with Python and 11Labs.Source code: https://github.com/natolamber...

Phi 3 and Arctic: Outlier LMs are hints

Phi 3 and Arctic: Outlier LMs are hints

Episode 30 · · 09:46

Models that seem totally out of scope from recent open LLMs give us a sneak peek of where the industry will be in 6 to 18 months.This is AI generated audio with Python and 11Labs.Sou...

AGI is what you want it to be

AGI is what you want it to be

Episode 29 · · 10:38

Certain definitions of AGI are backing people into a pseudo-religious corner.This is AI generated audio with Python and 11Labs.Source code: https://github.com/natolambert/interconnec...

Llama 3: Scaling open LLMs to AGI

Llama 3: Scaling open LLMs to AGI

Episode 28 · · 15:05

Meta shows that scaling won't be a limit for open LLM players in the near future.This is AI generated audio with Python and 11Labs.Source code: https://github.com/natolambert/interco...

Stop "reinventing" everything to "solve" alignment

Stop "reinventing" everything to "solve" alignment

Episode 27 · · 07:32

Integrating some non computing science into reinforcement learning from human feedback can give us the models we want.This is AI generated audio with Python and 11Labs.Source code: h...

The end of the "best open LLM"

The end of the "best open LLM"

Episode 26 · · 06:45

Modeling the compute versus performance tradeoff of many open LLMs.This is AI generated audio with Python and 11Labs.Source code: https://github.com/natolambert/interconnects-toolsOr...

Why we disagree on what open-source AI should be

Why we disagree on what open-source AI should be

Episode 25 · · 08:57

Last minute title change from: The tech industry can't agree on what open-source AI means. That's the process.How to read what multiple people mean by the word openness and see throu...

DBRX: The new best open LLM and Databricks' ML strategy

DBRX: The new best open LLM and Databricks' ML strategy

Episode 24 · · 16:33

Databricks' new model is surpassing the performance of Mixtral and Llama 2 while still being in a size category that's reasonably accessible.This is AI generated audio with Python an...

Evaluations: Trust, performance, and price (bonus, announcing RewardBench)

Evaluations: Trust, performance, and price (bonus, announcing RewardBench)

Episode 23 · · 12:40

Evaluation is not only getting harder with modern LLMs, it's getting harder because it means something different.This is AI generated audio with Python and 11Labs. Music generated by...

Model commoditization and product moats

Model commoditization and product moats

Episode 22 · · 10:56

Where moats are tested now that so many people have trained GPT4 class models. Claude 3, Gemini 1.5, Inflection 2.5, and Mistral Large are here to party.This is AI generated audio wi...

The koan of an open-source LLM

The koan of an open-source LLM

Episode 21 · · 23:06

A proposal for a new definition of an "open source" LLM and why no definition will ever just work.This is AI generated audio with Python and 11Labs. Music generated by Meta's MusicGe...

Interviewing Louis Castricato of Synth Labs and Eleuther AI on RLHF, Gemini Drama, DPO, founding Carper AI, preference data, reward models, and everything in between

Interviewing Louis Castricato of Synth Labs and Eleuther AI on RLHF, Gemini Drama, DPO, founding Carper AI, preference data, reward models, and everything in between

Episode 20 · · 01:26:28

Louis recently has been founding a new startup focused on synthetic data for alignment, Synth Labs, and is a researcher at Eleuether AI. This interview should speak for itself, and i...

How to cultivate a high-signal AI feed

How to cultivate a high-signal AI feed

Episode 19 · · 10:46

Basic tips on how to assess inbound ML content and cultivate your news feed.This is AI generated audio with Python and 11Labs. Music generated by Meta's MusicGen.Source code: https:/...

Google ships it: Gemma open LLMs and Gemini backlash

Google ships it: Gemma open LLMs and Gemini backlash

Episode 18 · · 17:17

Google rejoins the open model party and gets some backlash for a frequent problem for generative AI.This is AI generated audio with Python and 11Labs. Music generated by Meta's Music...

10 Sora and Gemini 1.5 follow-ups: code-base in context, deepfakes, pixel-peeping, inference costs, and more

10 Sora and Gemini 1.5 follow-ups: code-base in context, deepfakes, pixel-peeping, inference costs, and more

Episode 17 · · 14:58

10 Sora and Gemini 1.5 follow-ups: code-base in context, deepfakes, pixel-peeping, inference costs, and moreThis is AI generated audio with Python and 11Labs. Music generated by Meta...

Releases! OpenAI’s Sora for video, Gemini 1.5's infinite context, and a secret Mistral model

Releases! OpenAI’s Sora for video, Gemini 1.5's infinite context, and a secret Mistral model

Episode 16 · · 09:07

Emergency blog! Three things you need to know from the ML world that arrived yesterday.This is AI generated audio with Python and 11Labs. Music generated by Meta's MusicGen.Source co...

Why reward models are still key to understanding alignment

Why reward models are still key to understanding alignment

Episode 15 · · 07:44

In an era dominated by direct preference optimization and LLMasajudge, why do we still need a model to output only a scalar reward?This is AI generated audio with Python and 11Labs. ...