Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

Dwarkesh Podcast

28-03-2024 • 3 hrs 12 mins

Had so much fun chatting with my good friends Trenton Bricken and Sholto Douglas on the podcast.

No way to summarize it, except:

This is the best context dump out there on how LLMs are trained, what capabilities they're likely to soon have, and what exactly is going on inside them.

You would be shocked how much of what I know about this field, I've learned just from talking with them.

To the extent that you've enjoyed my other AI interviews, now you know why.

So excited to put this out. Enjoy! I certainly did :)

Watch on YouTube. Listen on Apple Podcasts, Spotify, or any other podcast platform.

There's a transcript with links to all the papers the boys were throwing down - may help you follow along.

Follow Trenton and Sholto on Twitter.

Timestamps

(00:00:00) - Long contexts

(00:16:12) - Intelligence is just associations

(00:32:35) - Intelligence explosion & great researchers

(01:06:52) - Superposition & secret communication

(01:22:34) - Agents & true reasoning

(01:34:40) - How Sholto & Trenton got into AI research

(02:07:16) - Are feature spaces the wrong way to think about intelligence?

(02:21:12) - Will interp actually work on superhuman models

(02:45:05) - Sholto’s technical challenge for the audience

(03:03:57) - Rapid fire

Get full access to Dwarkesh Podcast at www.dwarkeshpatel.com/subscribe

You Might Like

Generative AI

Kognitos

Darknet Diaries

Darknet Diaries

Jack Rhysider

Acquired

Ben Gilbert and David Rosenthal

TED Tech

TED Tech

Practical AI: Machine Learning, Data Science, LLM

Practical AI: Machine Learning, Data Science, LLM

Changelog Media

Lenny's Podcast: Product | Growth | Career

Lenny's Podcast: Product | Growth | Career

Lenny Rachitsky

Machine Learning Guide

Machine Learning Guide

Dept

Waveform: The MKBHD Podcast

Waveform: The MKBHD Podcast

Vox Media Podcast Network

Super Data Science: ML & AI Podcast with Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

Jon Krohn

Thoughtworks Technology Podcast

Thoughtworks Technology Podcast

Thoughtworks

WSJ’s The Future of Everything

WSJ’s The Future of Everything

The Wall Street Journal

System Design

Wes and Kevin

Elon Musk Podcast

Elon Musk Podcast

Stage Zero

All-In with Chamath, Jason, Sacks & Friedberg

All-In with Chamath, Jason, Sacks & Friedberg

All-In Podcast, LLC

The Stack Overflow Podcast

The Stack Overflow Podcast

The Stack Overflow Podcast

Talk Python To Me

Talk Python To Me

Michael Kennedy

The Artificial Intelligence Podcast

The Artificial Intelligence Podcast

Dr. Tony Hoang

Software Engineering Radio - the podcast for professional software developers

Software Engineering Radio - the podcast for professional software developers

se-radio@computer.org

AI Chat: ChatGPT & AI News, Artificial Intelligence, OpenAI, Machine Learning

AI Chat: ChatGPT & AI News, Artificial Intelligence, OpenAI, Machine Learning

Jaeden Schafer

The Vergecast

The Verge