Localizing and Editing Knowledge in LLMs with Peter Hase - #679

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

08-04-2024 • 49 mins

Today we're joined by Peter Hase, a fifth-year PhD student at the University of North Carolina NLP lab. We discuss "scalable oversight" and the importance of developing a deeper understanding of how large neural networks make decisions. We learn how interpretability researchers probe matrices, and explore the two schools of thought regarding how LLMs store knowledge. Finally, we discuss the importance of deleting sensitive information from model weights, and how "easy-to-hard generalization" could increase the risk of releasing open-source foundation models. The complete show notes for this episode can be found at twimlai.com/go/679.
