Are Emergent Behaviors in LLMs an Illusion? with Sanmi Koyejo - #671

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

12-02-2024 • 1 hr 5 mins

Today we’re joined by Sanmi Koyejo, assistant professor at Stanford University, to continue our NeurIPS 2024 series. In our conversation, Sanmi discusses his two recent award-winning papers. First, we dive into his paper, “Are Emergent Abilities of Large Language Models a Mirage?”. We discuss the different ways LLMs are evaluated and the excitement surrounding their“emergent abilities” such as the ability to perform arithmetic Sanmi describes how evaluating model performance using nonlinear metrics can lead to the illusion that the model is rapidly gaining new capabilities, whereas linear metrics show smooth improvement as expected, casting doubt on the significance of emergence. We continue on to his next paper, “DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models,” discussing the methodology it describes for evaluating concerns such as the toxicity, privacy, fairness, and robustness of LLMs. The complete show notes for this episode can be found at twimlai.com/go/671.

You Might Like

Generative AI

Kognitos

Acquired

Ben Gilbert and David Rosenthal

Darknet Diaries

Darknet Diaries

Jack Rhysider

TED Tech

TED Tech

Thoughtworks Technology Podcast

Thoughtworks Technology Podcast

Thoughtworks

Elon Musk Podcast

Elon Musk Podcast

Stage Zero

Waveform: The MKBHD Podcast

Waveform: The MKBHD Podcast

Vox Media Podcast Network

Lenny's Podcast: Product | Growth | Career

Lenny's Podcast: Product | Growth | Career

Lenny Rachitsky

Super Data Science: ML & AI Podcast with Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

Jon Krohn

The Stack Overflow Podcast

The Stack Overflow Podcast

The Stack Overflow Podcast

System Design

Wes and Kevin

Talk Python To Me

Talk Python To Me

Michael Kennedy (@mkennedy)

The Indian Startup Show

The Indian Startup Show

Neil Patel

Machine Learning Guide

Machine Learning Guide

Dept

Software Engineering Radio - the podcast for professional software developers

Software Engineering Radio - the podcast for professional software developers

se-radio@computer.org

Amazon Unplugged

Amazon Unplugged

Amazon India

WSJ’s The Future of Everything

WSJ’s The Future of Everything

The Wall Street Journal

All-In with Chamath, Jason, Sacks & Friedberg

All-In with Chamath, Jason, Sacks & Friedberg

All-In Podcast, LLC

The Vergecast

The Verge

Product Management: The Journey 0 - 1 - 100

Product Management: The Journey 0 - 1 - 100

Krishna Ramalingam & Mayank Gelani