Coercing LLMs to Do and Reveal (Almost) Anything with Jonas Geiping - #678

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

01-04-2024 • 48 mins

Today we're joined by Jonas Geiping, a research group leader at the ELLIS Institute, to explore his paper: "Coercing LLMs to Do and Reveal (Almost) Anything". Jonas explains how neural networks can be exploited, highlighting the risk of deploying LLM agents that interact with the real world. We discuss the role of open models in enabling security research, the challenges of optimizing over certain constraints, and the ongoing difficulties in achieving robustness in neural networks. Finally, we delve into the future of AI security, and the need for a better approach to mitigate the risks posed by optimized adversarial attacks. The complete show notes for this episode can be found at twimlai.com/go/678.

You Might Like

Generative AI

Kognitos

Acquired

Ben Gilbert and David Rosenthal

Darknet Diaries

Darknet Diaries

Jack Rhysider

TED Tech

TED Tech

Thoughtworks Technology Podcast

Thoughtworks Technology Podcast

Thoughtworks

Elon Musk Podcast

Elon Musk Podcast

Stage Zero

Waveform: The MKBHD Podcast

Waveform: The MKBHD Podcast

Vox Media Podcast Network

Lenny's Podcast: Product | Growth | Career

Lenny's Podcast: Product | Growth | Career

Lenny Rachitsky

Super Data Science: ML & AI Podcast with Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

Jon Krohn

The Stack Overflow Podcast

The Stack Overflow Podcast

The Stack Overflow Podcast

System Design

Wes and Kevin

Talk Python To Me

Talk Python To Me

Michael Kennedy (@mkennedy)

The Indian Startup Show

The Indian Startup Show

Neil Patel

Machine Learning Guide

Machine Learning Guide

Dept

Software Engineering Radio - the podcast for professional software developers

Software Engineering Radio - the podcast for professional software developers

se-radio@computer.org

Amazon Unplugged

Amazon Unplugged

Amazon India

WSJ’s The Future of Everything

WSJ’s The Future of Everything

The Wall Street Journal

All-In with Chamath, Jason, Sacks & Friedberg

All-In with Chamath, Jason, Sacks & Friedberg

All-In Podcast, LLC

The Vergecast

The Verge

Product Management: The Journey 0 - 1 - 100

Product Management: The Journey 0 - 1 - 100

Krishna Ramalingam & Mayank Gelani