Ian Osband

TalkRL: The Reinforcement Learning Podcast

07-03-2024 • 1 hr 8 mins

Ian Osband is a research scientist at OpenAI (ex-DeepMind, Stanford) working on decision-making under uncertainty.

We spoke about:

- Information theory and RL

- Exploration, epistemic uncertainty and joint predictions

- Epistemic Neural Networks and scaling to LLMs

Featured References

Reinforcement Learning, Bit by Bit
Xiuyuan Lu, Benjamin Van Roy, Vikranth Dwaracherla, Morteza Ibrahimi, Ian Osband, Zheng Wen

From Predictions to Decisions: The Importance of Joint Predictive Distributions
Zheng Wen, Ian Osband, Chao Qin, Xiuyuan Lu, Morteza Ibrahimi, Vikranth Dwaracherla, Mohammad Asghari, Benjamin Van Roy

Epistemic Neural Networks
Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy

Approximate Thompson Sampling via Epistemic Neural Networks
Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy

Additional References

Thesis defence, Ian Osband
Homepage, Ian Osband
Epistemic Neural Networks at Stanford RL Forum
Behaviour Suite for Reinforcement Learning, Osband et al. 2019
Efficient Exploration for LLMs, Dwaracherla et al. 2024
