Building and Deploying Real-World RAG Applications with Ram Sriharsha - #669

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

29-01-2024 • 35 mins

Today we’re joined by Ram Sriharsha, VP of engineering at Pinecone. In our conversation, we dive into the topic of vector databases and retrieval augmented generation (RAG). We explore the trade-offs between relying solely on LLMs for retrieval tasks versus combining retrieval in vector databases and LLMs, the advantages and complexities of RAG with vector databases, the key considerations for building and deploying real-world RAG-based applications, and an in-depth look at Pinecone's new serverless offering. Currently in public preview, Pinecone Serverless is a vector database that enables on-demand data loading, flexible scaling, and cost-effective query processing. Ram discusses how the serverless paradigm impacts the vector database’s core architecture, key features, and other considerations. Lastly, Ram shares his perspective on the future of vector databases in helping enterprises deliver RAG systems. The complete show notes for this episode can be found at twimlai.com/go/669.

You Might Like

Generative AI

Kognitos

Acquired

Ben Gilbert and David Rosenthal

Darknet Diaries

Darknet Diaries

Jack Rhysider

TED Tech

TED Tech

Waveform: The MKBHD Podcast

Waveform: The MKBHD Podcast

Vox Media Podcast Network

Thoughtworks Technology Podcast

Thoughtworks Technology Podcast

Thoughtworks

Elon Musk Podcast

Elon Musk Podcast

Stage Zero

Lenny's Podcast: Product | Growth | Career

Lenny's Podcast: Product | Growth | Career

Lenny Rachitsky

Super Data Science: ML & AI Podcast with Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

Jon Krohn

The Stack Overflow Podcast

The Stack Overflow Podcast

The Stack Overflow Podcast

System Design

Wes and Kevin

WSJ’s The Future of Everything

WSJ’s The Future of Everything

The Wall Street Journal

Machine Learning Guide

Machine Learning Guide

Dept

Talk Python To Me

Talk Python To Me

Michael Kennedy (@mkennedy)

The Vergecast

The Verge

The Indian Startup Show

The Indian Startup Show

Neil Patel

Software Engineering Radio - the podcast for professional software developers

Software Engineering Radio - the podcast for professional software developers

se-radio@computer.org

All-In with Chamath, Jason, Sacks & Friedberg

All-In with Chamath, Jason, Sacks & Friedberg

All-In Podcast, LLC

AI Chat: ChatGPT & AI News, Artificial Intelligence, OpenAI, Machine Learning

AI Chat: ChatGPT & AI News, Artificial Intelligence, OpenAI, Machine Learning

Jaeden Schafer

The Real Python Podcast

The Real Python Podcast

Real Python