Latent Space: The AI Engineer Podcast

Latent.Space
The podcast by and for AI Engineers! In 2025, over 10 million readers and listeners came to Latent Space to hear about news, papers and interviews in Software 3.0. We cover Foundation Models changing every domain in Code Generation, Multimodality, AI Agents, GPU Infra and more, directly from the founders, builders, and thinkers involved in pushing the cutting edge. We strive to give you everything from the definitive take on the Current Thing to the first introduction to the tech you'll be using in the next 3 months! We break news and run exclusive interviews with guests from OpenAI, Anthropic, Gemini, Meta (Soumith Chintala), Sierra (Bret Taylor), tiny (George Hotz), Databricks/MosaicML (Jon Frankle), Modular (Chris Lattner), Answer.ai (Jeremy Howard), et al. Full show notes always on https://latent.space

www.latent.space

Episodes

The Inventors of Deep Research

Aarush Selvan and Mukund Sridhar from Google Gemini discuss Deep Research, an agent that automates web research and generates in-depth reports. They cover the product's inspiration, implementation challenges, fine-tuning needs, evaluation methods, and diverse use cases, emphasizing the importance of transparency and control for users. The discussion also explores the balance between speed and thoroughness, future directions, and insights from other AI products.

Feb 18, 2025 · 1 hr 2 min

Bee AI: The Wearable Ambient Agent

Bundle tickets for AIE Summit NYC have now sold out. You can now sign up for the livestream, where we will be making a big announcement soon. NYC-based readers and Summit attendees should check out the meetups happening around the Summit. 2024 was a very challenging year for AI Hardware. After the buzz of CES last January, 2024 was marked by the meteoric rise and even harder fall of AI Wearables companies like Rabbit and Humane, with an assist from a pre-wallpaper-app MKBHD. Even Friend.com, ...

Feb 13, 2025 · 1 hr 9 min

The AI Architect — Bret Taylor

If you’re in SF, join us tomorrow for a fun meetup at CodeGen Night! If you’re in NYC, join us for AI Engineer Summit! The Agent Engineering track is now sold out, but 25 tickets remain for AI Leadership and 5 tickets for the workshops. You can see the full schedule of speakers and workshops at https://ai.engineer! It’s exceedingly hard to introduce someone like Bret Taylor. We could recite his Wikipedia page, or his extensive work history through Silicon Valley’s greatest companies, but ev...

Feb 11, 2025 · 1 hr 36 min

Agent Engineering with Pydantic + Graphs — with Samuel Colvin

Did you know that adding a simple Code Interpreter took o3 from 9.2% to 32% on FrontierMath? The Latent Space crew is hosting a hack night Feb 11th in San Francisco focused on CodeGen use cases, co-hosted with E2B and Edge AGI; watch E2B’s new workshop and RSVP here! We’re happy to announce that today’s guest Samuel Colvin will be teaching his very first Pydantic AI workshop at the newly announced AI Engineer NYC Workshops day on Feb 22! 25 tickets left. If you’re a Python developer, it’s ver...

Feb 06, 2025 · 1 hr 4 min

The Agent Reasoning Interface: o1/o3, Claude 3, ChatGPT Canvas, Tasks, and Operator — with Karina Nguyen of OpenAI

Sponsorships and tickets for the AI Engineer Summit are selling fast! See the new website with speakers and schedules live! If you are building AI agents or leading teams of AI Engineers, this will be the single highest-signal conference of the year for you, this Feb 20-22nd in NYC. We’re pleased to share that Karina will be presenting OpenAI’s closing keynote at the AI Engineer Summit. We were fortunate to get some time with her today to introduce some of her work, and hope this serves as nic...

Feb 01, 2025 · 1 hr 9 min

Outlasting Noam Shazeer, crowdsourcing Chai AI with >1.4m DAU, and becoming the "Western DeepSeek" — with William Beauchamp, Chai Research

One last Gold sponsor slot is available for the AI Engineer Summit in NYC. Our last round of invites is going out soon; apply here. If you are building AI agents or AI eng teams, this will be the single highest-signal conference of the year for you! While the world melts down over DeepSeek, few are talking about the OTHER notable group of former hedge fund traders who pivoted into AI and built a remarkably profitable consumer AI business with a tiny team with incredibly cracked engineering ...

Jan 26, 2025 · 1 hr 16 min

Everything you need to run Mission Critical Inference (ft. DeepSeek v3 + SGLang)

Sponsorships and applications for the AI Engineer Summit in NYC are live! (Speaker CFPs have closed.) If you are building AI agents or leading teams of AI Engineers, this will be the single highest-signal conference of the year for you. Right after Christmas, the Chinese Whale Bros ended 2024 by dropping the last big model launch of the year: DeepSeek v3. Right now on LM Arena, DeepSeek v3 has a score of 1319, right under the full o1 model, Gemini 2, and 4o latest. This makes it the best open...

Jan 19, 2025 · 1 hr

[Ride Home] Simon Willison: Things we learned about LLMs in 2024

Due to overwhelming demand (>15x applications:slots), we are closing CFPs for AI Engineer Summit NYC today. Last call! Thanks, we’ll be reaching out to all shortly! The world’s top AI blogger and friend of every pod, Simon Willison, dropped a monster 2024 recap: Things we learned about LLMs in 2024. Brian of the excellent TechMeme Ride Home pinged us for a connection and a special crossover episode, our first in 2025. The target audience for this podcast is a tech-literate, but non-technical...

Jan 12, 2025 · 1 hr 13 min

Beating Google at Search with Neural PageRank and $5M of H200s — with Will Bryk of Exa.ai

Applications close Monday for the NYC AI Engineer Summit focusing on AI Leadership and Agent Engineering! If you applied, invites should be rolling out shortly. The search landscape is experiencing a fundamental shift. Google built a >$2T company with the “10 blue links” experience, driven by PageRank as the core innovation for ranking. This was a big improvement from the previous directory-based experiences of AltaVista and Yahoo. Almost 4 decades later, Google is now stuck in this links-bas...

Jan 10, 2025 · 56 min

AI Engineering for Art — with comfyanonymous, of ComfyUI

Applications for the NYC AI Engineer Summit, focused on Agents at Work, are open! When we first started Latent Space, in the lightning round we’d always ask guests: “What’s your favorite AI product?”. The majority would say Midjourney. The simple UI of prompt → very aesthetic image turned it into a $300M+ ARR bootstrapped business as it rode the first wave of AI image generation. In open source land, StableDiffusion was congregating around AUTOMATIC1111 as the de-facto web UI. Unlike Midjourne...

Jan 04, 2025 · 55 min

Latent.Space 2024 Year in Review

Applications for the 2025 AI Engineer Summit are up, and you can save the date for AIE Singapore in April and AIE World’s Fair 2025 in June. Happy new year, and thanks for 100 great episodes! Please let us know what you want to see/hear for the next 100! Full YouTube Episode with Slides/Charts Like and subscribe and hit that bell to get notifs! Timestamps * 00:00 Welcome to the 100th Episode! * 00:19 Reflecting on the Journey * 00:47 AI Engineering: The Rise and Impact * 03:15 Latent Space Live...

Dec 31, 2024 · 1 hr 52 min

2024 in Agents [LS Live! @ NeurIPS 2024]

Happy holidays! We’ll be sharing snippets from Latent Space LIVE! through the break bringing you the best of 2024! We want to express our deepest appreciation to event sponsors AWS, Daylight Computer, Thoth.ai, StrongCompute, Notable Capital, and most of all our LS supporters who helped fund the gorgeous venue and A/V production! For NeurIPS last year we did our standard conference podcast coverage interviewing selected papers (that we have now also done for ICLR and ICML), however we ...

Dec 25, 2024 · 49 min

2024 in Synthetic Data and Smol Models [LS Live @ NeurIPS]

Happy holidays! We’ll be sharing snippets from Latent Space LIVE! through the break bringing you the best of 2024! We want to express our deepest appreciation to event sponsors AWS, Daylight Computer, Thoth.ai, StrongCompute, Notable Capital, and most of all our LS supporters who helped fund the gorgeous venue and A/V production! For NeurIPS last year we did our standard conference podcast coverage interviewing selected papers (that we have now also done for ICLR and ICML), however we ...

Dec 24, 2024 · 29 min

2024 in Post-Transformers Architectures (State Space Models, RWKV) [LS Live @ NeurIPS]

Happy holidays! We’ll be sharing snippets from Latent Space LIVE! through the break bringing you the best of 2024! We want to express our deepest appreciation to event sponsors AWS, Daylight Computer, Thoth.ai, StrongCompute, Notable Capital, and most of all our LS supporters who helped fund the gorgeous venue and A/V production! Update: see followup discussion on HN and also the YouTube discussion. For NeurIPS last year we did our standard conference podcast coverage interviewing sele...

Dec 24, 2024 · 43 min

2024 in Open Models [LS Live @ NeurIPS]

Happy holidays! We’ll be sharing snippets from Latent Space LIVE! through the break bringing you the best of 2024! We want to express our deepest appreciation to event sponsors AWS, Daylight Computer, Thoth.ai, StrongCompute, Notable Capital, and most of all our LS supporters who helped fund the venue and A/V production! For NeurIPS last year we did our standard conference podcast coverage interviewing selected papers (that we have now also done for ICLR and ICML), however we felt that we ...

Dec 23, 2024 · 42 min

2024 in Vision [LS Live @ NeurIPS]

Happy holidays! We’ll be sharing snippets from Latent Space LIVE! through the break bringing you the best of 2024! We want to express our deepest appreciation to event sponsors AWS, Daylight Computer, Thoth.ai, StrongCompute, Notable Capital, and most of all our LS supporters who helped fund the gorgeous venue and A/V production! For NeurIPS last year we did our standard conference podcast coverage interviewing selected papers (that we have now also done for ICLR and ICML), however we ...

Dec 22, 2024 · 57 min

2024 in AI Startups [LS Live @ NeurIPS]

Happy holidays! We’ll be sharing snippets from Latent Space LIVE! through the break bringing you the best of 2024 from friends of the pod! For NeurIPS last year we did our standard conference podcast coverage interviewing selected papers (that we have now also done for ICLR and ICML), however we felt that we could be doing more to help AI Engineers 1) get more industry-relevant content, and 2) recap 2024 year in review from experts. As a result, we organized the first Latent Space LIVE!, our fi...

Dec 21, 2024 · 52 min

Windsurf: The Enterprise AI IDE - with Varun and Anshul of Codeium AI

Our second podcast guest ever in March 2023 was Varun Mohan, CEO of Codeium; at the time, they had around 10,000 users and vowed to keep their autocomplete free forever. Today, over a million developers use their products, they still have their free tier, and they recently launched Windsurf, an AI IDE. Chapters * 00:00:00: Introductions & Catchup * 00:03:52: Why they created Windsurf * 00:05:52: Limitations of VS Code * 00:10:12: Evaluation methods for Cascade and Windsurf * 00:16:...

Dec 13, 2024 · 1 hr 7 min

Generative Video WorldSim, Diffusion, Vision, Reinforcement Learning and Robotics — ICML 2024 Part 1

Regular tickets are now sold out for Latent Space LIVE! at NeurIPS! We have just announced our last speaker and newest track, friend of the pod Nathan Lambert who will be recapping 2024 in Reasoning Models like o1! We opened up a handful of late bird tickets for those who are deciding now; use code DISCORDGANG if you need it. See you in Vancouver! We’ve been sitting on our ICML recordings for a while (from today’s first-ever SOLO guest cohost, Brittany Walker), and in light of Sora Turbo’s l...

Dec 10, 2024 · 7 hr 8 min

Bolt.new, Flow Engineering for Code Agents, and >$8m ARR in 2 months as a Claude Wrapper

The full schedule for Latent Space LIVE! at NeurIPS has been announced, featuring Best of 2024 overview talks for the AI Startup Landscape, Computer Vision, Open Models, Transformers Killers, Synthetic Data, Agents, and Scaling, and speakers from Sarah Guo of Conviction, Roboflow, AI2/Meta, Recursal/Together, HuggingFace, OpenHands and SemiAnalysis. Join us for the IRL event/Livestream! Alessio will also be holding a meetup at AWS Re:Invent in Las Vegas this Wednesday. See our new Events page f...

Dec 02, 2024 · 1 hr 39 min

The new Claude 3.5 Sonnet, Computer Use, and Building SOTA Agents — with Erik Schluntz, Anthropic

We have announced our first speaker, friend of the show Dylan Patel, and topic slates for Latent Space LIVE! at NeurIPS. Sign up for IRL/Livestream and to debate! We are still taking questions for our next big recap episode! Submit questions and messages on Speakpipe here for a chance to appear on the show! The vibe shift we observed in July (in favor of Claude 3.5 Sonnet, first introduced in June) has been remarkably long lived and persistent, surviving multiple subsequent updates of 4o, o1...

Nov 28, 2024 · 1 hr 11 min

Why Compound AI + Open Source will beat Closed AI

We have a full slate of upcoming events: AI Engineer London, AWS Re:Invent in Las Vegas, and now Latent Space LIVE! at NeurIPS in Vancouver and online. Sign up to join and speak! We are still taking questions for our next big recap episode! Submit questions and messages on Speakpipe here for a chance to appear on the show! We try to stay close to the inference providers as part of our coverage, as our podcasts with Together AI and Replicate will attest. However, one of the most notable pull quo...

Nov 25, 2024 · 58 min

Agents @ Work: Lindy.ai

Alessio will be at AWS re:Invent next week and hosting a casual coffee meetup on Wednesday, RSVP here! And subscribe to our calendar for our Singapore, NeurIPS, and all upcoming meetups! We are still taking questions for our next big recap episode! Submit questions and messages on Speakpipe here for a chance to appear on the show! If you've been following the AI agents space, you have heard of Lindy AI; while founder Flo Crivello is hesitant to call it "blowing up," when folks like Andrew Wilkin...

Nov 15, 2024 · 1 hr 10 min

Agents @ Work: Dust.tt

We are recording our next big recap episode and taking questions! Submit questions and messages on Speakpipe here for a chance to appear on the show! Also subscribe to our calendar for our Singapore, NeurIPS, and all upcoming meetups! In our first ever episode with Logan Kilpatrick we called out the two hottest LLM frameworks at the time: LangChain and Dust. We’ve had Harrison from LangChain on twice (as a guest and as a co-host), and we’ve now finally come full circle as Stanislas from Dust j...

Nov 11, 2024 · 1 hr

In the Arena: How LMSys changed LLM Benchmarking Forever

Apologies for lower audio quality; we lost recordings and had to use backup tracks. Our guests today are Anastasios Angelopoulos and Wei-Lin Chiang, leads of Chatbot Arena, fka LMSYS, the crowdsourced AI evaluation platform developed by the LMSys student club at Berkeley, which became the de facto standard for comparing language models. For many folks, Arena Elo is cited more often than MMLU scores, and they have attracted >1,000,000 people to cast votes since its launch, leading top model tra...

Nov 01, 2024 · 41 min

How NotebookLM Was Made

If you’ve listened to the podcast for a while, you might have heard our ElevenLabs-powered AI co-host Charlie a few times. Text-to-speech has made amazing progress in the last 18 months, with OpenAI’s Advanced Voice Mode (aka “Her”) as a sneak peek of the future of AI interactions (see our “Building AGI in Real Time” recap). Still, we had yet to see a real killer app for AI voice (not counting music). Today’s guests, Raiza Martin and Usama Bin Shafqat, are the lead PM and AI engineer behind the...

Oct 25, 2024 · 1 hr 14 min

Building the AI Engineer Nation — with Josephine Teo, Minister of Digital Development and Information, Singapore

Singapore's GovTech is hosting an AI CTF challenge with ~$15,000 in prizes, starting October 26th, open to both local and virtual hackers. It will be hosted on Dreadnode's Crucible platform; signup here! It is common to say if you want to work in AI, you should come to San Francisco. Not everyone can. Not everyone should. If you can only do meaningful AI work in one city, then AI has failed to generalize meaningfully. As non-Americans working in the US, we know what it’s like to see AI progres...

Oct 19, 2024 · 57 min

Building the Silicon Brain - with Drew Houston of Dropbox

CEOs of publicly traded companies are often in the news talking about their new AI initiatives, but few of them have built anything with it. Drew Houston from Dropbox is different; he has spent over 400 hours coding with LLMs in the last year and is now refocusing his 2,500+ employees around this new way of working, 17 years after founding the company. Timestamps 00:00 Introductions 00:43 Drew's AI journey 04:14 Revalidating expectations of AI 08:23 Simulation in self-driving vs. knowledge work ...

Oct 18, 2024 · 1 hr 12 min

Production AI Engineering starts with Evals — with Ankur Goyal of Braintrust

We are in 🗽 NYC this Monday! Join the AI Eng NYC meetup, bring demos and vibes! It is a bit of a meme that the first thing developer tooling founders think to build in AI is all the non-AI operational stuff outside the AI. There are well over 60 funded LLM Ops startups all hoping to solve the new observability, cost tracking, security, and reliability problems that come with putting LLMs in production, not to mention new LLM oriented products from incumbent, established ops/o11y players l...

Oct 11, 2024 · 1 hr 57 min

Building AGI in Real Time (OpenAI Dev Day 2024)

We all have fond memories of the first Dev Day in 2023, and the blip that followed soon after. As Ben Thompson has noted, this year’s DevDay took a quieter, more intimate tone. No Satya, no livestream, (slightly fewer people?). Instead of putting ChatGPT announcements in DevDay as in 2023, o1 was announced 2 weeks prior, and DevDay 2024 was reserved purely for developer-facing API announcements, primarily the Realtime API, Vision Finetuning, Prompt Caching, and Model Distillation. However the...

Oct 03, 2024 · 2 hr 9 min