Latent Space: The AI Engineer Podcast

Latent.Space · www.latent.space
The podcast by and for AI Engineers! In 2025, over 10 million readers and listeners came to Latent Space to hear about news, papers and interviews in Software 3.0. We cover Foundation Models changing every domain in Code Generation, Multimodality, AI Agents, GPU Infra and more, directly from the founders, builders, and thinkers involved in pushing the cutting edge. Striving to give you both the definitive take on the Current Thing down to the first introduction to the tech you'll be using in the next 3 months! We break news and exclusive interviews from OpenAI, Anthropic, Gemini, Meta (Soumith Chintala), Sierra (Bret Taylor), tiny (George Hotz), Databricks/MosaicML (Jon Frankle), Modular (Chris Lattner), Answer.ai (Jeremy Howard), et al. Full show notes always on https://latent.space

Episodes

Reality: The Final Eval — Lukas Petersson and Axel Backlund of Andon Labs

The new AIEWF website is live! Get your tickets booked ASAP as they -will- sell out. Take the AI Engineering Survey and get >$2k in credits and free AIE WF tickets! Most industry benchmarks compress intelligence and reasoning ability into scores: SWE-Bench Pro, MMLU, Humanity’s Last Exam, etc. These metrics are useful, but don’t always represent the full extent of how a model performs in the real world. Some of the most interesting evals today look less like exams and more like operating...

Jun 04, 2026 · 1 hr 16 min

🔬Scaling Past Informal AI - Carina Hong, Axiom Math

In 2025, seven-month-old startup Axiom solved all 12 problems on the Putnam exam (scoring 8/12 within the time limit), a prestigious undergraduate math exam. The 12/12 score is better than the top undergraduates (110/120) and the closest AI system that reported a result (DeepSeek 103/120), although it is unclear what the people and other systems would have scored with more time. Nonetheless, the Putnam exam is legendary for its difficulty, with the median score typically being 0 or 1 points. Taken by...

Jun 03, 2026 · 1 hr 33 min

⚡️Satya Nadella: No Priors x Latent Space Crossover Special at Microsoft Build

We’ve informally heard that Satya has been a listener of LS for a couple of years now, but it was still absolutely surreal to meet him and do a live pod at Build, together with our friends at No Priors, the leading VC AI Podcast that we also greatly admire! We covered the MAI model technical takeaways on yesterday’s AINews, so I will focus our recap of Satya’s main messages around three elements: * Satya’s adaptation of the Bill Gates Line for positioning Microsoft as the Frontier Intelligence Platform...

Jun 03, 2026 · 39 min

GitHub's plan for Agents — Kyle Daigle, GitHub

I’m excited to work with Microsoft once again as the presenting sponsors of the AI Engineer World’s Fair! We’ll be streaming live from MS Build today for a special crossover pod with our friends at No Priors and the one and only Satya Nadella. However, we did not hold back with this interview - we asked all the burning questions about uptime and Copilot that we know you have in your minds. Let’s go! For almost two decades, GitHub has been the home of software, where both open source and closed flow...

Jun 02, 2026 · 1 hr 23 min

Why Video Agent models are next — Ethan He, xAI Grok Imagine

We’re announcing AIEWF speakers this week! Take the AI Engineering Survey! Today’s guest Ethan first joined us for the LS Paper Club as the lead on NVIDIA Cosmos World Model, but then joined xAI and built Grok Imagine in 3 months. He comes back on Latent Space with some nuclear hot takes: that Video Models primarily get their intelligence from LLMs, not from training on video data, and that the next frontier for truly interactive, realtime, long-horizon world models is to work on LLMs (perhap...

Jun 01, 2026 · 1 hr 43 min

The Age of Async Agents — Cognition's Walden Yan & OpenInspect's Cole Murray

The new AIEWF website is live! CFPs close in 2 days and we will run our first New Engineer Orientation this weekend; get your tickets booked ASAP as they -will- sell out. Take the AI Engineering Survey and get >$2k in credits and free AIE WF tickets! One of the central tensions in the agents industry is that even while there are major decacorn agent labs like Sierra, Decagon, Notion and Cursor being built up, it is also true that it has never been easier to DIY agents, with a plethora of age...

May 28, 2026 · 1 hr 8 min

🔬ESM: The Bitter Lesson is Coming for Proteins - Alex Rives, BioHub

Editor’s note: In our first BioHub pod with Priscilla and Mark they discussed their acquisition of EvoScale, led by Alex Rives, who is now Head of Science at BioHub. With ESM-1 they trained language models on millions of protein sequences drawn from across life, with a simple “next token” objective: predict the amino acids that have been randomly masked out, based on the context of the rest of the sequence. But they soon found that these models also learned biological structure and function, i...

May 27, 2026 · 1 hr 10 min

Giving Agents Computers — Ivan Burazin, Daytona

Take the 2026 AI Engineering Survey and get >$2k in credits and AIE WF tickets! On the product side, everyone is getting Computer - Perplexity, Manus, Cursor, and so on. Meanwhile on the research side, agentic evals like TerminalBench and GDPVal are also assuming computer (Harbor). On both ends, the consolidating LLM OS stack has become a standard toolkit, and Daytona is one of a small set of AI Infra companies that are booming because of it. “The end of localhost” has been Ivan Burazin...

May 21, 2026 · 1 hr 10 min

Railway: The Agent-Native Cloud — Jake Cooper

Take the 2026 AI Engineering Survey and get >$2k in credits and AIE WF tickets! This was recorded before Railway suffered a major GCP outage on May 19, despite being a multi-AZ, multi-zone mesh ring, with HA fiber interconnects between their Metal <> GCP <> AWS, because workload discoverability was unintentionally still tied to GCP. All has been resolved with a post-mortem. Railway did not start as an AI infrastructure company. It was founded in 2020, years before agents became t...

May 20, 2026 · 1 hr 29 min

The Autonomous Drone Tech Stack & Economics of Drones — Yaroslav Azhnyuk, The Fourth Law & Guest Host Noah Smith, Noahpinion

The future of war has been evolving before our eyes in Ukraine, yet the West still plans to fight the last war. In this special episode, guest host Noah Smith (@noahpinion) and Brandon Anderson sit down with Yaroslav Azhnyuk (@YaroslavAzhnyuk), a serial tech founder who went from building PetCube to founding The Fourth Law, one of the world’s most advanced AI-guided drone companies. Over two hours we cover the technology, tactics, and geopolitics of drone warfare, and why the modern battle...

May 18, 2026 · 1 hr 59 min

AI-Native Healthcare: 100M Doctor Visits, 10–20 Hours Saved, Prior Auth in Minutes — Janie Lee & Chai Asawa, Abridge

Special discounts up for AIE Melbourne (LS discount) and AIE World’s Fair (group discounts up to 25% - CFPs still open for Autoresearch and Vertical AI). Cya there! Abridge did not start as a “GPT wrapper”. It was founded in 2018, years before the Cambrian explosion of AI application layer companies. OpenAI launched ChatGPT publicly on November 30, 2022, and by then, Abridge had already spent years doing the unglamorous work of building trust for one of the highest context, most important work...

May 14, 2026 · 1 hr 5 min

🔬Doing Vibe Physics — Alex Lupsasca, OpenAI

Some people are going crazy over GPT 5.5. Some people. This is the story of the Jagged Frontier. People who use AI to write emails or even code implementation work find the lift moderate, whereas people pushing the limits of the model are figuring out that the limits just moved outwards. Alex Lupsasca has been tracking this limit for a year and a half now. “When GPT5 came out, it was able to reproduce one of my best papers (that took a very long time to come up with) in 30 minutes.” But Alex a...

May 05, 2026 · 1 hr 32 min

Physical AI that Moves the World — Qasar Younis & Peter Ludwig, Applied Intuition

From building Applied Intuition from YC-era autonomy tooling into a $15B physical AI company, Qasar Younis and Peter Ludwig have spent the last decade living through the full arc of autonomy: from simulation and data infrastructure for robotaxi companies, to operating systems for safety-critical machines, to deploying AI onto cars, trucks, mining equipment, construction vehicles, agriculture, defense systems, and driverless L4 trucks running in Japan today. They join us to explain why “physical...

Apr 27, 2026 · 1 hr 12 min

AIE Europe Debrief + Agent Labs Thesis: Unsupervised Learning x Latent Space Crossover Special (2026)

Today, we check in a year after the first Unsupervised Learning x Latent Space Crossover special to discuss everything that has changed (there is a lot) in the world of AI. This episode was recorded just after AIE Europe, but before the Cursor-xAI deal. Unsupervised Learning is a podcast that interviews the sharpest minds in AI about what’s real today, what will be real in the future and what it means for businesses and the world - helping builders, researchers and founders deconstruct and und...

Apr 23, 2026 · 55 min

Shopify’s AI Phase Transition: 2026 Usage Explosion, Unlimited Opus-4.6 Token Budget, Tangle, Tangent, SimGym — with Mikhail Parakhin, Shopify CTO

Early bird discounts for the San Francisco World’s Fair, the biggest AIE gathering of the year, end today - prices will go up by ~$500 tonight so do please lock in ASAP! From near-universal AI tool adoption inside Shopify to internal systems for ML experimentation, auto-research, customer simulation, and ultra-low-latency search, Mikhail Parakhin joins us for a deep dive into what it actually looks like when a 20-year-old, $200B software company goes all-in on AI. We cover why Shopify has beco...

Apr 22, 2026 · 1 hr 12 min

🔬 Training Transformers to solve 95% failure rate of Cancer Trials — Ron Alfa & Daniel Bear, Noetik

Today, we explain this piece of “clickbait” from our guest! TL;DR: 95% of cancer treatments fail to pass clinical trials, but it may be a matching problem — if we better understood which patients have which tumors which will respond to which treatments, success rates improve dramatically and millions of lives can be saved — with the treatments we ALREADY have. See our full episode dropping today: Why Big Pharma is licensing AI Models. Tolstoy famously wrote, ‘All healthy cells are alike; each can...

Apr 20, 2026 · 1 hr 25 min

Notion’s Token Town: 5 Rebuilds, 100+ Tools, MCP vs CLIs and the Software Factory Future — Simon Last & Sarah Sachs of Notion

For all those who missed out on London, see you in Miami next week! Notion, the knowledge work decacorn, has been building AI tooling since before ChatGPT, with many hits from Q&A in 2023 and unified AI in 2024 and Meeting Notes in 2025. At the end of their last Make user conference, Ryan Nystrom teased Notion 3.0’s Custom Agents - and they are finally embracing the Agent Lab playbook! Sarah Sachs and Simon Last of Notion join us for a deep dive into how Notion built Custom Agents, why i...

Apr 15, 2026 · 1 hr 17 min

Extreme Harness Engineering for Token Billionaires: 1M LOC, 1B toks/day, 0% human code, 0% human review — Ryan Lopopolo, OpenAI Frontier & Symphony

We’re proud to release this ahead of Ryan’s keynote at AIE Europe. Hit the bell, get notified when it is live! Attendees: come prepped for Ryan’s AMA with Vibhu after. Move over, context engineering. Now it’s time for Harness engineering and the age of the token billionaires. Ryan Lopopolo of OpenAI is leading that charge, recently publishing a lengthy essay on Harness Eng that has become the talk of the town: In it, Ryan peeled back the curtains on how the recently announced OpenAI Frontier...

Apr 07, 2026 · 1 hr 13 min

Marc Andreessen introspects on The Death of the Browser, Pi + OpenClaw, and Why "This Time Is Different"

Fresh off raising a monster $15B, Marc Andreessen has lived through multiple computing platform shifts firsthand, from Mosaic and Netscape to cofounding a16z. In this episode, Marc joins swyx and Alessio in a16z’s legendary Sand Hill Road office to argue that AI is not just another hype cycle, but the payoff of an “80-year overnight success”: from neural nets and expert systems to transformers, reasoning models, coding, agents, and recursive self-improvement. He lays out why he thinks this mome...

Apr 03, 2026 · 1 hr 16 min

Moonlake: Causal World Models should be Multimodal, Interactive, and Efficient — with Chris Manning and Fan-yun Sun

We’ve been on a bit of a mini World Models series over the last quarter: from introducing the topic with Yi Tay, to exploring Marble with World Labs’ Fei-Fei Li and Justin Johnson, to previewing World Models learned from massive gaming datasets with General Intuition’s Pim de Witte (who has now written down their approach to World Models with Not Boring), to discussing the Cosmos World Model with Andrew White of Edison Scientific on our new Science pod, to writing up our own theses on Adv...

Apr 02, 2026 · 1 hr 7 min

Mistral: Voxtral TTS, Forge, Leanstral, & what's next for Mistral 4 — w/ Pavan Kumar Reddy & Guillaume Lample

Mistral has been on an absolute tear - with frequent successful model launches it is easy to forget that they raised the largest European AI round in history last year. We were long overdue for a Mistral episode, and we were very fortunate to work with Sophia and Howard to catch up with Pavan (Voxtral lead) and Guillaume (Chief Scientist, Co-founder) on the occasion of this week’s Voxtral TTS launch: Mistral can’t directly say it, but the benchmarks do imply that this is basically an open-weig...

Mar 30, 2026 · 49 min

🔬Why There Is No "AlphaFold for Materials" — AI for Materials Discovery with Heather Kulik

Materials science is the unsung hero of the science world. Behind every physical product you interact with are decades of research into getting the properties of materials just right. Your gym clothes contain synthetic fibers developed over decades. The glass screen, diodes, and chip substrate technology needed to read this blog post were only viable due to many teams of material scientists. Our guest Prof. Heather Kulik was one of the first material scientists to realize that there was alpha in comb...

Mar 24, 2026 · 35 min

Dreamer: the Personal Agent OS — David Singleton

Mar 23 update for Latent Spacenauts: this episode was recorded before the Dreamer team announced they were joining Meta Superintelligence Labs, and it turned out to be the last interview they did before the news became public. Consider this a snapshot from just before the transition! In 2024, David Singleton left Stripe and joined forces with Hugo Barra for a buzzy stealth startup named /dev/agents. This month they emerged as Dreamer, a consumer-first platform to discover, build, and use ...

Mar 20, 2026 · 1 hr 4 min

Why Anthropic Thinks AI Should Have Its Own Computer — Felix Rieseberg of Claude Cowork & Claude Code Desktop

Felix Rieseberg, a key figure in Electron and desktop apps, details Anthropic's Claude Cowork, an agent designed for diverse knowledge work beyond coding. He explains its VM-based architecture offers safety and capability, allowing Claude to interact with a user's local computer, run scripts, and leverage browser context for powerful automation. The discussion covers Anthropic's prototype-first approach, the rise of portable 'skills,' AI's impact on labor, and the evolving relationship between users and intelligent agents.

Mar 17, 2026 · 1 hr 27 min

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer

Turbopuffer came out of a reading app. In 2022, Simon was helping his friends at Readwise scale their infra for a highly requested feature: article recommendations and semantic search. Readwise was paying ~$5k/month for their relational database and vector search would cost ~$20k/month, making the feature too expensive to ship. In 2023 after mulling over the problem from Readwise, Simon decided he wanted to “build a search engine” which became Turbopuffer. We discuss: • Simon’s path: Denmark → S...

Mar 12, 2026 · 1 hr 1 min

NVIDIA's AI Engineers: Agent Inference at Planetary Scale and "Speed of Light" — Nader Khalil (Brev), Kyle Kranen (Dynamo)

Join Kyle, Nader, Vibhu, and swyx live at NVIDIA GTC next week! Now that AIE Europe tix are ~sold out, our attention turns to Miami and World’s Fair! The definitive AI Accelerator chip company has more than 10xed this AI Summer: And is now a $4.4 trillion megacorp… that is somehow still moving like a startup. We are blessed to have a unique relationship with our first ever NVIDIA guests: Kyle Kranen who gave a great inference keynote at the first World’s Fair and is one of the leading architec...

Mar 10, 2026 · 1 hr 24 min

Cursor's Third Era: Cloud Agents

All speakers are announced at AIE EU, schedule coming soon. Join us there or in Miami with the renowned organizers of React Miami! Singapore CFP also open! We’ve called this out a few times over in AINews, but the overwhelming consensus in the Valley is that “the IDE is Dead”. In November it was just a gut feeling, but now we actually have data: even at the canonical “VSCode Fork” company, people are officially using more agents than tab autocomplete (the first wave of AI coding): Cursor ha...

Mar 06, 2026 · 1 hr 7 min

Every Agent Needs a Box — Aaron Levie, Box

The reception to our recent post on Code Reviews has been strong. Catch up! Amid a maelstrom of discussion on whether or not AI is killing SaaS, one of the top publicly listed SaaS companies in the world has just reported record revenues, clearing well over $1.1B in ARR for the first time with a 28% margin. As we comment on the pod, Aaron Levie is the rare public company CEO equally at home in both worlds of Silicon Valley and Wall Street/Main Street, by day helping 70% of the Fortune 500 wit...

Mar 05, 2026 · 1 hr 17 min

METR’s Joel Becker on exponential Time Horizon Evals, Threat Models, and the Limits of AI Productivity

This is a free preview of a paid episode. To hear more, visit www.latent.space AIE Europe CFP and AIE World’s Fair paper submissions for CAIS peer review are due TODAY - do not delay! Last call ever. We’re excited to welcome METR for their first LS Pod, hopefully the first of many: METR are keepers of currently the single most infamous chart in AI. But every Latent Space reader should be sophisticated enough to know that the details matter and that hype and hyperbole go hand in hand in AI socia...

Feb 27, 2026 · 56 min

[LIVE] Anthropic Distillation & How Models Cheat (SWE-Bench Dead) | Nathan Lambert & Sebastian Raschka

Swyx joined SAIL! Thank you SAIL Media, Prof. Tom Yeh, 8Lee, Hamid Bagheri, c9n, and many others for tuning into SAIL Live #6 with Nathan Lambert and Sebastian Raschka, PhD. Sharing here for the LS paid subscribers. We covered: This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit www.latent.space/subscribe...

Feb 26, 2026 · 52 min