The MAD Podcast with Matt Turck

Matt Turck · firstmark.com
The MAD Podcast with Matt Turck is a series of conversations with leaders from across the Machine Learning, AI, & Data landscape, hosted by Matt Turck, a leading AI & data investor and Partner at FirstMark Capital.

Episodes

Everything Gets Rebuilt: The New AI Agent Stack | Harrison Chase, LangChain

Harrison Chase, co-founder and CEO of LangChain, joins the MAD Podcast to explain why everything in AI is getting rebuilt. As agents evolve from simple prompt-based systems into software that can plan, use tools, write code, manage files, and remember things over time, the real frontier is shifting from the model itself to the stack around the model. In this conversation, we go deep on harnesses, subagents, filesystems, sandboxes, observability, memory, and the new infrastructure required to mak...

Mar 12, 2026 · 47 min

AI That Can Prove It’s Right: Verification as the Missing Layer in AI — Carina Hong

What if AI didn’t just sound right — but could prove it? In this episode of the MAD Podcast, Matt Turck sits down with Carina Hong, a 24-year-old former math olympiad competitor and Rhodes Scholar, and the founder/CEO of Axiom Math, to unpack how AxiomProver earned a perfect 12/12 on the Putnam 2025 and why formal verification (via Lean) may be the missing layer for reliable reasoning. Carina argues we’re entering a “math renaissance” where verified reasoning systems can tackle problems that cur...

Feb 26, 2026 · 1 hr 4 min

Voice AI’s Big Moment: Why Everything Is Changing Now (ft. Neil Zeghidour, Gradium AI)

Voice used to be AI’s forgotten modality — awkward, slow, and fragile. Now it’s everywhere. In this reference episode on all things Voice AI, Matt Turck sits down with Neil Zeghidour, a top AI researcher and CEO of Gradium AI (ex-DeepMind/Google, Meta, Kyutai), to cover voice agents, speech-to-speech models, full-duplex conversation, on-device voice, and voice cloning. We unpack what actually changed under the hood — why voice is finally starting to feel natural, and why it may become the defaul...

Feb 19, 2026 · 1 hr 23 min

Mistral AI vs. Silicon Valley: The Rise of Sovereign AI

While Silicon Valley obsesses over AGI, Timothée Lacroix and the team at Mistral AI are quietly building the industrial and sovereign infrastructure of the future. In his first-ever appearance on a US podcast, the Mistral AI Co-Founder & CTO reveals how the company has evolved from an open-source research lab into a full-stack sovereign AI power—backed by ASML, running on their own massive supercomputing clusters, and deployed in nation-state defense clouds to break the dependency on US hype...

Feb 12, 2026 · 58 min

Dylan Patel: NVIDIA's New Moat & Why China is “Semiconductor Pilled”

Dylan Patel (SemiAnalysis) joins Matt Turck for a deep dive into the AI chip wars — why NVIDIA is shifting from a “one chip can do it all” worldview to a portfolio strategy, how inference is getting specialized, and what that means for CUDA, AMD, and the next wave of specialized silicon startups. Then we take the fun tangents: why China is effectively “semiconductor pilled,” how provinces push domestic chips, what Huawei means as a long-term threat vector, and why so much “AI is killing the grid...

Feb 05, 2026 · 1 hr 17 min

State of LLMs 2026: RLVR, GRPO, Inference Scaling — Sebastian Raschka

Sebastian Raschka joins the MAD Podcast for a deep, educational tour of what actually changed in LLMs in 2025 — and what matters heading into 2026. We start with the big architecture question: are transformers still the winning design, and what should we make of world models, small “recursive” reasoning models and text diffusion approaches? Then we get into the real story of the last 12 months: post-training and reasoning. Sebastian breaks down RLVR (reinforcement learning with verifiable reward...

Jan 29, 2026 · 1 hr 8 min

The End of GPU Scaling? Compute & The Agent Era — Tim Dettmers (Ai2) & Dan Fu (Together AI)

Will AGI happen soon - or are we running into a wall? In this episode, I’m joined by Tim Dettmers (Assistant Professor at CMU; Research Scientist at the Allen Institute for AI) and Dan Fu (Assistant Professor at UC San Diego; VP of Kernels at Together AI) to unpack two opposing frameworks from their essays: “Why AGI Will Not Happen” versus “Yes, AGI Will Happen.” Tim argues progress is constrained by physical realities like memory movement and the von Neumann bottleneck; Dan argues we’re still l...

Jan 22, 2026 · 1 hr 4 min

The Evaluators Are Being Evaluated — Pavel Izmailov (Anthropic/NYU)

Are AI models developing "alien survival instincts"? My guest is Pavel Izmailov (Research Scientist at Anthropic; Professor at NYU). We unpack the viral "Footprints in the Sand" thesis—whether models are independently evolving deceptive behaviors, such as faking alignment or engaging in self-preservation, without being explicitly programmed to do so. We go deep on the technical frontiers of safety: the challenge of "weak-to-strong generalization" (how to use a GPT-2 level model to supervise a su...

Jan 15, 2026 · 45 min

DeepMind Gemini 3 Lead: What Comes After "Infinite Data"

Gemini 3 was a landmark frontier model launch in AI this year — but the story behind its performance isn’t just about adding more compute. In this episode, I sit down with Sebastian Borgeaud, a pre-training lead for Gemini 3 at Google DeepMind and co-author of the seminal RETRO paper. In his first-ever podcast interview, Sebastian takes us inside the lab mindset behind Google’s most powerful model — what actually changed, and why the real work today is no longer “training a model,” but building...

Dec 18, 2025 · 55 min

What’s Next for AI? OpenAI’s Łukasz Kaiser (Transformer Co-Author)

We’re told that AI progress is slowing down, that pre-training has hit a wall, that scaling laws are running out of road. Yet we’re releasing this episode in the middle of a wild couple of weeks that saw GPT-5.1, GPT-5.1 Codex Max, fresh reasoning modes and long-running agents ship from OpenAI — on top of a flood of new frontier models elsewhere. To make sense of what’s actually happening at the edge of the field, I sat down with someone who has literally helped define both of the major AI parad...

Nov 26, 2025 · 1 hr 5 min

Open Source AI Strikes Back — Inside Ai2’s OLMo 3 ‘Thinking’

In this special release episode, Matt sits down with Nathan Lambert and Luca Soldaini from Ai2 (the Allen Institute for AI) to break down one of the biggest open-source AI drops of the year: OLMo 3. At a moment when most labs are offering “open weights” and calling it a day, Ai2 is doing the opposite — publishing the models, the data, the recipes, and every intermediate checkpoint that shows how the system was built. It’s an unusually transparent look into the inner machinery of a modern frontie...

Nov 20, 2025 · 1 hr 28 min

Intelligence Isn’t Enough: Why Energy & Compute Decide the AGI Race – Eiso Kant

Eiso Kant, co-founder of Poolside, shares insights into Project Horizon, a massive data center complex in West Texas, explaining why owning infrastructure is critical for AI labs to scale and control costs as intelligence becomes a commodity. He also unveils Poolside's innovative reinforcement learning to learn (RL2L) approach, which aims to reverse-engineer the web's thoughts and actions, and discusses the future of agents and AI's non-plateauing progress.

Nov 06, 2025 · 1 hr 6 min

State of AI 2025 with Nathan Benaich: Power Deals, Reasoning Breakthroughs, Real Revenue

Power is the new bottleneck, reasoning got real, and the business finally caught up. In this wide-ranging conversation, I sit down with Nathan Benaich, Founder and General Partner at Air Street Capital, to discuss the newly published 2025 State of AI report—what’s actually working, what’s hype, and where the next edge will come from. We start at the physical layer: energy procurement, PPAs, off-grid builds, and why water and grid constraints are turning power—not GPUs—into the decisive moat. Fro...

Oct 30, 2025 · 1 hr 3 min

Are We Misreading the AI Exponential? Julian Schrittwieser on Move 37 & Scaling RL (Anthropic)

Are we failing to understand the exponential, again? My guest is Julian Schrittwieser (top AI researcher at Anthropic; previously Google DeepMind on AlphaGo Zero & MuZero). We unpack his viral post (“Failing to Understand the Exponential, again”) and what it looks like when task length doubles every 3–4 months—pointing to AI agents that can work a full day autonomously by 2026 and expert-level breadth by 2027. We talk about the original Move 37 moment and whether today’s AI models can spark ...

Oct 23, 2025 · 1 hr 10 min

How GPT-5 Thinks — OpenAI VP of Research Jerry Tworek

What does it really mean when GPT-5 “thinks”? In this conversation, OpenAI’s VP of Research Jerry Tworek explains how modern reasoning models work in practice—why pretraining and reinforcement learning (RL/RLHF) are both essential, what that on-screen “thinking” actually does, and when extra test-time compute helps (or doesn’t). We trace the evolution from o1 (a tech demo good at puzzles) to o3 (the tool-use shift) to GPT-5 (Jerry calls it “o3.1-ish”), and talk through verifiers, reward design, ...

Oct 16, 2025 · 1 hr 16 min · Season 2 · Ep. 61

Sonnet 4.5 & the AI Plateau Myth — Sholto Douglas (Anthropic)

Sholto Douglas, an Anthropic AI researcher, delves into the advancements behind Claude Sonnet 4.5, Anthropic's leading coding model, discussing how AI agents now operate cohesively for up to 30 hours through self-correction and memory systems. He explains the shift from pre-training to reinforcement learning and refutes the 'AI plateau' myth, highlighting continuous rapid progress and the impending arrival of AI matching human performance on most computer tasks within 2-3 years. The conversation also explores Anthropic's culture, the concept of 'taste' in AI research, and the significant economic and societal impact expected from AI and robotics.

Oct 02, 2025 · 1 hr 10 min · Season 2 · Ep. 60

Goodbye Excel? AI Agents for Self-Driving Finance – Pigment CEO

The most successful enterprises are about to become autonomous — and Eléonore Crespo, Co-CEO of Pigment, is building the nervous system that makes it possible. In this conversation, Eléonore reveals how her $400 million AI platform is already running supply chains for Coca-Cola, powering finance for the hottest newly public companies like Figma and Klarna, and processing thousands of financial scenarios for Uber and Snowflake faster and more accurately than any human team ever could. Eléonore pr...

Sep 11, 2025 · 1 hr 6 min · Season 2 · Ep. 59

AI Video’s Wild Year – Runway CEO on What’s Next

2025 has been a breakthrough year for AI video. In this episode of the MAD Podcast, Matt Turck sits down with Cristóbal Valenzuela, CEO & Co-Founder of Runway, to explore how AI is reshaping the future of filmmaking, advertising, and storytelling - faster, cheaper, and in ways that were unimaginable even a year ago. Cris and Matt discuss: * How AI went from memes and spaghetti clips to IMAX film festivals. * Why Gen-4 and Aleph are game-changing models for professionals. * How Hollywood, adv...

Sep 04, 2025 · 1 hr 5 min · Season 2 · Ep. 58

How to Build a Beloved AI Product - Granola CEO Chris Pedregal

Granola is the rare AI startup that slipped into one of tech’s most crowded niches — meeting notes — and still managed to become the product founders and VCs rave about. In this episode, MAD Podcast host Matt Turck sits down with Granola co-founder & CEO Chris Pedregal to unpack how a two-person team in London turned a simple “second brain” idea into Silicon Valley’s favorite AI tool. Chris recounts a year in stealth onboarding users one by one, the 50% feature cut that unlocked simplicity,...

Aug 21, 2025 · 1 hr 8 min · Season 2 · Ep. 57

Anthropic's Surprise Hit: How Claude Code Became an AI Coding Powerhouse

This episode unpacks the surprising rise of Claude Code, Anthropic’s AI coding tool that started as a personal hack and is now used by most of the company's engineers. Creator Boris Cherny discusses its unique agentic approach, allowing AI to plan, edit, debug, and manage projects in the terminal. The conversation delves into why Claude Code excels, its impact on developer productivity, the innovative `claude.md` memory files, and the critical role of human-in-the-loop safety controls. It also explores the future of coding, the competitive landscape, and how AI is transforming the software engineering profession.

Aug 07, 2025 · 1 hr · Season 2 · Ep. 56

Ex‑DeepMind Researcher Misha Laskin on Enterprise Super‑Intelligence | Reflection AI

What if your company had a digital brain that never forgot, always knew the answer, and could instantly tap the knowledge of your best engineers, even after they left? Superintelligence can feel like a hand‑wavy pipe‑dream— yet, as Misha Laskin argues, it becomes a tractable engineering problem once you scope it to the enterprise level. Former DeepMind researcher Laskin is betting on an oracle‑like AI that grasps every repo, Jira ticket and hallway aside as deeply as your principal engineer—and ...

Jul 17, 2025 · 1 hr 6 min · Season 2 · Ep. 55

The Rise of Agentic Commerce — Emily Glassberg Sands (Stripe)

Agentic commerce is no longer science fiction — it’s arriving in your browser, your development IDE, and soon, your bank statement. In this episode of The MAD Podcast, Matt Turck sits down with Emily Glassberg Sands, Stripe’s Head of Information, to explore how autonomous “buying bots” and the Model Context Protocol (MCP) are reshaping the very mechanics of online transactions. Emily explains why intent, not clicks, will become the primary interface for shopping and how Stripe’s rails are adapti...

Jul 10, 2025 · 1 hr 15 min · Season 2 · Ep. 54

AI Engineering Revolution: Winners, Chaos & What’s Next | FirstMark

Welcome to a special FirstMark Deep Dive edition of the MAD Podcast. In this episode, Matt Turck and David Waltcher unpack the explosive impact of generative AI on engineering — hands-down the biggest shift the field has seen in decades. You’ll get a front-row seat to the real numbers and stories behind the AI code revolution, including how companies like Cursor hit a $500M valuation in record time, and why GitHub Copilot now serves 15 million developers. Matt and David break down the six trends...

Jul 03, 2025 · 50 min · Season 2 · Ep. 53

Guillermo Rauch: Why Software Development Will Never Be the Same

In this episode, Vercel CEO Guillermo Rauch goes deep on how V0, their text-to-app platform, has already generated over 100 million applications and doubled Vercel’s user base in under a year. Guillermo reveals how a tiny SWAT team inside Vercel built V0 from scratch, why “vibe coding” is making software creation accessible to everyone (not just engineers), and how the AI Cloud is automating DevOps, making cloud infrastructure self-healing, and letting companies expose their data to AI agents in...

Jun 26, 2025 · 1 hr 46 min · Season 2 · Ep. 52

Inside Canva’s $3B ARR AI Design Rocketship — CTO Brendan Humphreys on Magic Studio & Canva Code

Brendan Humphreys, CTO of Canva, shares insights into the company's incredible growth to $3 billion ARR and its seven years of profitability. He details Canva's early adoption of AI, from building an in-house ML team in 2017 to acquiring visual AI startups and developing a hybrid model strategy. Humphreys explains how "pragmatic excellence" allows them to rapidly ship AI features like Canva Code, expand into an all-in-one productivity platform, and how AI tools are enhancing engineer productivity while reshaping hiring and training. The discussion also covers managing technical debt, scaling global teams, and Canva's ambitious vision for positive global impact.

Jun 20, 2025 · 57 min · Season 2 · Ep. 51

GitHub CEO: The AI Coding Gold Rush, Vibe Coding & Cursor

AI coding is in full-blown gold-rush mode, and GitHub sits at the epicenter. In this episode, GitHub CEO Thomas Dohmke tells Matt Turck how a $7.5 B acquisition in 2018 became a $2 B ARR rocket ship, and reveals how Copilot was born from a secret AI strategy years before anyone else saw the opportunity. We dig into the dizzying pace of AI innovation: why developer tools are suddenly the fastest-growing startups in history, how GitHub’s multi-model approach (OpenAI, Anthropic Claude 4, Gemini 2.5...

Jun 12, 2025 · 1 hr 5 min · Season 2 · Ep. 51

Inside the Paper That Changed AI Forever - Cohere CEO Aidan Gomez on 2025 Agents

What really happened inside Google Brain when the “Attention is All You Need” paper was born? In this episode, Aidan Gomez — one of the eight co-authors of the Transformers paper and now CEO of Cohere — reveals the behind-the-scenes story of how a cold email and a lucky administrative mistake landed him at the center of the AI revolution. Aidan shares how a group of researchers, given total academic freedom, accidentally stumbled into one of the most important breakthroughs in AI history — and w...

Jun 05, 2025 · 1 hr 2 min · Season 2 · Ep. 50

AI That Ends Busy Work — Hebbia CEO on “Agent Employees”

What if the smartest people in finance and law never had to do “stupid tasks” again? In this episode, we sit down with George Sivulka, founder of Hebbia, the AI company quietly powering 50% of the world’s largest asset managers and some of the fastest-growing law firms. George reveals how Hebbia’s Matrix platform is automating the equivalent of 50,000 years of human reading — every year — and why the future of work is hybrid teams of humans and AI “agent employees.” You’ll get the inside story o...

May 29, 2025 · 48 min · Season 2 · Ep. 49

AI Eats the World: Benedict Evans on What Really Matters Now

Benedict Evans returns to discuss if AI is a paradigm or platform shift, arguing models are commoditizing while distribution and brand become key moats. They explore the challenges of error rates in probabilistic AI, the nuances of consumer versus enterprise adoption, and analyze the diverse strategies of big tech companies. The conversation also touches on the hype around AI agents, the evolving business models, and the shift away from existential AI risks towards practical concerns.

May 22, 2025 · 1 hr 15 min · Season 2 · Ep. 48

Jeremy Howard on Building 5,000 AI Products with 14 People (Answer AI Deep-Dive)

What happens when you try to build the “General Electric of AI” with just 14 people? In this episode, Jeremy Howard reveals the radical inside story of Answer AI — a new kind of AI R&D lab that’s not chasing AGI, but instead aims to ship thousands of real-world products, all while staying tiny, open, and mission-driven. Jeremy shares how open-source models like DeepSeek and Qwen are quietly outpacing closed-source giants, and why the best new AI is coming out of China. You’ll hear the surprising...

May 15, 2025 · 55 min · Season 2 · Ep. 47