Logan Kilpatrick and Tulsee Doshi of Google DeepMind join for a first-ever in-person episode recorded just days before Google I/O, covering headline launches like Gemini 3.5 Flash, the Omni video generation model, and the new Gemini Spark agentic product. The conversation digs into Google's strategic decision to lead with cost-adjusted efficiency over raw capability, how DeepMind now ships a full agent harness rather than bare models, and technical questions around context window limits and know...
May 20, 2026•59 min
Andrew Lee, CEO of Tasklet, returns for his fourth appearance to share how his team has once again rewritten their entire agent stack, now emphasizing file system context, agentic search, and multi-resolution summarization. The conversation digs into the strategic tension of competing with your own supplier, as Anthropic's Claude Max accounts offer direct customers far more tokens than API partners get at the same price. Andrew also lays out his framework for the only three types of software com...
May 15, 2026•1 hr 33 min
Diarmuid Gill and Liva Ralaivola of Criteo join Nathan Labenz to unpack how modern ad tech works, from millisecond-speed recommendation systems and real-time bidding to the role of deep learning, embeddings, and foundation models. They discuss why personalized advertising helps fund the open internet, how privacy and opt-out choices fit in, and what Criteo’s new partnership with OpenAI could mean for product discovery. The conversation also covers European AI talent, research publishing, and the ...
May 09, 2026•1 hr 27 min
Laura Burkhauser, CEO of Descript, explains how the company is navigating the tension between powerful AI tools and creator backlash against “slop.” She shares how Descript chooses which models to use, why reliability and multimodal understanding matter, and how the team balances frontier models with in-house task-specific systems. The conversation also covers Underlord, agentic video editing, API design for coding agents, and what AI means for the future of creative work. LINKS: Laura Burkhause...
May 06, 2026•1 hr 23 min
Kyle Corbitt, founder of OpenPipe, breaks down reinforcement learning and custom fine-tuning for modern AI models. He explains how RL differs from supervised fine-tuning, why GRPO and LLM-as-judge post-training matter, and how these techniques can improve performance, latency, and cost on open source models. The conversation also covers reward hacking, evaluation design, LoRA adapters, and how Chinese labs are using distillation to fast-follow frontier models. Sponsors: Sequence: Sequence handle...
May 01, 2026•1 hr 47 min
This edition of AI in the AM features Anna Patterson on Ceramic.ai’s pivot to low-cost enterprise search for LLMs, designed to combine public and private data with stronger fact-checking. Lukas Petersson returns with new Andon Labs results on Opus 4.7 and GPT-5.5, including surprising differences in performance, behavior, and “ruthless” tactics. Zvi Mowshowitz unpacks model welfare and how to interpret troubling model behavior, while Naveen Verma explains EnCharge AI’s analog in-memory computing...
Apr 26, 2026•2 hr 38 min
Cameron Berg returns to discuss the latest research on AI consciousness and model welfare. He breaks down new evidence for model introspection, including studies showing that systems can detect interventions on their own internal states and sometimes resist them. They also examine Anthropic's work on functional emotions, the implications of Claude's welfare reports, and Berg's new ideas about how reinforcement learning may shape positive and negative experience. The conversation makes the case f...
Apr 23, 2026•3 hr 34 min
Steve Newman, creator of Writely and founder of the Golden Gate Institute for AI, shares the personal AI toolkit and vibe-coding practices that have reshaped how he works. He walks through bespoke tools including an attention firewall, a reading app for surfacing new ideas, a coding-agent dashboard, workflow automations, and a universal logging system for debugging with Claude. They also discuss information security, mobile and voice workflows, Steve’s “anti-tokenmaxxing” philosophy, and his vie...
Apr 19, 2026•2 hr 10 min
This special AI in the AM episode features Sergiy Nesterenko of Quilter on using reinforcement learning for circuit board design, Andy Hall of Stanford on AI behavior in politics and new governance models, and Lukas Petersson and Axel Backlund of Andon Labs on their AI-run retail store in San Francisco. Nathan and Prakash also reflect on the pace of AI progress, the public reaction to existential risk, and why constructive civic action matters as AI systems grow more powerful and autonomous. Spon...
Apr 15, 2026•2 hr 31 min
This cross-post from the 80,000 Hours podcast features Ajeya Cotra in conversation with Rob Wiblin about AI timelines, recursive self-improvement, and the “crunch time” window when AI could rapidly accelerate its own development. Ajeya explains why widespread, compounding automation may face fewer bottlenecks than many expect, and what that could mean for the world by 2050. They also discuss transparency, early warning systems, and the emerging strategy of using each generation of AI to align an...
Apr 11, 2026•3 hr 10 min
Sam Stephenson, co-founder of Granola, explains how a deliberately minimalist design philosophy helped turn the AI note-taking app into one of the fastest-growing products in the market. He shares why Granola focuses on doing one job exceptionally well, how note sharing drives growth, and what they’ve learned from surprising use cases, recipes, and constant user research. The conversation also covers privacy and consent, transcription and cost choices, team collaboration, and Sam’s hopes for AI ...
Apr 08, 2026•1 hr 34 min
Joseph Nelson, CEO of Roboflow, breaks down the current state of computer vision and why it still lags behind language models in real-world understanding, latency, and deployment. He explains how Roboflow distills frontier vision capabilities into efficient, task-specific models using techniques like Neural Architecture Search and RF-DETR. The conversation covers Chinese leadership in vision, Meta and NVIDIA’s roles in the ecosystem, coding agents, and emerging S-curves from world models to wear...
Apr 04, 2026•1 hr 56 min
This special cross-post from The Intelligence Horizon features Nathan Labenz in a wide-ranging conversation on compressed AI timelines, expert disagreement, and why he believes the singularity is near. They discuss interpretability, RL scaling, and the balance between extraordinary upside, like curing major diseases, and serious existential risks. Nathan explains his evolving p(doom), why he’s slightly more optimistic about robustly good AI, and how defense-in-depth strategies might keep society...
Apr 01, 2026•1 hr 45 min
Vijoy Pandey of Outshift by Cisco lays out his vision for an “Internet of Cognition,” where AI agents can share context, build reputation, and collaborate safely at scale. He offers a useful mental model for superintelligence: progress has to scale in two directions — up, through better individual models, and out, through networks of agents and humans thinking together. The conversation explores how distributed, protocol-driven agent systems could give enterprises fine-grained permissions, audit...
Mar 25, 2026•1 hr 36 min
Karan Vaidya, CTO of Composio, explains how their “smart tool” platform lets AI agents access over 50,000 tools across 1,000+ apps through a single interface. He details how Composio handles tool discovery, authentication, sandboxes, and logging, and how an AI-powered feedback loop continuously improves tools in real time. The conversation explores avoiding model lock-in through robust skills and instructions, translating capabilities across model providers, and why the best agent use cases look...
Mar 22, 2026•1 hr 39 min
Zvi Mowshowitz returns to survey the current AI landscape, from recursive self-improvement and the shift from the “beginning” to the “middle” of the AI story to what a true AI end-game would look like. He and Nathan dig into AI-driven job loss, real-world productivity impacts, and the ethics of trying to escape a “permanent underclass.” They assess today’s AI live players, why Anthropic may be slightly ahead, and whether Chinese labs, xAI, or Meta can catch up. The conversation closes with Anthropic’s ...
Mar 19, 2026•3 hr 27 min
This special AI Scouting Report episode from the Law & Artificial Intelligence Certificate Program surveys the current AI landscape for legal professionals. Nathan Labenz walks through the “Good, Bad, and Weird” of frontier models, from using AI to navigate his son’s cancer treatment to emerging forms of deception and reward hacking. He highlights how new systems are pushing the boundaries of math, physics, and legal performance while raising serious safety and governance questions. Listener...
Mar 16, 2026•1 hr 17 min
Jassi Pannu, Assistant Professor at Johns Hopkins, explains how rapidly advancing AI is transforming biological research and raising the risk of engineered pandemics. They map today’s biosecurity landscape, from pathogen detection and DNA sequencing to vaccine development, and examine how frontier models can already troubleshoot lab work and bypass data safeguards. The conversation introduces a proposed Biosecurity Data Level framework to restrict only the most dangerous functional biological da...
Mar 11, 2026•1 hr 43 min
Jesse Genet shares how she built a team of AI agents to transform homeschooling, family life, and personal productivity without a software background. She explains how agents like an AI chief of staff, curriculum planner, and content creator help design personalized lessons, analyze kids’ learning, manage educational toys, and even run TikTok. The conversation covers practical delegation workflows, guardrails and trust, and why she treats AIs like employees with onboarding and clear roles. Jesse...
Mar 08, 2026•2 hr 6 min
Dan Balsam and Tom McGrath from Goodfire return to explore the frontier of mechanistic interpretability and their new research pillar, Intentional Design. They explain the shift from sparse autoencoders to understanding geometric structure in latent spaces, and share a proof-of-concept method for reducing hallucinations using probes and RL. The conversation tackles concerns about reward hacking, principles for shaping the loss landscape instead of fighting backprop, and what this means for align...
Mar 05, 2026•1 hr 47 min
Geoffrey Irving, Chief Scientist at the UK AI Security Institute, explains why our theoretical understanding of machine learning remains fragile even as models surpass experts on critical security tasks. He details AISI’s work on frontier model evaluations, red teaming, and threat modeling across biosecurity, cybersecurity, and loss-of-control risks. The conversation explores reward hacking, eval awareness, and why current safety techniques may struggle to deliver high reliability. Listeners wil...
Mar 01, 2026•2 hr 19 min
Karan Singhal, Head of Health AI at OpenAI, explains how ChatGPT Health is achieving attending-physician-level performance and already serving hundreds of millions of users. He details how OpenAI works with over 250 doctors, built the 49,000-criteria HealthBench evaluation, and ran one of the first randomized trials of AI copilots in clinical care. The conversation explores privacy and safety safeguards, medical multimodality, N-of-1 treatment plans, and how AI could become a standard part of gl...
Feb 25, 2026•2 hr 1 min
Olive Song from MiniMax shares how her team trains the M series frontier open-weight models using reinforcement learning, tight product feedback loops, and systematic environment perturbations. This crossover episode weaves together her AI Engineer Conference talk and an in-depth interview from the Inference podcast. Listeners will learn about interleaved thinking for long-horizon agentic tasks, fighting reward hacking, and why they moved RL training to FP32 precision. Olive also offers a candid...
Feb 22, 2026•55 min
Vlad Tenev and Tudor Achim from Harmonic explain how they built Aristotle, an AI system that reaches International Mathematical Olympiad gold-medal performance using formally verified Lean proofs. They unpack the architecture behind mathematical superintelligence, including Monte Carlo Tree Search, lemma guessing, and specialized geometry modules. The conversation explores how verifiable reasoning could harden mission-critical software, reshape mathematical practice, and lead to trustworthy supe...
Feb 18, 2026•1 hr 31 min
Abhi Mahajan (@owlposting) explains how AI is reshaping biology and medicine, including foundation models to predict cancer treatment response and why he’s both skeptical and optimistic about current results. Helen Toner unpacks CSET’s “When AI Builds AI” report and why automated AI R&D is a major source of strategic surprise. Jeremie Harris then explores our lack of control over superhuman AI systems, fragile US–China coordination, and how to maintain situational awareness in a rapidly shif...
Feb 14, 2026•2 hr 23 min
Part 1 of this live special dives into AI for Science, U.S. AI policy, and the behavior of AI agents in open-ended environments. James Zou explains how interpretability and virtual labs of AI agents can accelerate scientific discovery. Sam Hammond assesses the Trump administration’s AI policy, U.S.–Gulf AI deals, and the odds current AIs are conscious. Shoshannah Tekofsky shares insights from studying agent performance and emergent behavior in the AI Village. Nathan uses Granola to uncover blind...
Feb 13, 2026•1 hr 32 min
Alexis Carlier, founder of Asymmetric Security, explains how assuming AGI-level intelligent labor should transform cybersecurity from reactive triage to proactive, continuous digital forensics. He breaks down today’s threat landscape—from “spray and pray” cybercrime to nation-state IP theft and North Korean “remote workers.” The conversation explores Asymmetric’s AI agents for deep investigations, their services-first approach to business email compromise, and how specialized digital forensics m...
Feb 08, 2026•1 hr 16 min
Blitzy founders Brian and Sid break down how their “infinite code context” system lets AI autonomously complete over 80% of major enterprise software projects in days. They dive into their dynamic agent architecture, how they choose and cross-check different models, and why they prioritize advances in AI memory over fine-tuning. The conversation also covers their 20¢/line pricing model, the path to 99%+ autonomous project completion, and what this all means for the future software engineering jo...
Feb 05, 2026•1 hr 57 min
This crossover episode from the Latent Space podcast features Mark Zuckerberg and Priscilla Chan on the 10-year anniversary of the Chan Zuckerberg Initiative and their expanded Biohub vision. They discuss how a “Frontier Biology Lab” working in sync with a “Frontier AI Lab” could enable breakthroughs like a Virtual Cell and true N-of-1 precision medicine. The conversation covers the acquisition of EvolutionaryScale and ESM3, new biological data collection at scale, and how AI-powered biology mi...
Feb 01, 2026•1 hr 2 min
Kevin Frazier and Alan Rozenshtein explore how AI is reshaping the legal profession, from “secret cyborg” lawyers using tools like Harvey to the uncertain future of junior associates and access to legal services. They discuss maximalist legal services, AI-written “complete contingent contracts,” and where AI should fall between strict formalism and legal realism, including Claude’s virtue-ethics-inspired constitution. The conversation then turns to AI’s role in legislation and governance, includ...
Jan 29, 2026•1 hr 37 min