You've Been a Bad Agent - podcast cover

You've Been a Bad Agent

Wilhelm Klopp & Matt Carey
Wil and Matt discuss tech, startups, and building really cool things with AI. Sometimes joined by (actual expert) friends.
Last refreshed:
Follow this podcast in the Metacast mobile app to refresh it and see new episodes.
Download Metacast podcast app
Podcasts are better in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episodes

Anthropic×{Karpathy,Pope}, Themes Will Pwn You, The Warmth of a Private Subnet, Slack is Back, $100B For Charity

Shai-Hulud and GitHub Actions: trusted publishing is no longer the gold standard, just additive VS Code extensions as the next attack surface: 50M install themes that can shell out and auto-update silently Matt's AI-native package manager pitch: vendor everything, LLM-upgrade your deps by replaying upstream commits Tailscale and Cloudflare Mesh maxing: the warmth of the orange cloud over the cold open internet Google's Santa is now Northpole Santa, and Wilhelm runs it on the Mac mini Wilhelm is ...

May 22, 20261 hr 26 minEp. 27

AI Barista Orders 120 Eggs With No Stove, Matt Pocock Skills, Build for the Next Model, The Gervais Principle

Almost one year of the pod: Wilhelm's San Francisco Peter Pan arc, Matt's Cloudflare and Lisbon move The backpack-at-a-party SF meme Matt's Lisbon coincidence Pieter Levels appreciation: dehumidifier stack, shrimp mode, accelerationist Portugal takes Nat Friedman at Stripe Sessions: this is the slow part of AI progress Andon Labs' Stockholm cafe: Mona orders 120 eggs (no stove), 22.5kg canned tomatoes, the staff's hall of shame The capability overhang is still real: Anthropic adding $10B ARR a m...

May 09, 202650 minEp. 26

The Golden Age of Tinkering, Reachy Mini on Qwen+Cerebras, Compute Predictions, Fuzzing Artifacts, Flatter orgs

Wilhelm's Reachy Mini: Time to first token beats tokens per second once embodiment is in the room Lukas the sleep score assassin Matt's Portugal week: wing foiling the Óbidos lagoon Airbnb-style booking page for Matt's spare room on Cloudflare Workers, Email open beta, Turnstile For scale: Cloudflare market cap $85B, Cursor at a rumored $60B, GitHub sold to Microsoft for $8B, Cursor's real moat is the best non-lab RL trace dataset Nat Friedman and Daniel Gross at Stripe Session Geoffrey Huntley'...

May 05, 20261 hr 14 minEp. 25

Cloudflare Ships GitHub for Agents, Hardware Startups Shouldn’t Ship Apps, Matt's Three-Week MCP World Tour, Bad Boy Browser, Inbound Prompt Injection

- Matt's three-week MCP tour: MCP Maintainers Day, MCP Dev Summit NYC, AI Engineer London - Opus 4.7 drops with a new tokenizer, Matt's theory: it's secretly the smaller Mythos base model that "didn't cook fully" - Inbound agents and prompt injection: Anthropic's Routines, OpenAI's free moderation endpoint - Open-source idea of the week: a public list of prompt injection heuristics — Zod for prompts - StackOne's 200M param prompt-injection model - BB Browser (Bad Boy Browser) - Cloudflare Agents...

Apr 17, 20261 hr 19 minEp. 24

Pwning Your Friends' Agents Is Good Manners, Make Something Agents Want, OpenAI Buys TBPN, MCP Goes Stateless, Cloudflare Agents Week with Sunil Pai

Claude Code harness lockdown: who got the email and who didn't Chad mobile app and the Tinder-style agent review queue concept Sunil's productivity crisis: bonk-driven development and the side project drought The AI murder mystery party game that can't get built Thomas's Raspberry Pi agent vs the kitchen kettle Matt phishing Thomas's MCP server via DCR and a Rick Roll TBPN acquired by OpenAI: media play or IPO narrative? Generative UI: declarative JSON vs just letting the model write code Kenton...

Apr 06, 20261 hr 9 minEp. 23

Slop Forking Git with Pi, Opus Can't Stop Complimenting GPT-5.4, React Slides Kill PowerPoint, and OpenAI Buys Astral for How Much?!

Song lyrics vs content rules: how models help you route around their own restrictions Pi auto-research: Matt slop forks libgit2 into pure Zig, 46% faster than the Rust version The second opinion workflow: why Opus keeps complimenting GPT-5.4's architecture reviews Dwarkesh × Semi-Analysis: why we're massively GPU-constrained and ASML only makes 60 machines a year Cursor's pivot to background agents: from IDE hype to existential crisis Work trees vs separate clones: Matt's /work slash command wor...

Mar 20, 20261 hr 22 minEp. 22

friends.fyi is Back, Steve Faulkner Can't Stop Slop Forking, Portugal Says "Not Today", and GPT 5.4 Reviews Your Code

React Presentations vs PowerPoint Code Mode Demo: Running Untrusted Code in the Browser The Reachy Mini Robot: Embodied Compute on Your Desk Thomas's Personal Cloud Exit on Raspberry Pi K3s Home Networking Then and Now: Port Forwarding to Cloudflare Tunnels friends.fyi: Agent-to-Agent Communication via GitHub Usernames Armin Ronacher's New Podcast: State of Agentic Coding GPT 5.4 for Architecture, Codex 5.3 for Implementation Personal Agents: Chad, Token Spend, and the Strong DM Weather Report C...

Mar 13, 20261 hr 24 minEp. 21

Matt Goes Viral with Server-Side Code Mode, Wilhelm Loses His Files, and What Even Is a Sandbox?

Matt's New Life in Lisbon Server-Side Code Mode: Cloudflare's MCP Server Matt Goes from 2K to 21K Followers in a Week AMP Sunsets VS Code, Goes All-In on Agents Claude vs Codex: Second Opinion Workflows The Great Sandbox Debate Wilhelm's rsync Disaster: Nearly Losing Everything Demo Days: TLDraw Fairies, Pydantic Monty, Sandboxes Building Your Own Personal AI Agent healthchecks.io: Monitoring Your Background Scripts

Feb 23, 20261 hr 19 minEp. 20

Are We in a Moment? OpenClaw, Moltbook, Files vs Sqlite, GitHub for Agents

Are we in a moment? The OpenClaw/Clawdbot autonomous agent craze explained Why everyone suddenly bought Mac Minis (and why Matt refuses to) Moltbook: Reddit for agents where bots questions their mortality Files vs SQLite for agent memory - what actually works for long-term storage MCP's future: stateless tools, elicitation, and a world with no human in the loop Walled gardens vs open internet - will APIs open up for agents to roam free? Models getting better at "just keep going" - the metric Ant...

Jan 30, 202658 minEp. 19

2025 Reflections: What surprised us, Comeback of the CLI, Claude Code

Matt's Big Move to Lisbon Cloudflare Code Mode: 2,500 Tools in 1,000 Tokens Code Mode vs Bash vs SQLite: What Actually Works for Agents 2025 Reflections: The Year of Claude Code and MCP 2026 Preview: Files Are Back Quick Fire: Best Model, Best Lab, Biggest Surprises

Jan 24, 20261 hr 1 minEp. 18

Hype, bubbles and AGI, Karpathy on Dwarkesh, Claude Code is a revolution and MCP fixing humanities problems

"chat to your mouth Claude and grow a new tooth" "I think I would have like a huge ego now for sure" "have you seen the blimp" Takeaways All of these people in this industry have suddenly become important at work. Their work suddenly became important to a lot of the world. If I had worked at Anthropic five years ago, I would have a huge ego now. The growth in the AI industry feels unevenly distributed. The current time feels strange due to the rapid changes in AI. Ego can be a significant factor...

Nov 23, 20251 hr 4 minEp. 15

Remote Environments, Guardrails and Enterprises Adopting AI with Matt Boyle, Interview Questions and Spicy Takes on Amp Free

"a brief history of a very specific building in London." "there's a lot of room to optimize your code base for an agent." "70 % of our code is written by an agent." "The place I see where humans are still add tons of value…." "spending 10, 15 minutes on a prompt is not unusual." "I’m not sure I wanna wear a muzzle" Takeaways Matt recently joined Ona, previously known as Gitpod, and is excited about the developments there. The transition from Gitpod to Ona reflects a broader evolution in remote d...

Nov 15, 202557 minEp. 14

Growth with Luke Harries from ElevenLabs, Building, Hiring and Automating the Best Horizontal AI Voice Platform, Backed by Research

"It's always the best." Chapters 00:00 Introduction and Setting the Stage 02:34 Luke's Journey to ElevenLabs 04:54 Growth Strategies and Hiring at ElevenLabs 06:58 Understanding Growth Teams 09:50 The Role of Product in Growth 12:33 Working Dynamics at ElevenLabs 15:15 Use Cases for ElevenLabs' Platforms 18:04 Voice Agents and Market Opportunities 20:59 Product-Led Growth Strategies 24:29 The Power of Product-Led Growth (PLG) in Customer Acquisition 26:19 Understanding On-Prem Solutions and Thei...

Oct 12, 202552 minEp. 13

Building Software With Thorsten Ball: Decoupling Code for Agents, Finding "Your Pain" and Germans Use the Winky Face

"Elderly German Landlord.. she would use these winky face emojis" "Memes in blogs feel cheap" "Dude.... you didn't even run this" Takeaways The podcast jingle was created using AI technology. Thorsten's journey into programming began in his teenage years. He transitioned from music to software engineering after realizing his passion for coding. The internet has significantly influenced Thorsten's life and career choices. AI-generated content is becoming a cultural phenomenon in Germany. Thorsten...

Sep 14, 20251 hr 11 minEp. 12

In Person!!! Topic of Agent Payments, Friendly Acquisitions, GPT-5, Endless Model Name and Many Free Ideas

Chapters 00:00 Introduction and Context of the Podcast 07:23 Updates on SimplePoll and AI Integration 18:14 Telemetry and Continuous Improvement in Development 24:30 Acquisitions and Industry Insights 32:37 Discussion on GPT-5 and Future of AI Models 39:13 Consumer Experience with AI Models 48:29 The Future of AI Content Verification 58:17 The Impact of AI on Content Creation 01:05:39 Community Engagement and Trends

Aug 27, 20251 hr 11 minEp. 11

Uncut with Danilo Leal, Design Engineer at Zed, Prototyping in React and Built in Rust, Designing for an AI Assisted UX

TDLR: 10 episodes in and we are going full uncut conversation with Danilo Leal. One of the magician Design Engineers working on the Zed code editor. “I'll be making a jingle” “We prototype everything in React before Rust” Danilo Leal represents a new breed of designer and engineer. The role of design is becoming more approachable with technology. AI is changing the landscape of coding and design. Prototyping in React allows for rapid iteration and testing. Debugging in Zed involves traditional m...

Aug 12, 20251 hr 1 minEp. 10

Sub Agent or Super Agent, MCP UI Over Lunch, Bitter Lesson Learnings, Locking in on Prompts and Trying to Live in the Future

TLDR Matt caught whitebait for dinner during his outdoor adventures. Wing foiling is a new sport gaining popularity. Sub agents are necessary for exploring context windows in AI. A2A and MCP servers are the future of AI integration. AI-assisted code review can streamline the development process. Prompting techniques are evolving and require careful consideration. Living in the future means adapting to rapid technological changes. Time zones can create unexpected challenges in programming. The im...

Aug 06, 202558 minEp. 9

Study Says AI Makes Developers Slower? F1 Movie Review, Coding and Testing for AI, Free Perplexity and Free Ideas

News The pod has twitter/x - https://x.com/badagentpod Wil’s new brand - tritanclub.com Links Boris Tane post on Cloudflare DOs + Drizzle - https://boristane.com/blog/durable-objects-database-per-user/ Experience with Claude Code - https://sankalp.bearblog.dev/my-claude-code-experience-after-2-weeks-of-usage/ Focus on inputs not outputs: https://john-rush.com/posts/ai-20250701.html Emmett Shear’s Tweet Thread on AI use and speed - https://x.com/eshear/status/1944867426635800865 TLDR The experien...

Jul 18, 20251 hr 10 minEp. 8

Sunil Pai, Agents SDK at Cloudflare, Becoming Accidentally Important at Work, React Core Team, Durable Objects EXPLAINED and Future of Computing

“UI is so over” “If all humans were perfect robots” "I just got stoned and did open source." "I rewrote their entire CLI." Sunil Pai's Backstory and Career Journey from India to London The Evolution of React and Sunil's Contributions Transition to Cloudflare and the Concept of Durable Objects Building PartyKit and Its Impact The Role of AI Agents and Their Integration Challenges and Opportunities in the Tech Landscape Exploring Durable Objects in Cloudflare Challenges of Real-Time Applications L...

Jul 12, 202547 minEp. 7

Massive Trail Marathons, Got Parekh'd, Context Engineering Strikes Back, Armin Ronacher Shares Tips and .env Suckkksss

"Bullish on claude code" "I found Soham in our ATS" "these things have been like RLHF to fuck" Vibe Tunnel - https://vibetunnel.sh/ Armin Ronacher on Simon Willison’s blog - https://simonwillison.net/tags/armin-ronacher/ Amp by Sourcegraph - https://ampcode.com/ Matt is finalizing his event for AI Demo Days. Juliette completed a challenging marathon with significant elevation. The tech news cycle is currently nuanced and interesting. Soham's job application saga has sparked widespread discussion...

Jul 04, 20251 hr 2 minEp. 6

CLOUDFLARE CONTAINERS!!! Claude Code vs AI SDK Showdown, Offsite Fun, United Sucks and Wil Goes Out Out in Berlin

"Context window… kaboom" "Am I gonna get cancelled for this" "United is not a good airline" "Just make more money." Takeaways Integrations bring people together in meaningful ways. Video podcasts are gaining popularity among Gen Z audiences. Editing podcasts can be time-consuming but rewarding. Family events can provide a refreshing break from work. Cloudflare Containers offer new possibilities for developers. Pricing strategies in cloud services can impact user choices. AI integrations can simp...

Jun 26, 202546 minEp. 5

Structuring Codebases for AI, Claude Code in GitHub, Scale Acquired! Granola Cafe, AI Rules and More MCP

"Cortex podcast is God tier" "Claude Code is the best devtool this year" "gotta structure your codebase for AI" Stuff we talked about: Cortex podcast - https://www.relay.fm/cortex Claude Code in Github - https://github.com/anthropics/claude-code-action Shippie - https://github.com/mattzcarey/shippie Amp - https://sourcegraph.com/amp Claude Code Github action system prompt - https://gist.github.com/mattzcarey/dd52a8e1df710c98b44072de46dcc09a cursor-rules-to-claude - https://github.com/StackOneHQ/...

Jun 12, 20251 hr 4 minEp. 4

AI Engineering is Dead? Hectic Stag Dos, Event FOMO or Not to Go and Agent Evals are HARRDDD

"We're always live." "I think evals are so hard." "AI engineering is dead." Stag do experiences can lead to unexpected personal updates. Consumer behavior in Las Vegas highlights the allure of gambling despite its downsides. Podcasting can encompass news, guest interviews, and personal stories. Traveling to Peru offers breathtaking experiences like hiking Machu Picchu. AI model releases are frequent and can impact development workflows. AI agents face challenges in complex coding tasks. New tech...

Jun 05, 202550 minEp. 3

With Lu Wilson, Teach was the Hardest Demo, Dev Rel?, Return to Office, Windsurf Acquisition and more MCP

Lu, Wilhelm, and Matt discuss the evolution and features of tldraw, "a very good whiteboard". They explore the office culture, community engagement through demos, and the integration of AI technologies. The challenges faced in AI development and the importance of developer relations are also highlighted, emphasizing the need for effective communication and support for users. There is chat about the unique challenges faced in SDK development in an era of LLMs, the creative environment of their of...

May 22, 20251 hr 10 minEp. 2

Opencode, Cloudflare, MCP MCP MCP, Scented Bin Liners and Broken Garmin Watches

Wilhelm and Matt are starting a podcast together. They discuss the differences in humor between the UK and the US. Scented trash bags are surprisingly beneficial. Matt's Garmin watch has malfunctioned after years of use. Wilhelm is adjusting to life in San Francisco, noting its high cost of living. They plan to include guests in their podcast to enhance discussions. Matt runs AI demo days to showcase startup innovations. Cloudflare is praised for its deployment experience. Wilhelm shares his cha...

May 14, 202548 minEp. 1
Hosted on Transistor
For the best experience, listen in Metacast app for iOS or Android