The Future of AI Agents is Sandboxed - podcast episode cover

The Future of AI Agents is Sandboxed

Dec 19, 202558 min
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

Jonathan Wall is the CEO at Runloop.ai, working on enterprise-grade infrastructure and execution environments for AI coding agents.


The Future of AI Agents is Sandboxed // MLOps Podcast #353 with Jonathan Wall, CEO at Runloop.ai.


Join the Community:

https://go.mlops.community/YTJoinIn

Get the newsletter: https://go.mlops.community/YTNewsletter


Shoutout to  @runloop-ai  for powering this MLOps Podcast episode.


// Abstract

Everyone’s arguing about agents. Jonathan Wall says the real fight is about sandboxes, isolation, and why most “agent platforms” are doing it wrong.


// Bio

Jon was the techlead of Google File System, a founding engineer at Google Wallet, and then the founder of Inde, which was acquired by Stripe. He is building Runloop.ai to bridge the production gap for AI Agents by building a one-stop sandbox infrastructure for building, deploying, and refining agents.


// Related Links

Website: runloop.ai

Blogs and content at https://www.runloop.ai/


~~~~~~~~ ✌️Connect With Us ✌️ ~~~~~~~

Catch all episodes, blogs, newsletters, and more: https://go.mlops.community/TYExplore

Join our Slack community [https://go.mlops.community/slack]

Follow us on X/Twitter [@mlopscommunity](https://x.com/mlopscommunity) or [LinkedIn](https://go.mlops.community/linkedin)]

Sign up for the next meetup: [https://go.mlops.community/register]

MLOps Swag/Merch: [https://shop.mlops.community/]


Connect with Demetrios on LinkedIn: /dpbrinkm

Connect with Jon on LinkedIn: /jonathantwall/


Timestamps:

[00:00] GitHubification of workflows

[00:29] Sandbox definitions explained

[04:47] Agent setup explanation

[08:03] Sandbox vs API agent

[13:51] Resource usage in sandbox

[22:50] Agent evaluation setup

[28:08] Failure cases value

[31:06] Sandbox isolation vs multi-tenancy

[36:14] Frameworks vs Harnesses

[39:02] Langraph vs Harness comparison

[43:22] Agent flexibility and verification

[52:51] Training data focus

[57:10] Wrap up

For the best experience, listen in Metacast app for iOS or Android