$4 Self-Modifying Coding Agents?! Evan Boyle Breaks Down GenSX

Oct 17, 2025 · 1 hr · Season 1 · Ep. 16

Episode description

What if building complex AI agents felt as natural as composing React components—and they could even rewrite their own code? 🤯

In this episode of One Shot / AI Tinkerers, host Joe sits down with Evan Boyle, founder of GenSX, to explore a radically new way to design, run, and ship long-running agent workflows.

🔑 Key takeaways

- React-inspired component model for agents – why JSX-style, type-safe functions beat static graphs for scalability and code reuse.

- Traces, telemetry & evals baked-in – see every prompt, variable, and LLM call in real time.

- $4 self-modifying coding agent – Evan demos an agent that checks out its own repo, refactors 3K lines, runs tests, and pushes to GitHub… iteratively.

- Real-world production use cases – from million-document legal discovery to inbox-wide entity extraction and analytics.

- Durable execution & infra shift – why 5-second latencies and massive parallelism are forcing a rethink of serverless, queues, and caching.

- Developer experience first – faster dev loops with component-level caching, Cursor rules, and LLM “rubber-duck” debugging tricks.

🛠️ Tools & frameworks mentioned

GenSX, React/JSX, OpenAI & Anthropic models, Temporal, Pulumi, Cursor, LangChain, LlamaIndex, CrewAI, and more.

🔗 Try GenSX → https://www.gensx.com

💬 Join the community → https://github.com/gensx-inc/gensx

🐦 Follow Evan on X/Twitter → https://x.com/_Evan_Boyle

🙌 Enjoyed the conversation?

👍 Like, 🔔 subscribe, and drop your questions or aha moments in the comments. It helps more builders discover the pod!

📍 Chapters

00:00 Intro & Evan’s background

04:28 Why existing agent frameworks break at scale

12:55 Inside the React-style component model

23:10 Live demo: Hacker News Analyzer (1,000 LLM calls in parallel)

32:45 Tracing, telemetry, and evals

38:20 The self-modifying code agent ($4/iteration)

50:40 Real production agent use cases

59:05 Dev-tooling tips: caching, logging-only debug loops

1:08:30 The future of AI infrastructure & closing thoughts

#GenSX #AIAgents #DeveloperExperience #React #SelfModifyingCode #AIWorkflow #OneShotPodcast
