From DevOps ‘Heart Attacks’ to AI-Powered Diagnostics With Traversal’s AI Agents

Jun 24, 2025 · 41 min

Episode description

Anish Agarwal and Raj Agrawal, co-founders of Traversal, are transforming how enterprises handle critical system failures. Their AI agents can perform root cause analysis in 2-4 minutes instead of the hours typically spent by teams of engineers scrambling in Slack channels. Drawing on their academic research in causal inference and gene regulatory networks, they’ve built agents that systematically traverse complex dependency maps to identify the smoking-gun logs and problematic code changes. As AI-generated code becomes more prevalent, Traversal addresses a growing challenge: debugging systems whose original code was not written by humans, which makes AI-powered troubleshooting essential for maintaining reliable software at scale.

Hosted by Sonya Huang and Bogomil Balkansky, Sequoia Capital

Mentioned in this episode:

SRE: Site reliability engineering. The function within engineering teams that monitors and improves the availability and performance of software systems and services.
Golden signals: The four key metrics used by site reliability engineers (SREs) to monitor the health and performance of IT systems: latency, traffic, errors, and saturation.
MELT data: Metrics, events, logs, and traces. A framework for observability.
The Bitter Lesson: Another mention of Turing Award winner Rich Sutton’s influential post.
From the Training Data podcast.