Lexiang Huang | Metastable Failures in the Wild | #17 - podcast episode cover

Lexiang Huang | Metastable Failures in the Wild | #17

Jan 09, 202353 minSeason 3Ep. 2
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

Summary:

In this episode Lexiang Huang talks about a framework for understanding a class of failures in distributed systems called metastable failures. Lexiang tells us about his study on the prevalence of such failures in the wild and how he and his colleagues scoured over publicly available incident reports from many organizations, ranging from hyperscalers to small companies. Listen to the episode to find out about his main findings and gain a deeper understanding of metastable failures and how you can identity, prevent, and mitigate against them!


Links:

Hosted on Acast. See acast.com/privacy for more information.

For the best experience, listen in Metacast app for iOS or Android