Machine Learning Solution for Failed Job Auto Remediation [Netflix] - podcast episode cover

Machine Learning Solution for Failed Job Auto Remediation [Netflix]

May 06, 202414 min
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

Description: In this episode, we will talk about the importance of remediating failed workflow jobs to reduce business infrastructure costs. We delve into Netflix's approach, which involves enhancing their existing rule-based error classifier with advanced machine learning models. This allowed for auto-remediation, improving the handling of memory configuration and unclassified errors, ultimately leading to substantial cost savings.


Based on their published tech blog, with the link provided here for your reference: https://netflixtechblog.com/evolving-from-rule-based-classifier-machine-learning-powered-auto-remediation-in-netflix-data-039d5efd115b

For the best experience, listen in Metacast app for iOS or Android