635: The Perils of Manually Labeling Data for Machine Learning Models - podcast episode cover

635: The Perils of Manually Labeling Data for Machine Learning Models

Dec 13, 20221 hr 19 min
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

Hand labeling data and information bias: Jon Krohn speaks with Watchful CEO Shayan Mohanty about the pitfalls of data analysis when bias comes into the equation (spoiler alert: it always does), the importance of the Chomsky hierarchy in data management, and the importance of simulation engines for returning real-time results to users. This episode is brought to you by Iterative (iterative.ai), your mission control center for machine learning. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn:• Why bias in general is good [04:06]• The arguments against hand labeling [09:47]• How Shayan solves the problem of labeling at his company [24:26]• Misconceptions concerning hand-labeled data [43:25]• What the Chomsky hierarchy is [52:38]• Watchful’s high-performance simulation engine [1:04:51]• What Shayan looks for in his new hires [1:08:15] Additional materials: www.superdatascience.com/635
For the best experience, listen in Metacast app for iOS or Android
Open in Metacast
635: The Perils of Manually Labeling Data for Machine Learning Models | Super Data Science: ML & AI Podcast with Jon Krohn - Listen or read transcript on Metacast