275: Machine Learning Through Reinforcement & Contextual Bandits

Super Data Science: ML & AI Podcast with Jon Krohn

Jul 03, 2019•1 hr 2 min

--:--

Listen in podcast apps:

Apple Podcasts

Spotify

Download

Listen to this episode in Metacast mobile app

Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

In this episode of the SuperDataScience Podcast, I chat with the Machine Learning Research Scientist, John Langford. You will hear about unsupervised, supervised learning and reinforcement learning, and the differences between the three. You will learn about applications of contextual bandits and reinforcement learning in general, YOLO style algorithms versus simulator algorithms, technics for avoiding local optimums. You will also learn about the balance between exploration and exploitation, learning to search and active learning.

If you enjoyed this episode, check out show notes, resources, and more at www.superdatascience.com/275

For the best experience, listen in Metacast app for iOS or Android