4 - Risks from Learned Optimization with Evan Hubinger - podcast episode cover

4 - Risks from Learned Optimization with Evan Hubinger

Feb 17, 20212 hr 14 min
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

In machine learning, typically optimization is done to produce a model that performs well according to some metric. Today's episode features Evan Hubinger talking about what happens when the learned model itself is doing optimization in order to perform well, how the goals of the learned model could differ from the goals we used to select the learned model, and what would happen if they did differ.

 

Link to the paper - Risks from Learned Optimization in Advanced Machine Learning Systems: arxiv.org/abs/1906.01820

Link to the transcript: axrp.net/episode/2021/02/17/episode-4-risks-from-learned-optimization-evan-hubinger.html

Evan Hubinger's Alignment Forum profile: alignmentforum.org/users/evhub

For the best experience, listen in Metacast app for iOS or Android