Scheming AIs | Joe Carlsmith | EA Global Bay Area 2024 - podcast episode cover

Scheming AIs | Joe Carlsmith | EA Global Bay Area 2024

Mar 06, 202452 min
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

This talk examines whether advanced AIs that perform well in training will be doing so in order to gain power later — a behavior Joe Carlsmith calls "scheming" (also often called "deceptive alignment"). This talk gives an overview of his recent report on the topic, available on arXiv here: https://arxiv.org/abs/2311.08379. Joe Carlsmith is a senior research analyst at Open Philanthropy, where he focuses on existential risk from advanced artificial intelligence. He also writes independently about various topics in philosophy and futurism, and he has a doctorate in philosophy from the University of Oxford.


Watch on Youtube: https://www.youtube.com/watch?v=AxUTiGS6BHM

For the best experience, listen in Metacast app for iOS or Android