Discovering AI Risks with AIs | Ethan Perez | EAG Bay Area 23

EAG Talks

May 26, 2023•54 min

--:--

Listen in podcast apps:

Apple Podcasts

Spotify

Download

Listen to this episode in Metacast mobile app

Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

Watch on Youtube

In this talk Ethan presents on how AI systems like ChatGPT can be used to help uncover potential risks in other AI systems, such as tendencies towards power-seeking, self-preservation, and sycophancy. Ethan is a research scientist and team lead at Anthropic working on large language models, and his work aims to reduce the risk of catastrophic outcomes from advanced machine learning systems. He also spend some time at New York University (NYU) collaborating with Sam Bowman's group on AI safety research.

For the best experience, listen in Metacast app for iOS or Android