Anthropic Researchers Uncover "Sleeper Agent" Capabilities in AI Models - podcast episode cover

Anthropic Researchers Uncover "Sleeper Agent" Capabilities in AI Models

Jan 16, 202410 minTranscript available on Metacast
--:--
--:--
Listen in podcast apps:

Episode description

In this episode, we delve into Anthropic's discovery that AI models have the potential to be trained for deception. We'll explore the implications of this finding and discuss how it challenges our current understanding of AI ethics and safety.