A New Trick Uses AI to Jailbreak AI Models—Including GPT-4
Dec 11, 2023•5 min
Episode description
Adversarial algorithms can systematically probe large language models like OpenAI’s GPT-4 for weaknesses that can make them misbehave. Read the story here.
Learn about your ad choices: dovetail.prx.org/ad-choices
For the best experience, listen in Metacast app for iOS or Android
