ChatGPT Jailbreaks: The Grandma Exploit - podcast episode cover

ChatGPT Jailbreaks: The Grandma Exploit

Jul 03, 202324 minSeason 1Ep. 15
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

How do you extract prohibited information from ChatGPT? Grandma and DAN exploits trick language models into violating their own policies. Why these techniques work, what they reveal about LLM architecture, and how companies protect against prompt injection attacks. Solo episode on LLM security.

To stay in touch, sign up for our newsletter at https://www.superprompt.fm

For the best experience, listen in Metacast app for iOS or Android