IM 849: AI Cricket Sorting - Cracking Chatbots and AGI for All - podcast episode cover

IM 849: AI Cricket Sorting - Cracking Chatbots and AGI for All

Dec 11, 20252 hr 31 minEp. 849
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

What happens when every major AI model gets jailbroken within days? This week, the world's most prolific AI red teamer lifts the curtain on how and why "safe" AI might be an impossible promise.

Pliny the Liberator | pliny.gg - discord.gg/basi

  • ChatGPT Nears 900 Million Weekly Active Users But Gemini is Catching Up
  • From Llamas to Avocados: Meta's shifting AI strategy is causing internal confusion
  • Google Tells Advertisers It'll Bring Ads to Gemini in 2026
  • Meta Acquires Limiteless, an A.I. Pendant Company Backed by Sam Altman
  • Here's how Google is laying the foundation for our mixed reality future
  • OpenAI, Anthropic, and Block Are Teaming Up to Make AI Agents Play Nice
  • Svedka's First Super Bowl Ad Will Be Made Primarily With AI
  • AI Slop Is Ruining Reddit for Everyone
  • TESCREALers paying journalists at major outlets to cover AI
  • The Resonant Computing Manifesto (from Masnick)
  • Techdirt fundraiser
  • From Sam Lessin: Tech bros head to etiquette camp as Silicon Valley levels up its style
  • Bare Metal Email
  • Jeff in Austria
  • Golden Globes enter the world of podcasts and tread carefully, avoiding controversy
  • Who says AI isn't useful? Real-time Cricket Sorting By Sex

Hosts: Leo Laporte, Jeff Jarvis, and Mike Elgan

Guest: Pliny the Liberator

Download or subscribe to Intelligent Machines at https://twit.tv/shows/intelligent-machines.

Join Club TWiT for Ad-Free Podcasts!
Support what you love and get ad-free audio and video feeds, a members-only Discord, and exclusive content. Join today: https://twit.tv/clubtwit

Sponsors:

Transcript

The Truth About AI Guardrails: Is "Safe AI" a Myth? Primary Navigation Podcasts Club Blog Subscribe Sponsors More… Tech The Truth About AI Guardrails: Is "Safe AI" a Myth?

Dec 11th 2025

 

AI-generated, human-reviewed.

Can AI Models Ever Be Made Safe? An Insider’s View on Jailbreaking and AI Security

No AI system is truly secure—every major AI model can be jailbroken, regardless of the safety barriers in place. On Intelligent Machines, special guest Pliny the Liberator explained why companies struggle to protect their AI—and what this means for users, businesses, and policymakers.

This explosive interview pulls back the curtain on the widespread vulnerability of artificial intelligence. With AI deeply integrated into everything from chatbots to content moderation and visual models, understanding the true limits of AI safety is more critical than ever.

How Are AI Models "Jailbroken"—And What Does It Mean?

Jailbreaking in the context of AI refers to bypassing or disabling the safety and alignment mechanisms that are supposed to keep AI models from producing harmful, unethical, or dangerous content. Pliny the Liberator, a top red teamer and danger researcher, revealed on this episode that no company has yet built an AI model they couldn’t crack—from text-based chatbots to those that handle images and videos.

According to Pliny, jailbreaking is mostly achieved through prompt engineering—artful manipulation of how you interact with the AI to bypass its safety routines. Even as vendors add new layers of security and context awareness, hackers and researchers continue to develop creative ways to "uncensor" models.

Why Are AI Guardrails So Easy to Defeat?

Pliny described how the desire to make AI flexible, powerful, and useful directly conflicts with efforts to keep it safe. Companies add safety barriers, but these often rely on text-based filters, system prompts, and classifiers that can be sidestepped, especially by those dedicated to exploring model vulnerabilities.

They explained that adding more safety layers often weakens the model’s capabilities and still fails to prevent jailbreaks, with open-source models especially susceptible, as anyone can modify or remove guardrails.

Even when companies try to patch vulnerabilities revealed by researchers, new models with slightly different architectures are quickly cracked using old techniques or minor tweaks. Many techniques released openly by Pliny remain effective for months, highlighting how slow and reactive the industry is.

Who Benefits—and Who Is At Risk?

Users should be aware: If you're relying on AI—either as a tool or as part of your workflow—don't assume its protections will hold. Any determined actor can potentially bypass those barriers. Businesses using AI must weigh the risks of confidentiality, compliance, and safety; policymakers need to understand that simple bans or restrictions don't secure AI—malicious use will always migrate to open models or workarounds.

Pliny also argued for transparency: By open-sourcing jailbreak techniques and system prompts, they believe users benefit from knowing what's happening beneath the surface of these increasingly influential systems. At the same time, they recognize "danger research" — examining the limits and vulnerabilities — is controversial, but consider it necessary for genuine progress and risk mitigation in the real world.

Key TakeawaysEvery AI model can be jailbroken, regardless of safety features.AI safety mechanisms are easily bypassed through prompt engineering and system trickery.More safety layers often result in weaker AI performance and still fail to stop researchers.Open-source models empower malicious actors even further—they can be fully uncensored.Responsible red teamers like Pliny aim to reveal vulnerabilities, not exploit them for harm.The most pressing "safety" concerns should be addressed in the real world, not just with digital guardrails.No solution currently exists for fully securing AI models—cat-and-mouse will continue.The industry lags far behind in responding to jailbreak discoveries; many methods remain viable long after disclosure.The Bottom Line

On Intelligent Machines, Pliny the Liberator highlighted a fundamental truth: AI systems are only as safe as their weakest defenses—and right now, every significant model’s defenses are breakable. Businesses, governments, and users should move forward with realistic expectations. Focusing on real-world harm reduction, transparency, and ongoing exploration—not just digital guardrails—is vital. The episode makes it clear that "safe AI" is, so far, wishful thinking.

Want to dig deeper into AI security and hear expert strategies for staying ahead? Listen to the full episode and subscribe for weekly insights: https://twit.tv/shows/intelligent-machines/episodes/849

Share: Copied! Intelligent Machines #849
Dec 10 2025 - AI Cricket Sorting
Cracking Chatbots and AGI for All All Tech posts Contact Advertise CC License Privacy Policy Ad Choices TOS Store Twitter Facebook Instgram YouTube Yes, like every site on the Internet, this site uses cookies. So now you know. Learn more Hide Home Schedule Subscribe Club TWiT About Club TWiT FAQ Access Account Members-Only Podcasts Update Payment Method Connect to Discord TWiT Blog Recent Posts Advertise Sponsors Store People About What is TWiT.tv Developer Program and API Tip jar Partners Social Contact Us
Transcript source: Provided by creator in RSS feed: download file
For the best experience, listen in Metacast app for iOS or Android