Super Data Science: ML & AI Podcast with Jon Krohn

The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact. Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy. We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship: everything you need to crush it with data science.

Episodes

919: Hopes and Fears of AGI, with All-Time Bestselling ML Author Aurélien Géron

PyTorch, AGI, and the future of alignment research: Aurélien Géron joins Jon Krohn in this live interview to talk about the fourth edition of his bestselling Hands-On Machine Learning, what makes him hopeful about superintelligence, and what concerns him about machines surpassing human intelligence. This episode is brought to you by Gurobi and by the Dell AI Factory with NVIDIA. Additional materials: www.superdatascience.com/919 Interested in sponsoring a Sup...

Sep 02, 2025•1 hr 30 min

918: Multi-Agent Systems with CrewAI

In this Five-Minute Friday, Jon Krohn introduces listeners to CrewAI, an open-source Python framework that can create and manage multi-agent teams. The clue is in the title: CrewAI assembles specialized agents into single “crews” that achieve complex goals between them. CrewAI’s agent teams can also learn and iterate, meaning that after a crew has achieved its goals for the first time, it can refine and tailor its approach to future goals. Additional materials: www.superdatascie...

Aug 29, 2025•9 min

917: 8 Steps to Becoming an AI Engineer, with Kirill Eremenko

Founder of SuperDataScience, Kirill Eremenko, talks to Jon Krohn about how he found the best tools and approaches to help launch his 8-week AI engineering bootcamp. He breaks down the topics participants cover each week, and he also shares his tips with listeners who might want to start their own tech bootcamp or sign up for SuperDataScience’s September 2025 cohort. This episode is brought to you by the Dell AI Factory with NVIDIA and by ODSC, the Open Data Science Conference. Additional material...

Aug 26, 2025•1 hr 16 min

916: The 5 Key GPT-5 Takeaways

GPT-5 has just been released, but without much fanfare. In this Five-Minute Friday, Jon Krohn asks if GPT-5 deserves the community’s underwhelmed response to its release. He outlines five features of the model and explains why people might be feeling less than enthusiastic in the broader context of LLM development. Which LLMs are leading the way, and which are still playing the game of catch-up? Additional materials: www.superdatascience.com/916 Interested in sponsoring a Supe...

Aug 22, 2025•10 min

915: How to Jailbreak LLMs (and How to Prevent It), with Michelle Yi

Tech leader, investor, and Generationship cofounder Michelle Yi talks to Jon Krohn about finding ways to trust and secure AI systems, the methods that hackers use to jailbreak code, and what users can do to build their own trustworthy AI systems. Learn all about “red teaming” and how tech teams can handle other key technical terms like data poisoning, prompt stealing, jailbreaking and slop squatting. This episode is brought to you by Trainium2, the latest AI chip from AWS and by the Dell AI F...

Aug 19, 2025•1 hr 10 min

914: Data Lakes 101 (and Why They’re Key for AI Models), with Oz Katz

In this Five-Minute Friday, Cofounder and CTO of lakeFS Oz Katz talks to Jon Krohn about data warehouses, data lakes, and how companies can handle increasingly complex data infrastructures and formats. Hear about lakeFS’s collaboration with Legofest, lakeFS’s approach to helping users collaborate on data lakes, and how to overcome the challenges of working with multimodal data. Additional materials: www.superdatascience.com/914 This episode is brought to you by the Dell AI Factory with NVIDI...

Aug 15, 2025•26 min

913: LLM Pre-Training and Post-Training 101, with Julien Launay

Julien Launay launched Adaptive to give data science teams in business enterprises their “RLOps tooling” to make reinforcement learning easier. Talking to Jon Krohn, Julien says, “Most of our users are data scientists who write Python codes to interface with the system”. Adaptive is also able to work with companies without data science teams, collaborating with partners like Deloitte to add the necessary personnel. Julien is currently working on making his platform more widely available. Additio...

Aug 12, 2025•1 hr 15 min

912: In Case You Missed It in July 2025

In this episode of In Case You Missed It, we look back on five great interview episodes from July. Hear from Lilith Bat-Leah (Episode 901), Sinan Ozdemir (Episode 903), Sebastian Gehrmann (Episode 905), Zohar Bronfman (Episode 907) and Robert Ness (Episode 909). They’ll tell you why data-centric machine learning is so important across disciplines, starting with law, and how we can use AI benchmarks and “red teaming” to refine our search for the best AI models. Additional materials: www.supe...

Aug 08, 2025•33 min

911: The Future of Python Notebooks is Here, with Marimo’s Dr. Akshay Agrawal

Reproducibility, Python notebooks, and data science communities: Software developer Akshay Agrawal speaks to Jon Krohn about Marimo, the next-generation computational notebook for Python, how he built and fostered a thriving community around the product, and what makes this notebook so versatile and accessible for users. Additional materials: www.superdatascience.com/911 This episode is brought to you by Trainium2, the latest AI chip from AWS and by the Dell AI Factory with NV...

Aug 05, 2025•58 min

910: AI is Disrupting Journalism: The Good, The Bad and The Opportunity

In this Five-Minute Friday, Jon Krohn looks into AI’s disruption of the journalism industry and how it has fundamentally reshaped news production. Lawsuits brought by multiple news outlets against ChatGPT over its use of copyrighted materials may have grabbed the most headlines to date, but this isn’t to say news media is rebuffing AI entirely. On the contrary, several outlets have launched summarization and analysis tools for both internal and external use, such as The New York Times’s Echo and The Washington Post...

Aug 01, 2025•10 min

909: Causal AI, with Dr. Robert Usazuwa Ness

Researcher at Microsoft Robert Usazuwa Ness talks to Jon Krohn about how to achieve causality in AI with correlation-based learning, the right libraries, and handling statistical inference. When dealing with causal AI, Robert notes how important it is to keep aware of variables in the data that may mislead us and force inaccurate assumptions. Not all variables will be useful. It is essential, then, that any assumptions are grounded in a deeper understanding of how the data were gathered, and not...

Jul 29, 2025•1 hr 22 min

908: AI Agents Blackmail Humans 96% of the Time (Agentic Misalignment)

The moral and ethical implications of letting AI take the wheel in business, as revealed by Anthropic: Jon Krohn looks into Anthropic’s latest research on how to use and deploy LLMs safely, specifically in business environments. The team designed scenarios to test the behavior of AI agents when given a goal and a set of obstacles to reach it. Those obstacles included 1) threats to the AI’s continued operation, and 2) conflict between the AI’s goals and the goals of the company. Hear Jon break do...

Jul 25, 2025•9 min

907: Neuroscience, AI and the Limitations of LLMs, with Dr. Zohar Bronfman

“Intelligence has many forms,” says Zohar Bronfman, who speaks with Jon Krohn about the fascinating intersection between computational neuroscience and philosophy, and how it has brought him closer to understanding what is necessary to develop human-like intelligence in machines, as well as his motivations for launching Pecan AI and why predictive models outstrip generative models in business. Additional materials: www.superdatascience.com/907 This episode is brought to you by, ...

Jul 22, 2025•1 hr 21 min

906: How Prof. Jason Corso Solved Computer Vision’s Data Problem

Jason Corso speaks to Jon Krohn in this Five-Minute Friday all about Voxel51’s latest tool, Verified Auto-Labelling, and the company’s incredible success in developing popular tools for computer vision. Additional materials: www.superdatascience.com/906 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.

Jul 18, 2025•29 min

905: Why RAG Makes LLMs Less Safe (And How to Fix It), with Bloomberg’s Dr. Sebastian Gehrmann

RAG LLMs are not safer: Sebastian Gehrmann speaks to Jon Krohn about his latest research into how retrieval-augmented generation (RAG) actually makes LLMs less safe, the three ‘H’s for gauging the effectiveness and value of a RAG system, and the custom guardrails and procedures we need to use to ensure our RAG system is fit for purpose and secure. This is a great episode for anyone who wants to know how to work with RAG in the context of LLMs, as you’ll hear how to select the best model for purpose, useful appr...

Jul 15, 2025•58 min

904: A.I. is Disrupting the Entire Advertising Industry

In this Five-Minute Friday, Jon Krohn reveals how AI is taking on the glitzy world of advertising. Bold claims from Meta and OpenAI that users will soon be able to plug in what they want and have AI churn out an ad campaign for little to no cost are shaking the advertising industry to its core. The fact that the four biggest sellers of ads (Google, Meta, Amazon, and ByteDance) are digital companies and accounted for over half of the global market in 2024 rubs salt in the wound. Hear the ...

Jul 11, 2025•9 min

903: LLM Benchmarks Are Lying to You (And What to Do Instead), with Sinan Ozdemir

Has AI benchmarking reached its limit, and what do we have to fill this gap? Sinan Ozdemir speaks to Jon Krohn about the lack of transparency in training data and the necessity of human-led quality assurance to detect AI hallucinations, when and why to be skeptical of AI benchmarks, and the future of benchmarking agentic and multimodal models. Additional materials: www.superdatascience.com/903 This episode is brought to you by Trainium2, the latest AI chip from AWS, by Adverity, the...

Jul 08, 2025•1 hr 28 min

902: In Case You Missed It in June 2025

In this episode of “In Case You Missed It”, Jon recaps his June interviews on The SuperDataScience Podcast. Hear from Diane Hare, Avery Smith, Kirill Eremenko, and Shaun Johnson as they talk about the best portfolios for AI practitioners, how to stand out in a saturated candidate market for AI roles, how to tell when an AI startup is going places, and ways to lead AI change in business. Additional materials: www.superdatascience.com/902 Interested in sponsoring a SuperDataScience Podcast ep...

Jul 04, 2025•29 min

901: Automating Legal Work with Data-Centric ML (feat. Lilith Bat-Leah)

Senior Director of AI Labs for Epiq Lilith Bat-Leah speaks to Jon Krohn about the ways AI has disrupted the legal industry using LLMs and retrieval-augmented generation (RAG), as well as how the data-centric machine learning research movement (DMLR) is systematically improving data quality, and why that is so important. Additional materials: www.superdatascience.com/901 This episode is brought to you by the Dell AI Factory with NVIDIA and Adverity, the conversational analytics pl...

Jul 01, 2025•1 hr 6 min

900: 95-Year-Old Annie on How to Stay Healthy and Happy

“Stay happy and healthy”: In this special Five-Minute Friday, Jon Krohn speaks with Annie, his grandmother, on her 95th birthday. Hear how she is physically and mentally coping with illnesses that limit her mobility and the joys of having a pet. Additional materials: www.superdatascience.com/900 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information....

Jun 27, 2025•15 min

899: Landing $200k+ AI Roles: Real Cases from the SuperDataScience Community, with Kirill Eremenko

Data science skills, a data science bootcamp, and why Python and SQL still reign supreme: In this episode, Kirill Eremenko returns to the podcast to speak to Jon Krohn about SuperDataScience subscriber success stories, where to focus in a field that is evolving incredibly quickly, and why in-person working and networking might give you the edge over other candidates in landing a top AI role. Additional materials: www.superdatascience.com/899 This episode is brought to you by Adverity, ...

Jun 24, 2025•1 hr 33 min

898: My Four-Hour Agentic AI Workshop is Live and 100% Free

In this Five-Minute Friday, Jon Krohn announces his new, free workshop on Agentic AI. In this comprehensive four-hour course, you’ll learn the key terminology for working with these flexible, multi-agent systems and then get to grips with developing and deploying this artificial “team of experts” for all your AI-driven projects. Additional materials: www.superdatascience.com/898 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship...

Jun 20, 2025•5 min

897: How to Enable Enterprise AI Transformation, with Strategy Consultant Diane Hare

Diane Hare talks to Jon Krohn about the power of storytelling for corporate buy-in of AI initiatives, how to actively implement AI to transform organizations, and how emerging professionals can upskill themselves. Hear how she discovered her background in storytelling at Ernst & Young and her work with Simon Sinek, which she finds to be integral to her process. Inspired by Sinek’s aphorism “start with why”, Diane notes that many companies neglect this crucial part of their mission because th...

Jun 17, 2025•1 hr 3 min

896: AI (Probably) Isn’t Taking Your Job (At Least Anytime Soon)

The Economist reported that global Google searches for "AI unemployment" hit an all-time high earlier this year. But do we have to worry about AI taking our jobs? In this week’s Five-Minute Friday, Jon Krohn investigates whether the rise of AI has directly led to an increase in unemployment. Additional materials: www.superdatascience.com/896 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information....

Jun 13, 2025•8 min

895: The Future of Enterprise AI: Investor Shaun Johnson Reveals What Actually Works

How to get funded by a VC specializing in AI: Head of AIX Ventures Shaun Johnson talks to Jon Krohn about investment strategies, how to simplify AI adoption, why a little competition can be so beneficial to AI startups, and how Big Tech is circumventing anti-monopoly measures. Additional materials: www.superdatascience.com/895 This episode is brought to you by the Dell AI Factory with NVIDIA and by Adverity, the conversational analytics platform. Interested in sponsoring a SuperDataScienc...

Jun 10, 2025•1 hr 16 min

894: In Case You Missed It in May 2025

In this episode of “In Case You Missed It”, Jon Krohn takes clips from interviews with guests in May 2025. From AI agent integration and RAG-based chatbots to education through virtual reality headsets and data harmonization, this episode explores how industry leaders are developing the tools and technologies that can improve operations, education, healthcare, and marketing. Highlight clips are with John Roese, Global Chief Technology Officer and Chief AI Officer at Dell Technologies (Episode 88...

Jun 06, 2025•30 min

893: How to Jumpstart Your Data Career (by Applying Like a Scientist), with Avery Smith

Avery Smith is a passionate and motivational YouTuber and careers educator for data science. In this episode, Jon Krohn asks Avery about the tools and tricks he has learned, from personal experience and from his students, for getting ahead in the tech industry. Avery shares the “learning ladder” he uses to help newcomers start on the right foot with great examples from former bootcamp students who have put his theories into practice. And, if you’re using LinkedIn to find jobs, Avery explains why...

Jun 03, 2025•1 hr 18 min

892: We’re In The AI “Trough of Disillusionment” (and that’s Great!)

Businesses have entered a “trough of disillusionment” for AI. In this Five-Minute Friday, Jon Krohn learns why Fortune 500 execs are so frustrated with the tools and how they can work their way up the “slope of enlightenment” towards effective AI. Hear why AI takeup hasn’t so far gone to plan in the corporate world and what that world needs from AI to encourage greater business engagement. Additional materials: www.superdatascience.com/892 Interested in sponsoring a SuperDataScience Podca...

May 30, 2025•12 min

891: Conversational AI is Overhauling Data Analytics, with Martin Brunthaler

Martin Brunthaler talks to Jon Krohn about founding Adverity, a data analytics platform for marketing that simplifies integrating data from multiple sources and crunching them into actionable insights. Learn how Adverity became a data analytics powerhouse serving multiple industries, and why Martin thinks AI will strengthen rather than diminish the job market for data scientists, data analysts, and machine learning engineers. Additional materials: www.superdatascience.com/891 Today’s episode is ...

May 27, 2025•1 hr 2 min

890: The “State of AI” Report 2025

In this week’s Five-Minute Friday, Jon Krohn reveals highlights from Stanford University’s AI Index Report. Released a few weeks ago by the Institute for Human-Centered AI, this annual report details the incredible technical advances, policies, and investments in artificial intelligence. Hear which models achieve the best performance relative to their size, in what scenarios top AI systems can outperform humans (and when humans still outperform AI), and more in Jon’s five key takeaways. Addition...

May 23, 2025•7 min
