Data contracts are redefining data quality and governance, and Chad Sanderson, CEO of Gable.ai, joins host Jon Krohn to explain how they can transform your data strategy. He breaks down what data contracts are, how they shift data quality checks closer to production, and why they’re essential for reducing data debt. Chad also highlights how better alignment between data producers and consumers can elevate data reliability and tackle change-management challenges in modern organizations. This epis...
Oct 08, 2024•1 hr 2 min
Llama 3.2 brings a new era of AI innovation with lightweight models tailored for on-device applications and powerful vision models for handling complex image inputs. Host Jon Krohn explores how this release pushes the boundaries of open-source AI, making it more accessible and versatile for developers. He also covers the Llama Stack toolkit, designed to streamline deployment, and Llama Guard 3, Meta’s latest content moderation solution. With extensive support from major cloud and hardware partne...
Oct 04, 2024•14 min
Virtual humans are rewriting the rules of digital communication and reshaping entire industries. This week, Jon Krohn welcomes Natalie Monbiot, Head of Strategy at Hour One, to shed light on how AI avatars are revolutionizing L&D and e-commerce by turning traditional training and product listings into captivating, presenter-led content. This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, by Gurobi, the Decision Intelligence Leader, and by ODSC, the Open Data Scie...
Oct 01, 2024•1 hr 21 min
NotebookLM, Google’s latest AI tool, takes content creation to a new level. This week, Jon Krohn shares how the platform transformed his 200-page dissertation into a fascinating 11-minute podcast. Discover how AI can turn vast amounts of information into engaging and digestible content, opening up new possibilities for content creation. Additional materials: www.superdatascience.com/822 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorshi...
Sep 27, 2024•19 min
Marck Vaisman speaks to Jon Krohn about his paradigm for understanding core data practitioner types. Hear Marck detail the four data practitioner personas that he has identified in his research, why he believes the roadmaps that influencers like to promote as surefire ways to a data science career don’t work in practice, and why the term “data scientist” is still so elusive and hard to recruit for. This episode is brought to you by Gurobi, the Decision Intelligence Leader. Interested in sponsori...
Sep 24, 2024•1 hr 13 min
Jon Krohn takes OpenAI’s new models (o1-preview and o1-mini) for a spin in this Five-Minute Friday, learning their key strengths and limitations, and how the o1 series may represent yet another landmark for generative AI. Additional materials: www.superdatascience.com/820 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Sep 20, 2024•27 min
SuperDataScience veteran and Udemy teacher Luka Anicin is on the podcast to talk about his brand-new course, “PyTorch: From Zero to Hero”, available exclusively on superdatascience.com. Host Jon Krohn asks Luka why he feels that every data scientist should consider PyTorch as their default Python library, and why “keeping it simple” can secure the success of a machine learning project. This episode is brought to you by AWS Inferentia and AWS Trainium, and by Gurobi, the Decision Intelligence Lea...
Sep 17, 2024•1 hr 6 min
Experts from AI and data science discuss the impact and benefits of decentralization, the importance of structuring AI systems in business, and why knowing the basics will always matter for data engineers. Listen to Shingai Manjengwa (episode 809), Daniel Hulme (episode 807), Jerry Yurchisin (episode 813) and Nick Elprin (episode 811) explore a future world of work that rewards continuing learners, sets tasks for the people best suited to complete them rather than those whose job titles reflect ...
Sep 13, 2024•30 min
Dr. Julia Silge, Engineering Manager at Posit, introduces the brand-new Positron IDE, perfect for exploratory data analysis and visualization. She also lays out her top picks for LLMs that boost coding efficiency and discusses when traditional NLP methods might be the smarter choice over LLMs. Plus, Julia highlights some must-know open-source libraries that make managing MLOps easier than ever. Tune in for insights that every data scientist, ML engineer, and developer will find useful. This epis...
Sep 10, 2024•1 hr 36 min
Jon Krohn takes on a listener's challenge to explain his work in data science to his 94-year-old grandmother, Annie. This heartwarming conversation covers what data is, the role of a data scientist, and breaks down artificial intelligence (AI) and artificial general intelligence (AGI) in simple terms. The episode provides a fresh take on how to communicate complex topics to a lay audience, offering both clarity and insight. Additional materials: www.superdatascience.com/816 Interested in sponsor...
Sep 06, 2024•20 min
Polars, Python, Narwhals, Rust, and Pandas: Marco Gorelli talks to Jon Krohn about the many ways to use the newest data libraries available, the joys of open-source development, and the best method to win prizes in forecasting competitions. This episode is brought to you by AWS Inferentia and AWS Trainium, by Babbel, the science-backed language-learning platform, and by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatas...
Sep 03, 2024•1 hr 27 min
As summer winds down, this episode shifts focus from the usual tech discussions to something more personal: reflecting on the importance of balancing work with life’s simple pleasures. While the world of data science and AI continues to evolve rapidly, it's essential to remember that true success isn't just about professional milestones. It’s also about cherishing the moments that make life meaningful. Tune in for a brief but impactful reflection on how to redefine success to include not just ac...
Aug 30, 2024•4 min
Jerry Yurchisin from Gurobi joins Jon Krohn to break down mathematical optimization, showing why it often outshines machine learning for real-world challenges. Find out how innovations like NVIDIA’s latest CPUs are speeding up solutions to problems like the Traveling Salesman in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: • The Burrito Optimization Game and mathematical optim...
Aug 27, 2024•1 hr 44 min
In this episode of Five-Minute Friday, Jon Krohn investigates published findings from the startup Sakana AI and its paper’s co-authors from the University of Oxford, the University of British Columbia and the Vector Institute in Toronto. These authors explore the potential of The AI Scientist, a framework that could change the way we conduct scientific research forever. Additional materials: www.superdatascience.com/812 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@s...
Aug 23, 2024•12 min
Nick Elprin talks to Jon Krohn about how and when to scale a data science team and its workflows to secure a company’s commercial viability. You’ll also hear how to launch your own data science startup and why it’s so important to understand that AI tools are not one-size-fits-all. This episode is brought to you by AWS Inferentia and AWS Trainium. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will ...
Aug 20, 2024•1 hr 14 min
Self-driving cars are here, and Jon Krohn is breaking down the five levels of automation that could change driving forever. From full human control at Level 0 to cars that drive themselves in any condition at Level 5, get the real story on what these levels mean. With firsthand insights from a recent autonomous vehicle experience, this episode cuts through the buzz and tells you what’s coming next. Additional materials: www.superdatascience.com/810 Interested in sponsoring a SuperDataScience Pod...
Aug 16, 2024•9 min
Agentic AI is revolutionizing the tech landscape, and Shingai Manjengwa from ChainML is here to tell us why. Discover how AI agents are becoming an integral part of our lives, automating tasks like travel bookings and daily inspiration. Shingai explains the power of multi-agent systems, where AI agents collaborate to solve complex challenges, and highlights how blockchain technology is enhancing AI transparency and trust. Plus, get an inside look at ChainML’s innovative Theoriq protocol and the ...
Aug 13, 2024•1 hr 10 min
Advice for emerging data scientists, the latest in model merging, and how GenAI can supercharge your creativity: Host Jon Krohn gives us his highlights from a month of interviews, packed with tips from some of the leading names in data science and beyond. Guests include Daliana Liu, Charles Duhigg, Charles Goddard, Rosanne Liu and Andrey Kurenkov. Additional materials: www.superdatascience.com/808 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for...
Aug 09, 2024•38 min
The singularity could soon be upon us. The PESTLE framework, developed by this episode’s guest Daniel Hulme, expresses not one but six types of singularity that could occur: political, environmental, social, technological, legal and economic. Jon Krohn and Daniel Hulme discuss how each of these singularities could bring good to the world, aligning with human interests and pushing forward progress. They also talk about neuromorphic computing, machine consciousness, and applying AI at work. This e...
Aug 06, 2024•1 hr 9 min
Llama 3.1 is here, and it’s a game-changer. Meta’s latest AI model, especially the massive 405B variant, finally brings an open-source option to compete with giants like OpenAI's GPT-4o and Anthropic's Claude 3.5 Sonnet. While Meta didn’t fully open-source everything, the availability of "open weights" is a strategic move to shake up the AI landscape. The model boasts an impressive 128,000-token context window and multilingual support in eight languages. Meta is also focusing on responsible AI d...
Aug 02, 2024•19 min
Become a Supercommunicator! New York Times bestselling author Charles Duhigg, known for The Power of Habit and Smarter Faster Better, gets real about mastering communication in this episode. Discover insights from his latest book, Supercommunicator, where he reveals how to align conversation styles for deeper connections, handle conflicts effectively, and why AI can't replicate the emotional depth of human interactions. This episode is brought to you by Gurobi, the Decision Intelligence Leader. ...
Jul 30, 2024•55 min
Solar power now provides 6% of the world's electricity, thanks to rapid growth. Host Jon Krohn discusses the factors driving this rise, the challenges ahead, and how AI and data science are optimizing solar technologies. Tune in for insights on the future of solar power, and don't forget to like, share, and subscribe! Additional materials: www.superdatascience.com/804 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Jul 26, 2024•14 min
Daliana Liu is a big name in data science teaching, and she has always been generous in sharing everything she knows about getting a job in data science. In this episode, she continues to extend her generosity, helping listeners define their approach to achieving a fulfilling career in data science and tech. This episode is brought to you by AWS Inferentia and AWS Trainium, by Babbel, the science-backed language-learning platform, and by Gurobi, the Decision Intelligence Leader. Interested in sp...
Jul 23, 2024•1 hr 55 min
How to grab investor interest with your AI startup idea, revisiting algorithms, and helping practitioners ensure AI safety with regulatory frameworks and beyond: This month, you missed a whole bunch of great interviews. But don’t worry, Jon Krohn is here to recap all the best bits for you! Additional materials: www.superdatascience.com/802 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
Jul 19, 2024•24 min
Merged LLMs are the future, and we’re exploring how with Mark McQuade and Charles Goddard from Arcee AI on this episode with Jon Krohn. Learn how to combine multiple LLMs without adding bulk, train more efficiently, and dive into different expert approaches. Discover how smaller models can outperform larger ones and leverage open-source projects for big enterprise wins. This episode is packed with must-know insights for data scientists and ML engineers. Don’t miss out! Interested in sponsoring a...
Jul 16, 2024•1 hr 17 min
The SuperDataScience Podcast is celebrating its 800th episode! Host Jon Krohn speaks to his grandmother, Annie, about growing up at a time when so many technologies we take for granted today were yet to be developed. Listen in to hear Annie’s experience of the changes in technology across 94 years and how she and her family fared in 1940s Ukraine with no electricity or running water. Additional materials: www.superdatascience.com/800
Jul 12, 2024•44 min
No-code games with GenAI, the creative possibilities of LLMs, and our proximity to AGI: In this episode, Jon Krohn talks to Andrey Kurenkov about what turned him from an AGI skeptic to a positivist. You’ll also hear about his wildly popular podcast “Last Week in AI” and how the NVIDIA-backed startup Astrocade is helping videogame enthusiasts to create their own games through generative AI. A must-listen! This episode is brought to you by AWS Inferentia and AWS Trainium. Interested in sponsoring ...
Jul 09, 2024•1 hr 46 min
Claude 3.5 Sonnet, Anthropic’s newest model, is making waves in the AI community. This mid-size model outshines the larger Claude 3 Opus in tasks like code generation, content creation, and document summarization, and it’s twice as fast. In this episode of The Super Data Science Podcast, Jon Krohn discusses its top-notch performance across benchmarks like MMLU, GPQA, and HumanEval, along with its improved machine vision capabilities. Plus, learn about the new Artifacts UI feature, which makes ma...
Jul 05, 2024•15 min
Dr. Rosanne Liu, Research Scientist at Google DeepMind and co-founder of the ML Collective, shares her journey and the mission to democratize AI research. She explains her pioneering work on intrinsic dimensions in deep learning and the advantages of curiosity-driven research. Jon and Dr. Liu also explore the complexities of understanding powerful AI models, the specifics of character-aware text encoding, and the significant impact of diversity, equity, and inclusion in the ML community. With pu...
Jul 02, 2024•1 hr 10 min
Want to feel optimistic about your day? In this Friday episode, Simon Kuestenmacher talks to Jon Krohn about demography: What it is, why it’s so important, and why its forecasts should give us reason to hope for a better future. In an increasingly globalized world, and with an aging population in countries with the biggest GDPs, demography is more valuable than ever. Additional materials: www.superdatascience.com/796 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@supe...
Jun 28, 2024•43 min