Super Data Science: ML & AI Podcast with Jon Krohn - podcast cover

Super Data Science: ML & AI Podcast with Jon Krohn

The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact. Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy. We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.

Episodes

705: Feeding the World with ML-Powered Precision Agriculture

Join Jon Krohn as he chats with Syngenta Group's Feroz Sheikh, Jeremy Groeteke, and Thomas Jung about the digital revolution in agriculture. Learn how data science is evolving farming, from precision techniques to global food solutions. A compelling blend of tech meets nature.This episode is brought to you by AWS Inferentia and by Modelbit, for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.In this ...

Aug 15, 20231 hr 29 min

704: Jon’s “Generative A.I. with LLMs” Hands-on Training

Take on the world of GPT and learn to develop your own, commercially successful Large Language Models (LLMs) with Jon Krohn’s comprehensive, guided training video for generative AI. Get to grips with the technology, learn which tools to use, and find out how to get an eye for business-viable models with Jon’s (ad-)free educational video.Additional materials: www.superdatascience.com/704Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship informa...

Aug 11, 20235 min

703: How Data Happened: A History, with Columbia Prof. Chris Wiggins

Statistics history, interdisciplinarity, and data and society. Chris Wiggins talks with Jon Krohn about the power dynamics of data, the transformation of the field of biology through data-driven approaches to genetic sequencing, and the New York Times’ data science team’s cutting-edge approach to accommodating its tech stack.This episode is brought to you by the AWS Insiders Podcast and by Modelbit, for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Vis...

Aug 08, 20231 hr 9 min

702: Llama 2 — It's Time to Upgrade your Open-Source LLM

This week, Jon Krohn is examining Meta's newly released open-source large language model, Llama 2, highlighting its commercial prospects, immense capacity, model variety, and unique 'time awareness' feature. He also discusses its innovative two-stage RLHF approach that enhances its performance.Additional materials: www.superdatascience.com/702Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Aug 04, 202311 min

701: Generative A.I. without the Privacy Risks (with Prof. Raluca Ada Popa)

Dr. Raluca Ada Popa, renowned computer scientist, entrepreneur, and President of Opaque Systems, joins Jon Krohn to share her insights on securely interacting with AI APIs like OpenAI's GPT-4, the pros and cons of open vs. closed-source AI development, and the seamless operation of compute pipelines across multiple clouds.This episode is brought to you by AWS Inferentia and by Modelbit, for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.c...

Aug 01, 20231 hr 21 min

700: "The Dream of Life" by Alan Watts

Yoga and Hindu mythology: This special episode continues the thread of our centenary episodes, SDS 500: Yoga Nidra with Jes Allen and SDS 600: Yoga Nidra Practice with Steve Fazzari, which talked through guided meditation techniques to help improve posture, sleep, and expand consciousness. Inspired by these sessions, host Jon Krohn explores Hindu mythology via Alan Watts’ “The Dream of Life”.Additional materials: www.superdatascience.com/700Interested in sponsoring a SuperDataScience Podcast epi...

Jul 28, 20235 min

699: The Modern Data Stack, with Harry Glaser

Model deployment, data warehouse options for running models, and how to best leverage BI tools: Harry Glaser and Jon Krohn discuss Modelbit’s capabilities to automate ML models from notebooks into production-ready models, reducing the time and effort in ‘translating’ information from one mode to another. Harry’s conversation with host Jon Krohn expanded on the importance of automating this task, and how developments in ML modeling have widened access to entire teams to analyze data, whatever the...

Jul 25, 202351 min

698: How Firms Can Actually Adopt A.I., with Rehgan Avon

Company-wide AI adoption can take a lot of persuasion. Rehgan Avon talks to host Jon Krohn about why AI has become necessary for forward-thinking businesses and the steps to implement AI in an institution so that everyone benefits.Additional materials: www.superdatascience.com/698Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Jul 21, 202328 min

697: The (Short) Path to Artificial General Intelligence, with Dr. Ben Goertzel

AI visionary and CEO of SingularityNET Dr. Ben Goertzel provides a deep dive into the possible realization of Artificial General Intelligence (AGI) within 3-7 years. Explore the intriguing connections between self-awareness, consciousness, and the future of Artificial Super Intelligence (ASI) and discover the transformative societal changes that could arise.This episode is brought to you by AWS Inferentia, by the AWS Insiders Podcast, and by Modelbit, for deploying models in seconds. Interested ...

Jul 18, 20231 hr 27 min

696: Brain-Computer Interfaces and Neural Decoding, with Prof. Bob Knight

Jon Krohn welcomes Professor Dr. Bob Knight to explore human intelligence, the prefrontal cortex, and the transformative potential of brain implants for data collection. Discover the pivotal role of machine learning in treating Parkinson's and delve into exciting future advancements.Additional materials: www.superdatascience.com/696Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Jul 14, 20231 hr 3 min

695: NLP with Transformers, feat. Hugging Face's Lewis Tunstall

What are transformers in AI, and how do they help developers to run LLMs efficiently and accurately? This is a key question in this week’s episode, where Hugging Face’s ML Engineer Lewis Tunstall sits down with host Jon Krohn to discuss encoders and decoders, and the importance of continuing to foster democratic environments like GitHub for creating open-source models.This episode is brought to you by the AWS Insiders Podcast, by WithFeeling.ai, the company bringing humanity into AI, and by Mode...

Jul 11, 20231 hr 38 min

694: CatBoost: Powerful, efficient ML for large tabular datasets

Modeling tabular data and spreadsheets doesn’t have to be tedious with CatBoost’s open-source tree-boosting algorithm. CatBoost does what it says on the tin, blending categories with boosting that allows you to train your models faster and handle large datasets for ML tasks across multiple GPUs. In this week’s Five-Minute Friday, host Jon Krohn gets to grips with the technical components of CatBoost that give it the speed and accuracy so acclaimed by its users.Additional materials: www.superdata...

Jul 07, 20238 min

693: YOLO-NAS: The State of the Art in Machine Vision, with Harpreet Sahota

Harpreet Sahota, a data science expert and deep learning developer at Deci AI, joins Jon Krohn to explore the fascinating realm of object detection and the revolutionary YOLO-NAS model architecture. Discover how machine vision models have evolved and the techniques driving compute-efficient edge device applications..This episode is brought to you by AWS Inferentia, by WithFeeling.ai, the company bringing humanity into AI, and by Modelbit, for deploying models in seconds. Interested in sponsoring...

Jul 04, 20231 hr 20 min

692: Lossless LLM Weight Compression: Run Huge Models on a Single GPU

Join Jon as he navigates listeners through the innovative SpQR approach—a cutting-edge, lossless LLM weight compression technique that harnesses the power of quantization. Tune in as Jon delves into the four steps behind this groundbreaking method in this week's episode.Additional materials: www.superdatascience.com/692Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Jun 30, 20238 min

691: A.I. Accelerators: Hardware Specialized for Deep Learning

GPUs vs CPUs, chip design and the importance of chips in AI research: This highly technical episode is for anyone who wants to learn what goes into chip development and how to get into the competitive industry of accelerator design. With advice from expert guest Ron Diamant, Senior Principal Engineer at AWS, you’ll get a breakdown of the need-to-know technical terms, what chip engineers need to think about during the design phase and what the future holds for processing hardware.This episode is ...

Jun 27, 20231 hr 35 min

690: How to Catch and Fix Harmful Generative A.I. Outputs

Krishna Gade, the founder and CEO of Fiddler.AI, discusses the challenges faced by Large Language Models (LLMs) in Generative AI, including inaccuracies, biases, and privacy risks. He emphasizes the importance of monitoring to build trust in AI and highlights Fiddler's explainability algorithms and pre-built bias detection tools as vital solutions.Additional materials: www.superdatascience.com/690Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsors...

Jun 23, 202326 min

689: Observing LLMs in Production to Automatically Catch Issues

Arize's Amber Roberts and Xander Song join Jon Krohn this week, sharing invaluable insights into ML Observability, drift detection, retraining strategies, and the crucial task of ensuring fairness and ethical considerations in AI development.This episode is brought to you by Posit, the open-source data science company, by AWS Inferentia, and by Anaconda, the world's most popular Python distribution. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for spons...

Jun 20, 20231 hr 18 min

688: Six Reasons Why Building LLM Products Is Tricky

Prompt injection, prompt engineering, context windows, and more: In this week’s Five-Minute Friday, Jon explains why anyone looking to build their own product leveraging LLMs should stop to consider these and three more issues before jumping in. Phillip Carter first outlined these six issues in his article “All the Hard Stuff Nobody Talks About when Building Products with LLMs”.Additional materials: www.superdatascience.com/688Interested in sponsoring a SuperDataScience Podcast episode? Visit Jo...

Jun 16, 202314 min

687: Generative Deep Learning, with David Foster

Autoencoders, transformers, latent space: Learn the elements of generative AI and hear what data scientist David Foster has to say about the potential for generative AI in music, as well as the role that world models play in blending generative AI with reinforcement learning.This episode is brought to you by Posit, the open-source data science company, by Anaconda, the world's most popular Python distribution, and by WithFeeling.ai, the company bringing humanity into AI. Interested in sponsoring...

Jun 13, 20231 hr 47 min

686: Open-Source "Responsible A.I." Tools, with Ruth Yakubu

Mircosoft’s Ruth Yakubu joins Jon Krohn to discuss Responsible AI principles and the open-source Responsible AI Toolbox, allowing users to assess their models for fairness, inclusiveness, privacy, explainability, accountability, and reliability before deployment.Additional materials: www.superdatascience.com/686Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Jun 09, 202330 min

685: Tools for Building Real-Time Machine Learning Applications, with Richmond Alake

Richmond Alake, a Machine Learning Architect at Slalom Build, sits down with Jon to share real-time ML insights, tools and career experiences for a high-energy and high impact episode. From his work at Slalom Build to his two AI startups, discover the software choices, ML tools, and front-end development techniques used by a leader in the field.This episode is brought to you by Posit, the open-source data science company, by AWS Inferentia, and by WithFeeling.ai, the company bringing humanity in...

Jun 06, 20231 hr 6 min

684: Get More Language Context out of your LLM

Open-source LLMs, FlashAttention and generative AI terminology: Host Jon Krohn gives us the lift we need to explore the next big steps in generative AI. Listen to the specific way in which Stanford University’s “exact attention” algorithm, FlashAttention, could become a competitor for GPT-4’s capabilities.Additional materials: www.superdatascience.com/684Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Jun 02, 20236 min

683: Contextual A.I. for Adapting to Adversaries, with Dr. Matar Haller

Monitoring malicious, user-generated content; contextual AI; adapting to novel evasion attempts: Matar Haller speaks to Jon Krohn about the challenges of identifying, analyzing and flagging malicious information online. In this episode, Matar explains how contextual AI and a “database of evil” can help resolve the multiple challenges of blocking dangerous content across a range of media, even those that are live-streamed.This episode is brought to you by Posit, the open-source data science compa...

May 30, 20231 hr 21 min

682: Business Intelligence Tools, with Mico Yuk

In this week's episode, Mico Yuk, host of 'Analytics on Fire', joins Jon Krohn to share her effective business intelligence and analytics framework, BIDS, for persuading key decision makers. She crowns one "power" tool as the analytics king and discusses emerging tools that could challenge its dominance. Tune in for unapologetic insights on future and current BI trends and happenings from the world of BI and analytics.Additional materials: www.superdatascience.com/682Interested in sponsoring a S...

May 26, 202328 min

681: XGBoost: The Ultimate Classifier, with Matt Harrison

Unlock the power of XGBoost by learning how to fine-tune its hyperparameters and discover its optimal modeling situations. This and more, when best-selling author and leading Python consultant Matt Harrison teams up with Jon Krohn for yet another jam-packed technical episode! Are you ready to upgrade your data science toolkit in just one hour? Tune-in now!This episode is brought to you by Pathway, the reactive data processing framework, by Posit, the open-source data science company, and by Anac...

May 23, 20231 hr 12 min

680: Automating Industrial Machines with Data Science and the Internet of Things (IoT)

Industrial machinery’s dependence on data science, tech stacks to build IoT platforms, and transitioning from data science to product: This week’s Friday episode with Allegra Alessi explores the minutiae of product ownership for the Internet of Things at packaging company Bobst. Join host Jon Krohn and his guest as they unpack how the IoT is leading factory production.Additional materials: www.superdatascience.com/680Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com...

May 19, 202330 min

679: The A.I. and Machine Learning Landscape, with investor George Mathew

Generative AI, MLOps, and making smart investments in AI: This week’s episode is critical listening for AI investors and generative AI creators. AI investor George Mathew talks with host Jon Krohn about the emerging generative AI stack, the critical elements of MLOps to ensure a scalable model, and the tools developers can use for a saleable product.This episode is brought to you by Posit, the open-source data science company, by AWS Inferentia, and by Anaconda, the world's most popular Python d...

May 16, 20231 hr 34 min

678: StableLM: Open-source "ChatGPT"-like LLMs you can fit on one GPU

StableLM, the new family of open-source language models from the brilliant minds behind Stable Diffusion is out! Small, but mighty, these models have been trained on an unprecedented amount of data for single GPU LLMs. This week, Jon breaks down the mechanics of this model–see you there! Additional materials: www.superdatascience.com/678 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

May 12, 202312 min

677: Digital Analytics with Avinash Kaushik

How does one use marketing analytics to drive business success? Avinash Kaushik, Chief Strategy Officer at Croud and former Sr. Director of Global Strategic Analytics at Google joins Jon Krohn live for an exciting episode that covers the transformative power of AI, his 'four clusters of intent' framework and the value of hands-on data tools. This episode is brought to you by Pathway, the reactive data processing framework, by Posit, the open-source data science company, and by Anaconda, the worl...

May 09, 20231 hr 28 min

676: The Chinchilla Scaling Laws

Chinchilla AI, and fine-tuning proprietary tasks with large language models: On this week’s Five-Minute Friday, host Jon Krohn outlines the principles of the Chinchilla Scaling Laws, the incredible power of models such as Cerebras-GPT based on these laws, and the impact of scaling on the number of viable applications and commercial use cases.Additional materials: www.superdatascience.com/676Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship in...

May 05, 202313 min
For the best experience, listen in Metacast app for iOS or Android
Open in Metacast