Super Data Science: ML & AI Podcast with Jon Krohn - podcast cover

Super Data Science: ML & AI Podcast with Jon Krohn

The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact. Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy. We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.
Download Metacast podcast app
Podcasts are better in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episodes

601: Venture Capital for Data Science

This week, Sarah Catanzaro, General Partner at Amplify Partners joins Jon for an episode that dives into the venture capital side of data science. Learn how to fund your data science business idea, take note of what start-ups can do to survive or raise capital in the current economic climate, and discover how to break into the field of venture capital yourself. In this episode you will learn: • Angel vs. venture capital vs. private equity investment [7:27] • How early-stage investment is made pr...

Aug 16, 202256 min

600: Yoga Nidra Practice with Steve Fazzari

Rest and relaxation await as Steve Fazzari joins us this week for a special edition of the podcast! Tune in for a rejuvenating session of Yoga Nidra led beautifully by the expert. Additional materials: www.superdatascience.com/600

Aug 12, 202235 min

599: MLOps: Machine Learning Operations

This week, Mikiko Bazeley, Senior Software Engineer at Mailchimp joins the podcast to share her in-depth knowledge of MLOps: Machine Learning Operations. Tune in to hear her discuss what it entails, why it's so critical for the efficiency of any data science team, and the most important tools you need to master for career success in this field. In this episode you will learn: • What MLOps is [11:40] • Mikiko’s role at Mailchimp and why MLOps is critical for the efficiency of any data science tea...

Aug 09, 20221 hr 21 min

598: Getting Kids Excited about STEM Subjects

Ben Taylor makes a fourth appearance on Five-Minute Friday to discuss the best ways to introduce STEM to children. Tune in to hear the many ways in which he thinks STEM education will evolve in the future. Additional materials: www.superdatascience.com/598

Aug 05, 202212 min

597: A.I. Policy at OpenAI

Dr. Miles Brundage, Head of Policy Research at OpenAI, joins Jon Krohn this week to discuss AI model production, policy, safety, and alignment. Tune in to hear him speak on GPT-3, DALL-E, Codex, and CLIP as well. In this episode you will learn: • Miles’ role as Head of Policy Research at OpenAI [4:35] • OpenAI's DALL-E model [7:20] • OpenAI's natural language model GPT-3 [30:43] • OpenAI's automated software-writing model Codex [36:57] • OpenAI’s CLIP model [44:01] • What sets AI policy, AI safe...

Aug 02, 20221 hr 23 min

596: The A.I. Platforms of the Future

Ben Taylor returns for a third Five-Minute Friday episode! This week, he looks ahead and digs into what we can expect from the A.I. platforms of the future. Additional materials: www.superdatascience.com/596

Jul 29, 20227 min

595: Data Engineering 101

Tune in as Joe Reis and Matt Housley, co-founders of Ternary Data and co-authors of the book “Fundamentals of Data Engineering” join Jon Krohn to discuss major undercurrents across the data engineering lifecycle, and their top tools and techniques. In this episode you will learn: • What is data engineering? [3:55] • Why Joe and Matt identify as “recovering data scientists” [6:12] • What kinds of people tend to become data scientists vs. data engineers [10:38]? • Key components of Joe and Matt’s ...

Jul 26, 20221 hr 19 min

593: The Real-World Impact of Cross-Disciplinary Data Science Collaboration

Jon welcomes Professor Philip Bourne, Founding Dean of the School of Data Science at the University of Virginia to discuss his biomedical data science research, the importance of open-source and open-access within the industry and the data science skills you need to succeed today. In this episode you will learn: • Why Philip founded a School of Data Science [6:08] • How computing and data science have evolved across academic departments [15:55] • The improvements needed in higher education [26:4...

Jul 19, 20221 hr 22 min

592: How to Sell a Multimillion Dollar A.I. Contract

In this episode, Jon Krohn welcomes A.I. industry veteran Ben Taylor to discuss how to sell multimillion dollar A.I. contracts. Tune in to hear why trust and proof of value are some of the critical steps in his sales process. Additional materials: www.superdatascience.com/592

Jul 15, 20223 min

591: Simulations and Synthetic Data for Machine Learning

Mars Buttfield-Addison, PhD Candidate at the University of Tasmania, joins Jon Krohn for a high-energy episode covering everything from Machine Learning simulations to Swift, space junk, and more! In this episode you will learn: • What simulations and synthetic data are, and why they can be invaluable for real-life applications [5:47] • How simulated bots can solve any problem [9:07] • Practical uses of simulated data [21:49] • Why the mobile operating system language Swift is interesting for A....

Jul 12, 20221 hr 15 min

590: Artificial General Intelligence is Not Nigh (Part 2 of 2)

In this episode, Jon continues his two-part series on artificial general intelligence (AGI) and why we are unlikely to realize it anytime soon. Listen in as Jon reviews Meta's Yann LeCun's seven-part perspective on the topic. Additional materials: www.superdatascience.com/590

Jul 08, 20226 min

589: Narrative A.I. with Hilary Mason

Hilary Mason, Co-Founder and CEO of Hidden Door, joins Jon Krohn for a live discussion that explores narrative A.I., emerging ML techniques, and how her OSEMN data science process developed. In this episode you will learn: How narrative A.I. can assist creativity [5:14] How to build ML products that have no quantitative error function to optimize [10:31] How to ensure creative A.I. systems do not output non-sense or explicit content [16:58] Hilary's OSEMN data science process [21:05] The emergin...

Jul 05, 202256 min

588: Artificial General Intelligence is Not Nigh

In this episode, Jon kicks off a two-part series that sees him explore the popular topic of artificial general intelligence and why it might–or might not–be only a few years away. Listen in as Jon explains the several reasons why he doesn't believe that AGI is nigh. Additional materials: www.superdatascience.com/588

Jul 01, 20226 min

587: Data Engineering for Data Scientists

Mark Freeman, Senior Data Scientist at Humu, joins Jon Krohn to talk about all things data engineering and offers listeners some critical tips for their data science career journey – from what it takes to get promoted to his number one tip for getting hired at a fast-growing capital-backed startup. In this episode you will learn: How Humu leverages data and machine learning to improve workplace behaviors [10:38] What is data engineering? [14:21] What it takes to get promoted into more senior dat...

Jun 28, 20221 hr 25 min

586: Daily Habit #10: Limit Social Media Use

In this episode, Jon dives into the popular topic of social media and its impact on his productivity. Tune in to hear how minimizing the use of social media can positively impact your days, mental health and work. Additional materials: www.superdatascience.com/586

Jun 24, 20225 min

585: PyMC for Bayesian Statistics in Python

In this episode, Dr. Thomas Wiecki, Core Developer of the PyMC Library and CEO of PyMC Labs, joins Jon for a masterclass in Bayesian statistics. Tune in to hear about PyMC, and discover why Bayesian statistics can be more powerful and interpretable than any other data modeling approach. In this episode you will learn: What Bayesian statistics is [7:30] Why Bayesian statistics can be more powerful and interpretable than any other data modeling approach [17:20] How PyMC was developed [20:41] Comme...

Jun 21, 20221 hr 26 min

584: OpenAI Codex

In this episode, Jon reviews the remarkable natural language model Codex by OpenAI. Learn why it has amassed a waitlist and how you can leverage its practical applications in your work. Additional materials: www.superdatascience.com/584

Jun 17, 20224 min

583: The State of Natural Language Processing

In this episode, natural language processing (NLP) expert and Lead Data Scientist at CB Insights, Rongyao Huang, joins Jon Krohn to discuss NLP. Listen in for a thorough review of the field over the past decade and how the coming iron age of NLP will help us overcome the limitations of today's approaches. In this episode you will learn: The evolution of NLP techniques over the past decade [4:14] What's next in the coming iron age of NLP [35:33] Rongyao’s Bauhaus-inspired model for effective data...

Jun 14, 20221 hr 15 min

582: Model Speed vs Model Accuracy

In this episode, Jon wraps up his three-part series on business value and machine learning. Listen in as he explains why starting with simple models is best, and why speed is likely more important to your users than accuracy. Additional materials: www.superdatascience.com/582

Jun 10, 20223 min

581: Bayesian, Frequentist, and Fiducial Statistics in Data Science

In this episode founding Editor-in-Chief of the Harvard Data Science Review and Professor of Statistics at Harvard University, Prof. Xiao-Li Meng, joins Jon Krohn to dive into data trade-offs that abound, and shares his view on the paradoxical downside of having lots of data. In this episode you will learn: What the Harvard Data Science Review is and why Xiao-Li founded it [5:31] The difference between data science and statistics [17:56] The concept of 'data minding' [22:27] The concept of 'data...

Jun 07, 20221 hr 25 min

580: Collecting Valuable Data

In this episode, Jon resumes his series on strategies for getting business value from machine learning. Part one saw him review several ways to identify a commercial problem before starting data collection or ML model development. And now, in part two, Jon digs into the data collection process. Additional materials: www.superdatascience.com/580

Jun 03, 20226 min

579: Transforming Dentistry with A.I.

In this episode, the CEO of Overjet, Dr. Wardah Inam, joins Jon Krohn to discuss the classification and quantification of dental diagnoses with computer vision, her data labeling challenges, and tips for building a successful A.I. business. In this episode you will learn: How Overjet leverages computer vision to qualify and quantify dental diagnoses [5:11] How A.I. solutions reduce the under-diagnosis of common diseases like periodontal disease [8:15] Overjet's particular ML challenges within th...

May 31, 202247 min

578: Identifying Commercial ML Problems

In this episode, Jon kicks off a new Five-Minute Friday series that explores the strategies for getting business value from machine learning. Part one sees him review several ways to identify a commercial problem before starting data collection or ML model development. Additional materials: www.superdatascience.com/578

May 27, 20224 min

577: Scaling A.I. Startups Globally

In this episode, the former CEO and co-founder behind Onfido, an AI-based ID verification, joins Jon Krohn to discuss his path to start-up success. Tune in to hear valuable information from Husayn Kassai. In this episode you will learn: How Husayn's start-up journey began [5:55] How Husayn determined that his challenge could be solved by machine vision [11:18] Onfido's initial seed stages [18:23] Launching and scaling your start-up in the U.S. market [22:00] The most important component in build...

May 24, 202255 min

576: Tech Startup Dramas

Hollywood has officially fallen for the drama of tech startups! Tune in to hear Jon Krohn review the small-screen adaptations of WeWork (WeCrashed), Uber (Super Pumped), and Theranos (The Dropout). Additional materials: www.superdatascience.com/576

May 20, 20223 min

575: Optimizing Computer Hardware with Deep Learning

In this episode, the Director of Architecture at NVIDIA, Dr. Magnus Ekman, joins Jon Krohn to discuss how machine learning, including deep learning, can optimize computer hardware design. The pair also review his exceptional book 'Learning Deep Learning.' In this episode you will learn: What hardware architects do [10:15] How ML can optimize hardware speed [ 13:19] Magnus’s Deep Learning Book [21:14] Is understanding how ML models work important? [36:16] Algorithms inspired by biological evoluti...

May 17, 20221 hr 24 min

574: Music for Deep Work

In this episode, Jon shares how the right music can power your productivity. It's no secret that he's a big fan of 'deep work,' but this week, he opens up about the artists, sites, and playlists that propel his productivity to new levels. Additional materials: www.superdatascience.com/574

May 13, 20224 min

573: Automating ML Model Deployment

In this episode, co-founder and CEO of Linea, Dr. Doris Xin, joins Jon Krohn to discuss how automating ML model deployment delivers groundbreaking change to data science productivity, and shares what it's like being the CEO of an exciting, early-stage tech start-up. In this episode you will learn: How Linea reduces ML model deployment down to a couple of lines of Python code [5:14] Linea use cases [11:30] How DAGs can 10x production workflow efficiency [22:12] ML model graphlets and reducing was...

May 10, 20221 hr 7 min

572: Daily Habit #9: Avoiding Messages Until a Set Time Each Day

In this episode, Jon shares his habit of blocking out two hours in his mornings that are free from email and social media distractions. Tune in to learn how this habit helps him deeply focus on his most delightful tasks of the day. Additional materials: www.superdatascience.com/572

May 06, 20223 min
For the best experience, listen in Metacast app for iOS or Android
Open in Metacast