Super Data Science: ML & AI Podcast with Jon Krohn - podcast cover

Super Data Science: ML & AI Podcast with Jon Krohn

The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact. Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy. We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.

Episodes

592: How to Sell a Multimillion Dollar A.I. Contract

In this episode, Jon Krohn welcomes A.I. industry veteran Ben Taylor to discuss how to sell multimillion dollar A.I. contracts. Tune in to hear why trust and proof of value are some of the critical steps in his sales process. Additional materials: www.superdatascience.com/592

Jul 15, 20223 min

591: Simulations and Synthetic Data for Machine Learning

Mars Buttfield-Addison, PhD Candidate at the University of Tasmania, joins Jon Krohn for a high-energy episode covering everything from Machine Learning simulations to Swift, space junk, and more! In this episode you will learn: • What simulations and synthetic data are, and why they can be invaluable for real-life applications [5:47] • How simulated bots can solve any problem [9:07] • Practical uses of simulated data [21:49] • Why the mobile operating system language Swift is interesting for A....

Jul 12, 20221 hr 15 min

590: Artificial General Intelligence is Not Nigh (Part 2 of 2)

In this episode, Jon continues his two-part series on artificial general intelligence (AGI) and why we are unlikely to realize it anytime soon. Listen in as Jon reviews Meta's Yann LeCun's seven-part perspective on the topic. Additional materials: www.superdatascience.com/590

Jul 08, 20226 min

589: Narrative A.I. with Hilary Mason

Hilary Mason, Co-Founder and CEO of Hidden Door, joins Jon Krohn for a live discussion that explores narrative A.I., emerging ML techniques, and how her OSEMN data science process developed. In this episode you will learn: How narrative A.I. can assist creativity [5:14] How to build ML products that have no quantitative error function to optimize [10:31] How to ensure creative A.I. systems do not output non-sense or explicit content [16:58] Hilary's OSEMN data science process [21:05] The emergin...

Jul 05, 202256 min

588: Artificial General Intelligence is Not Nigh

In this episode, Jon kicks off a two-part series that sees him explore the popular topic of artificial general intelligence and why it might–or might not–be only a few years away. Listen in as Jon explains the several reasons why he doesn't believe that AGI is nigh. Additional materials: www.superdatascience.com/588

Jul 01, 20226 min

587: Data Engineering for Data Scientists

Mark Freeman, Senior Data Scientist at Humu, joins Jon Krohn to talk about all things data engineering and offers listeners some critical tips for their data science career journey – from what it takes to get promoted to his number one tip for getting hired at a fast-growing capital-backed startup. In this episode you will learn: How Humu leverages data and machine learning to improve workplace behaviors [10:38] What is data engineering? [14:21] What it takes to get promoted into more senior dat...

Jun 28, 20221 hr 25 min

586: Daily Habit #10: Limit Social Media Use

In this episode, Jon dives into the popular topic of social media and its impact on his productivity. Tune in to hear how minimizing the use of social media can positively impact your days, mental health and work. Additional materials: www.superdatascience.com/586

Jun 24, 20225 min

585: PyMC for Bayesian Statistics in Python

In this episode, Dr. Thomas Wiecki, Core Developer of the PyMC Library and CEO of PyMC Labs, joins Jon for a masterclass in Bayesian statistics. Tune in to hear about PyMC, and discover why Bayesian statistics can be more powerful and interpretable than any other data modeling approach. In this episode you will learn: What Bayesian statistics is [7:30] Why Bayesian statistics can be more powerful and interpretable than any other data modeling approach [17:20] How PyMC was developed [20:41] Comme...

Jun 21, 20221 hr 26 min

584: OpenAI Codex

In this episode, Jon reviews the remarkable natural language model Codex by OpenAI. Learn why it has amassed a waitlist and how you can leverage its practical applications in your work. Additional materials: www.superdatascience.com/584

Jun 17, 20224 min

583: The State of Natural Language Processing

In this episode, natural language processing (NLP) expert and Lead Data Scientist at CB Insights, Rongyao Huang, joins Jon Krohn to discuss NLP. Listen in for a thorough review of the field over the past decade and how the coming iron age of NLP will help us overcome the limitations of today's approaches. In this episode you will learn: The evolution of NLP techniques over the past decade [4:14] What's next in the coming iron age of NLP [35:33] Rongyao’s Bauhaus-inspired model for effective data...

Jun 14, 20221 hr 15 min

582: Model Speed vs Model Accuracy

In this episode, Jon wraps up his three-part series on business value and machine learning. Listen in as he explains why starting with simple models is best, and why speed is likely more important to your users than accuracy. Additional materials: www.superdatascience.com/582

Jun 10, 20223 min

581: Bayesian, Frequentist, and Fiducial Statistics in Data Science

In this episode founding Editor-in-Chief of the Harvard Data Science Review and Professor of Statistics at Harvard University, Prof. Xiao-Li Meng, joins Jon Krohn to dive into data trade-offs that abound, and shares his view on the paradoxical downside of having lots of data. In this episode you will learn: What the Harvard Data Science Review is and why Xiao-Li founded it [5:31] The difference between data science and statistics [17:56] The concept of 'data minding' [22:27] The concept of 'data...

Jun 07, 20221 hr 25 min

580: Collecting Valuable Data

In this episode, Jon resumes his series on strategies for getting business value from machine learning. Part one saw him review several ways to identify a commercial problem before starting data collection or ML model development. And now, in part two, Jon digs into the data collection process. Additional materials: www.superdatascience.com/580

Jun 03, 20226 min

579: Transforming Dentistry with A.I.

In this episode, the CEO of Overjet, Dr. Wardah Inam, joins Jon Krohn to discuss the classification and quantification of dental diagnoses with computer vision, her data labeling challenges, and tips for building a successful A.I. business. In this episode you will learn: How Overjet leverages computer vision to qualify and quantify dental diagnoses [5:11] How A.I. solutions reduce the under-diagnosis of common diseases like periodontal disease [8:15] Overjet's particular ML challenges within th...

May 31, 202247 min

578: Identifying Commercial ML Problems

In this episode, Jon kicks off a new Five-Minute Friday series that explores the strategies for getting business value from machine learning. Part one sees him review several ways to identify a commercial problem before starting data collection or ML model development. Additional materials: www.superdatascience.com/578

May 27, 20224 min

577: Scaling A.I. Startups Globally

In this episode, the former CEO and co-founder behind Onfido, an AI-based ID verification, joins Jon Krohn to discuss his path to start-up success. Tune in to hear valuable information from Husayn Kassai. In this episode you will learn: How Husayn's start-up journey began [5:55] How Husayn determined that his challenge could be solved by machine vision [11:18] Onfido's initial seed stages [18:23] Launching and scaling your start-up in the U.S. market [22:00] The most important component in build...

May 24, 202255 min

576: Tech Startup Dramas

Hollywood has officially fallen for the drama of tech startups! Tune in to hear Jon Krohn review the small-screen adaptations of WeWork (WeCrashed), Uber (Super Pumped), and Theranos (The Dropout). Additional materials: www.superdatascience.com/576

May 20, 20223 min

575: Optimizing Computer Hardware with Deep Learning

In this episode, the Director of Architecture at NVIDIA, Dr. Magnus Ekman, joins Jon Krohn to discuss how machine learning, including deep learning, can optimize computer hardware design. The pair also review his exceptional book 'Learning Deep Learning.' In this episode you will learn: What hardware architects do [10:15] How ML can optimize hardware speed [ 13:19] Magnus’s Deep Learning Book [21:14] Is understanding how ML models work important? [36:16] Algorithms inspired by biological evoluti...

May 17, 20221 hr 24 min

574: Music for Deep Work

In this episode, Jon shares how the right music can power your productivity. It's no secret that he's a big fan of 'deep work,' but this week, he opens up about the artists, sites, and playlists that propel his productivity to new levels. Additional materials: www.superdatascience.com/574

May 13, 20224 min

573: Automating ML Model Deployment

In this episode, co-founder and CEO of Linea, Dr. Doris Xin, joins Jon Krohn to discuss how automating ML model deployment delivers groundbreaking change to data science productivity, and shares what it's like being the CEO of an exciting, early-stage tech start-up. In this episode you will learn: How Linea reduces ML model deployment down to a couple of lines of Python code [5:14] Linea use cases [11:30] How DAGs can 10x production workflow efficiency [22:12] ML model graphlets and reducing was...

May 10, 20221 hr 7 min

572: Daily Habit #9: Avoiding Messages Until a Set Time Each Day

In this episode, Jon shares his habit of blocking out two hours in his mornings that are free from email and social media distractions. Tune in to learn how this habit helps him deeply focus on his most delightful tasks of the day. Additional materials: www.superdatascience.com/572

May 06, 20223 min

571: Collaborative, No-Code Machine Learning

Einblick co-founder and associate professor at MIT, Tim Kraska, joins Jon Krohn to discuss no-code collaboration tools for data science and uncovers the clever database and machine learning tricks under the hood of the visual data computing platform. In this episode you will learn: The inspiration behind Einblick [2:45] Einblick's progressive approximation engine [6:43] How no-code tools impact productivity [17:18] The critical steps to become more data-driven as an organization [24:30] How rese...

May 03, 202258 min

570: DALL-E 2: Stunning Photorealism from Any Text Prompt

In this episode, Jon is back with another A.I. model breakthrough! He updates listeners on OpenAI's outstanding DALL-E 2 model. The new natural language processing model churns out staggering visual examples of whatever text your mind can dream up. Additional materials: www.superdatascience.com/570

Apr 29, 20226 min

569: A.I. For Crushing Humans at Poker and Board Games

Research Scientist at Meta AI, Dr. Noam Brown, joins Jon Krohn to discuss his award-winning no-limit poker-playing algorithms and the real-world implications of his game-playing A.I. breakthroughs. In this episode you will learn: What Meta A.I. is and how it fits into Meta, the company [3:01] Noam's award-winning no-limit poker-playing algorithms, Libratus and Pluribus algorithms. [4:33] What game theory is and how does Noam integrate it into his models? [8:45] The real-world implications of Noa...

Apr 26, 202245 min

568: PaLM: Google's Breakthrough Natural Language Model

In this episode, Jon updates listeners on one of the industry's biggest breakthroughs to date –Google's new natural language processing model, PaLM. The key innovation with PaLM is scaling up Google's Pathways modeling approach to half a trillion parameters — many-fold more parameters than had previously been trained using this approach. Additional materials: www.superdatascience.com/568

Apr 22, 20225 min

567: Open-Access Publishing

In this episode, the MIT Press Director and Publisher, Dr. Amy Brand, joins Jon Krohn to discuss open-access publishing in data science and how to address the inequalities that exist for women and minorities in STEM. In this episode you will learn: What it’s like to run the prestigious MIT Press [4:34] How open access makes scholarly work more impactful [6:34] How publishing outstanding STEM books for broader audiences, including for children, can help address STEM biases [19:28] Amy's award-win...

Apr 19, 20221 hr 18 min

566: The Best Time to Plant a Tree

In this episode, Jon reflects on the Chinese proverb: "The best time to plant a tree was 20 years ago. The second best time is now." He also challenges listeners to reflect on their long-term goals that have gone unfulfilled. Additional materials: www.superdatascience.com/566

Apr 15, 20224 min

565: AGI: The Apocalypse Machine

In this episode, Jeremie Harris dives into the stirring topic of AI Safety and the existential risks that Artificial General Intelligence poses to humankind. In this episode you will learn: Why mentorship is crucial in a data science career development [15:45] Canadian vs American start-up ecosystems [24:18] What is Artificial General Intelligence (AGI)? [38:50] How Artificial Superintelligence could destroy the world [1:04:00] How AGI could prove to be a panacea for humankind and life on the pl...

Apr 12, 20222 hr 5 min

564: Clem Delangue on Hugging Face and Transformers

In this episode, Jon speaks with the CEO of Hugging Face, Clem Delangue, about open-source machine learning and transformer architectures, while attending the ScaleUp:AI Conference in New York. Additional materials: www.superdatascience.com/564

Apr 08, 202219 min

563: How to Rock at Data Science — with Tina Huang

In this episode, superstar data science YouTuber Tina Huang joins us to discuss what it's like to work at one of the world's largest tech companies, her strategies for efficient learning, and how best to prepare for a career in data science from scratch. In this episode you will learn: The key areas to focus on when getting started in data science [6:01] Tina’s five steps to consistently doing anything [11:55] Tina's day-to-day life as a data scientist at one of the world’s largest tech companie...

Apr 05, 20221 hr 5 min
For the best experience, listen in Metacast app for iOS or Android
Open in Metacast