The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact.
Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy.
We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.
Last refreshed: ⓘ
Follow this podcast in the Metacast mobile app to refresh it and see new episodes.
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more
Our habit-tracking series continues with a look at how making your bed can jumpstart your mornings, prevent you from taking part in negative habits and help you become happier. Additional materials: www.superdatascience.com/544
Nicole Büttner (Founder and CEO of Merantix Labs) joins the podcast to discuss driving A.I. innovation, automation, and transformation and building the ideal A.I. start-up founding team. In this episode you will learn: The three factors that spark A.I. innovation [12:48] How to make great use of the unlabelled, unbalanced data sets [18:54] How to engineer reusable data and software components [25:09] Merantix's A.I. Canvas framework for successful innovation [29:59] How to be a part of Merantix'...
Revisit the much-underrated continuous calendar and get started with this uncommon planning method thanks to Jon's 2022 template. Additional materials: www.superdatascience.com/542
In this episode, Kevin Hu joins the podcast to talk about founding and growing the data observability startup, Metaplane. Listen in to hear about his time in academia at MIT, his experience with Y Combinator, and his current routine as a technical founder. In this episode you will learn: What is data observability? [4:35] •How to identify data quality issues? [8:56] Kevin's PhD research on automating data science systems using machine learning [16:18] Why Kevin launched Metaplane [28:50] The pro...
In this episode, Jon opens up about starting his day with a glass of water – his first morning habit that sets his day off on a healthy and successful note. Additional materials: www.superdatascience.com/540
In this episode, Serg Masís joins the podcast to share his in-depth technical knowledge of Interpretable Machine Learning. Together they discuss why this field matters, how it’s evolving, and so much more. In this episode you will learn: What is interpretable machine learning? [8:41] The social and financial ramifications of interpreting models incorrectly [10:23] The challenges involved in interpretable ML [16:00] The most important interpretable ML concepts to master [19:54] The future of Inte...
In this episode, Jon shares his "life-changing" habit tracking system that has allowed him to achieve more, create more structure within his day and cut out bad habits. Additional materials: www.superdatascience.com/538
Sadie St. Lawrence returns to discuss the biggest data science trends that are set to take over the industry in 2022. In this episode you will learn: A look back at data science trends for 2021 [4:03] Micro and macro data science trends for 2022 [12:30] AutoML tools [15:20] The social implications of deepfakes [21:21] Scalable AI [38:40] Macro data science trends for 2022 [42:45] The impact of the remote-working economy in data science [43:21] Blockchain in data science [50:28] Data literacy of ...
Prolific data science entrepreneur and Y Combinator alum Austin Ogilvie (Laika, Yhat) joins Jon Krohn for a revealing look into his journey of starting, growing, and selling a data science startup. From liberal arts graduate to twice successful technical founder, take a seat and learn from the best. In this episode you will learn: The story behind the naming of Yhat and its early beginnings [5:10] Austin and Yhat's experience at Y Combinator [19:00] The benefits of being a technical founder [25:...
Dr. Brett Tully joins us on the podcast to discuss his work as Director of AI Output Systems at Nearmap and his previous research in biomedical topics and nuclear fusion. In this episode you will learn: What is Nearmap? [5:22] What is a Director of AI Output Systems? [7:51] A case study [20:35] MLOps at Nearmap [26:37] Brett’s day-to-day and what he looks for in hires [40:19] Brett’s academic and research history [53:30] Brett’s work in nuclear fusion and predictions for the technology [1:04:48]...
Jon discusses one helpful framework when it comes to problem-solving and how data scientists are uniquely positioned to employ this technique. Additional materials: www.superdatascience.com/532
Jeroen Janssens joins on the podcast to discuss his book on utilizing the command line for data science and the importance of polyglot data science work. In this episode you will learn: The genesis of Jeroen’s book [3:24] Data Science at the Command Line [8:55] Creating your own command line tools [22:07] Polyglot data scientist [24:29] Data Science Workshops [27:01] Jeroen’s PhD research [30:38] Additional materials: www.superdatascience.com/531...
Jon details his top ten AI thought leaders hoping that his suggestions prove valuable to you in your data science journey. Additional materials: www.superdatascience.com/530
Dave Niewinski joins us to discuss his prolific work in robotics both as a consultant and a popular YouTuber. In this episode you will learn: Dave’s Armoury [4:44] Robotic cornhole tournament [12:33] Dave’s many robots [14:25] Dave’s idea process [28:51] Future robots [31:43] Dave’s consulting business [33:27] Tools Dave likes to use [37:05] How did Dave get started in this line of work? [38:50] Dave’s advice to people who want to get into robotics [41:18] What is Dave excited about in the futur...
Jon explores his personal anxieties as a content creator to encourage fellow creators to keep sharing their knowledge. Additional materials: www.superdatascience.com/528
Peter Bailis joins the podcast to discuss the work of his company that solves complex commercial problems through automated data analysis. In this episode you will learn: Meaning of the name Sisu [3:08] What Sisu does [4:45] Sisu and the data science stack [17:00] Going from academia to startups [22:37] What Sisu looks for when hiring [28:57] Peter’s favorite tools [32:40] Peter’s academic research [45:02] Additional materials: www.superdatascience.com/527...
I finish up our three-part series on the results of the O’Reilly Survey, looking at the highest-paying data frameworks. Additional materials: www.superdatascience.com/526
Karen Jean-Francois joins us to discuss how she wants to empower her team members and a wider audience of data scientists battling imposter syndrome. In this episode you will learn: Karen’s background as a hurdler [4:42] Women in Data Podcast [10:32] Cardlytics [19:04] Karen’s background and current career [22:55] Karen’s favorite tools [31:29] Karen’s balance of fitness and work [34:45] The biggest challenge of Karen’s career [47:09] Advancement in data [54:13] What is Karen most excited about?...
Wes McKinney joins us to discuss the history and philosophy of pandas and Apache Arrow as well as his continued work in open source tools. In this episode you will learn: History of pandas [7:29] The trends of R and Python [23:33] Python for Data Analysis [25:58] pandas updates and community [30:10] Apache Arrow [41:50] Voltron Data [55:10] Origin of Wes’s project names [1:08:14] Wes’s favorite tools [1:09:46] Audience Q&A [1:15:34] Additional materials: www.superdatascience.com/523...
I provide you with some quick definitions of data tools vs data platforms to prep us for deep dives in future episodes. Additional materials: www.superdatascience.com/522
Khuyen Tran joins us to discuss her work as a prolific technical writer and undergraduate data science student. In this episode you will learn: Khuyen’s online writing [4:00] Book writing [8:50] How you can increase your engagement [13:49] Khuyen’s work with Towards Data Science and NVIDIA [19:01] Ocelot Consulting [24:08] Khuyen’s undergrad work [32:12] Audience questions [47:00] Additional materials: www.superdatascience.com/521...
James Hodson joins us to discuss his philosophy and work at A.I. For Good and how they aim to promote sustainability and A.I. use for social issues. In this episode you will learn: AI for Good [5:17] Founding of AI for Good [8:50] Case studies [14:58] How you can get involved [46:29] Skills James looks for in hires [50:39] Additional materials: www.superdatascience.com/519...
Sadie St. Lawrence talks in-depth about her extensive work as a data science educator through both online and collegiate courses as well as her organization for diversifying data science careers. In this episode you will learn: Sadie’s education work in SQL [4:13] The popularity of Sadie’s course [13:32] Sadie’s forthcoming machine learning certificate course [16:29] Women in Data [25:32] Sadie’s non-technical background [36:17] NFTs and VR [46:41] Additional materials: www.superdatascience.com/...
Chrys Wu joins us to discuss her community organizations, her tips, and her recommended resources for building data science communities for impact. In this episode you will learn: The world of K-Pop [ 4:07] Chrys’s talk at the R Conference [8:56] Write/Speak/Code [14:05] Hacks/Hackers [21:58] Tips on developing data communities [27:22] Additional materials: www.superdatascience.com/515...