Data Science at Home - podcast cover

Data Science at Home

Francesco Gadaletadatascienceathome.podbean.com

Cutting through AI bullsh*t.
Come join the discussion on Discord!
https://discord.gg/4UNKGf3

Last refreshed:
Follow this podcast in the Metacast mobile app to refresh it and see new episodes.
Download Metacast podcast app
Podcasts are better in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episodes

Prove It Without Revealing It: Exploring the Power of Zero-Knowledge Proofs in Data Science (Ep. 218)

In this episode, we dive into the fascinating world of zero-knowledge proofs and their impact on data science. Zero-knowledge proofs allow one party to prove to another that they know a secret without revealing the secret itself. This powerful concept has numerous applications in data science, from ensuring data privacy and security, to facilitating secure transactions and identity verification. We explore the mechanics of zero-knowledge proofs, its real-world applications, and how it is revolut...

Feb 27, 202316 minEp. 220

Deep learning vs tabular models (Ep. 217)

Deep learning methods are not as effective with tabular data. Here is why, and what to do about it. Sponsors If you're ready to take your WiFi game to the next level, head over to asus.click/ZenWiFi_XD5 or check out the show notes for this episode. Trust me, with ASUS ZenWiFi XD5, you'll get the best WiFi experience ever! References https://paperswithcode.com/methods/category/deep-tabular-learning https://m-clark.github.io/posts/2022-04-01-more-dl-for-tabular/...

Feb 21, 202328 minEp. 205

[RB] Online learning is better than batch, right? Wrong! (Ep. 216)

In this episode I speak about online learning systems and why blindly choosing such a paradigm can lead to very unpredictable and expensive outcomes. Also in this episode, I have to deal with an intruder :) Links Birman, K.; Joseph, T. (1987). "Exploiting virtual synchrony in distributed systems". Proceedings of the Eleventh ACM Symposium on Operating Systems Principles - SOSP '87 . pp. 123–138. doi : 10.1145/41457.37515 . ISBN 089791242X . S2CID 7739589 ....

Feb 15, 202329 minEp. 219

Chatting with ChatGPT: Pros and Cons of Advanced Language AI (Ep. 215)

In this episode, I'll be discussing the capabilities and limitations of ChatGPT, an advanced language AI model. I'll go over its power to understand and respond to natural language, and its applications in tasks such as language translation and text summarization. However, I'll also touch on the challenges that still need to be overcome such as bias and data privacy concerns. Tune in for a comprehensive look at the current state of advanced language AI. References https://datascienceathome.com/h...

Jan 26, 202331 minEp. 218

Accelerating Perception Development with Synthetic Data (Ep. 214)

In this episode I am with Kevin McNamara, founder and CEO of Parallel Domain. We speak about a very effective method to generate synthetic data that is currently in production at Parallel Domain. Enjoy the show! References Parallel Domain Synthetic Data Improves Cyclist Detection (blog post): https://paralleldomain.com/parallel-domain-synthetic-data-improves-cyclist-detection/ Beating the State of the Art in Object Tracking with Synthetic Data: https://paralleldomain.com/beating-the-state-of-the...

Jan 14, 202342 minEp. 217

Edge AI applications for military and space [RB] (Ep. 213)

Our Sponsors NordPass Business has developed a password manager, that will save you a lot of time and energy whenever you need access to business accounts, work across devices, even with the other members of your team, or whenever you need to share sensitive data with your colleagues, or make payments efficiently. All this with the highest standard of cyber secure technology. See NordPass Business in action now with a 3-month free trial here https://nordpass.com/DATASCIENCE with code DATASCIENCE...

Dec 13, 202221 minEp. 216

From image to 3D model (Ep. 212)

Is it possible to reconstruct a 3D model from a simple image? Under certain constraints, it is! In this episode I tell you how. Our Sponsors Explore the Complex World of Regulations. Compliance can be overwhelming. Multiple frameworks. Overlapping requirements. Let Arctic Wolf be your guide. Check it out at https://arcticwolf.com/datascience Amethix works to create and maximize the impact of the world’s leading corporations and startups, so they can create a better future for everyone they serve...

Dec 08, 202223 minEp. 215

Machine learning is physics (Ep. 211)

What if we borrowed from physics some theories that would interpret deep learning and machine learning in general? Here is a list of plausible ways to interpret our beloved ML models and understand why they works, or they don't. Enjoy the show! Our Sponsors NordPass Business has developed a password manager, that will save you a lot of time and energy whenever you need access to business accounts, work across devices, even with the other members of your team, or whenever you need to share sensit...

Dec 02, 202224 minEp. 214

Autonomous cars cannot drive. Here is why. (Ep. 210)

If you think that the problem of self-driving cars has been solved, think twice. As a matter of fact, the problem of self-driving cars cannot be solved with the technical solutions that companies are currently considering. Don't get fooled by marketing and PR on social media. Whoever is telling you they solved the problem of driving a vehicle fully autonomously, they are lying. Here is why. Our Sponsors Explore the Complex World of Regulations. Compliance can be overwhelming. Multiple frameworks...

Nov 21, 202235 minEp. 213

Evolution of data platforms (Ep. 209)

Let's look at the history of data platforms. How did they evolve? Why? Shall I switch to the latest architecture? Enjoy the show! Our Sponsors Explore the Complex World of Regulations. Compliance can be overwhelming. Multiple frameworks. Overlapping requirements. Let Arctic Wolf be your guide. Check it out at https://arcticwolf.com/datascience Amethix works to create and maximize the impact of the world’s leading corporations and startups, so they can create a better future for everyone they ser...

Nov 08, 202218 minEp. 212

[RB] Is studying AI in academia a waste of time? (Ep. 208)

Companies and other business entities are actively involved in defining data products and applied research every year. Academia has always played a role in creating new methods and solutions/algorithms in the fields of machine learning and artificial intelligence. However, there is doubt about how powerful and effective such research efforts are. Is studying AI in academia a waste of time? Our Sponsors Ready to advance your career in data science? University of Cincinnati Online offers nationall...

Nov 02, 202220 minEp. 211

Private machine learning done right (Ep. 207)

There are many solutions to private machine learning. I am pretty confident when I say that the one we are speaking in this episode is probably one of the most feasible and reliable. I am with Daniel Huynh, CEO of Mithril Security, a graduate from Ecole Polytechnique with a specialisation in AI and data science. He worked at Microsoft on Privacy Enhancing Technologies under the office of the CTO of Microsoft France. He has written articles on Homomorphic Encryptions with the CKKS explained serie...

Oct 25, 202227 minEp. 206

Edge AI for applications in military and space (Ep. 206)

Our Sponsors Ready to advance your career in data science? University of Cincinnati Online offers nationally recognized educational programs in business analytics and information systems. Predictive Analytics Today named UC as the No.1 MS Data Science school in the country and is nationally recognized with a proven track record of placing students at high-profile companies such as Google, Amazon and P&G. Discover more about the University of Cincinnati’s 100% online master’s degree programs ...

Oct 15, 202221 minEp. 210

[RB] What are generalist agents and why they can change the AI game (Ep. 205)

That deep learning alone is not sufficient to solve artificial general intelligence, is more and more accepted statement. Generalist agents have great properties that can overcome some of the limitations of single-task deep learning models. Be aware, we are still far from AGI, though. So what are generalist agents? References https://arxiv.org/pdf/2205.06175

Oct 05, 202221 minEp. 209

LIDAR, cameras and autonomous vehicles (Ep. 204)

How does an autonomous vehicle see? How does it sense the road? They are equipped of many sensors, of course. Are they all powerful enough? Small enough to hide them and make your car look beautiful? In this episode I speak about LIDAR, high resolution cameras and some machine learning methods adapted to a minimal number of sensors. Our Sponsors Ready to advance your career in data science? University of Cincinnati Online offers nationally recognized educational programs in business analytics an...

Sep 28, 202220 minEp. 202

Predicting Out Of Memory Kill events with Machine Learning (Ep. 203)

Sometimes applications crash. Some other times applications crash because memory is exhausted. Such issues exist because of bugs in the code, or heavy memory usage for reasons that were not expected during design and implementation. Can we use machine learning to predict and eventually detect out of memory kills from the operating system? Apparently, the Netflix app many of us use on a daily basis leverage ML and time series analysis to prevent OOM-kills. Enjoy the show! Our Sponsors Explore the...

Sep 20, 202220 minEp. 203

Is studying AI in academia a waste of time? (Ep. 202)

Companies and other business entities are actively involved in defining data products and applied research every year. Academia has always played a role in creating new methods and solutions/algorithms in the fields of machine learning and artificial intelligence. However, there is doubt about how powerful and effective such research efforts are. Is studying AI in academia a waste of time? Our Sponsors Explore the Complex World of Regulations. Compliance can be overwhelming. Multiple frameworks....

Sep 13, 202218 minEp. 204

Zero-Cost Proxies: How to find the best neural network without training (Ep. 201)

Neural networks are becoming massive monsters that are hard to train (without the "regular" 12 last-generation GPUs). Is there a way to skip that? Let me introduce you to Zero-Cost proxies References https://www.technologyreview.com/2022/08/05/1056814/automation-ai-machine-learning-automl/ https://iclr-blog-track.github.io/2022/03/25/zero-cost-proxies/...

Sep 07, 202220 minEp. 201

Online learning is better than batch, right? Wrong! (Ep. 200)

In this episode I speak about online learning systems and why blindly choosing such a paradigm can lead to very unpredictable and expensive outcomes. Also in this episode, I have to deal with an intruder :) Links Birman, K.; Joseph, T. (1987). "Exploiting virtual synchrony in distributed systems". Proceedings of the Eleventh ACM Symposium on Operating Systems Principles - SOSP '87 . pp. 123–138. doi : 10.1145/41457.37515 . ISBN 089791242X . S2CID 7739589 ....

Jun 13, 202229 minEp. 200

What are generalist agents and why they can change the AI game (Ep. 199)

That deep learning alone is not sufficient to solve artificial general intelligence, is more and more accepted statement. Generalist agents have great properties that can overcome some of the limitations of single-task deep learning models. Be aware, we are still far from AGI, though. So what are generalist agents? References https://arxiv.org/pdf/2205.06175

Jun 03, 202221 minEp. 199

Streaming data with ease. With Chip Kent from Deephaven Data Labs (Ep. 198)

In this episode, I am with Chip Kent, chief data scientist at Deephaven Data Labs . We speak about streaming data, real-time, and other powerful tools part of the Deephaven platform. Links Deephaven - https://deephaven.io Deephaven Community Core Documentation - ​​ https://deephaven.io/core/docs/ Deephaven Community Slack - https://join.slack.com/t/deephavencommunity/shared_invite/zt-11x3hiufp-DmOMWDAvXv_pNDUlVkagLQ GitHub: Deephaven Community Core - https://github.com/deephaven/deephaven-core B...

May 27, 202224 minEp. 198

Learning from data to create personalized experiences with Matt Swalley from Omneky (Ep. 197)

In this episode I speak with Matt Swalley, Chief Business Officer of Omneky , an AI platform that generates, analyzes and optimizes personalized ad creatives at scale. We speak about the way AI is used for generating customized recommendation and creating experiences with data aggregation and analytics. And yes! respecting the privacy of individuals. Links Grow your business with personalized ads https://www.omneky.com/ Data Science at Home Podcast (Live) https://www.twitch.tv/datascienceathome...

May 16, 202225 minEp. 197

State of Artificial Intelligence 2022 (Ep. 196)

Let's take a break and think about the state of AI in 2022. In this episode I summarize the long report from the Stanford Institute for Human-Centered Artificial Intelligence (HAI) Enjoy! References https://spectrum.ieee.org/artificial-intelligence-index

May 06, 202220 minEp. 196

Improving your AI by finding issues within data pockets (Ep. 195)

In this episode I have a conversation with, Itai Bar-Sinai, CPO & Cofounder of Mona. We speak about several interesting points about data and monitoring. Why is AI monitoring so different from monitoring classic software? How to reduce the gap between data science and business? What is the role of MLOps in the data monitoring field? With over 10 years of experience with AI and as the CPO and head of customer success at Mona, the leading AI monitoring intelligence company, Itai has a unique v...

Apr 21, 202233 minEp. 195

Fake data that looks, feels, and behaves like production.(Ep.194)

I am with Ander Steele, data scientist and mathematician with a passion for privacy and Shannon Bayatpur, product manager with a background in technical writing and computer science, from Tonic.ai. We speak about data. Fake data. But all we say is authentic. Links Tonic website Career page Neural networks for synthetic data...

Apr 13, 202226 minEp. 194

Batteries and AI in Automotive (Ep. 193)

In this episode my friend and I speak about AI, batteries and automotive. Dennis Berner, founder of Digitlabs has been operating in the field of automotive and batteries for a long time. His point of views are absolutely a must to listen to. Below a list of the links he mentioned in the show. https://amethix.com https://digitlabs.com https://www.moia.io https://www.elli.eco https://www.uber.com https://www.didiglobal.com/ https://waymo.com/ https://group.mercedes-benz.com/ https://www.fakultaet7...

Apr 01, 202237 minEp. 193

Collect data at the edge [RB] (Ep. 192)

In this episode I speak with Manavalan Krishnan from Tsecond about capturing massive amounts of data at the edge with security and reliability in mind. This episode is brought to you by NordVPN NordVPN protects your privacy while you are online. Get secure and private access to the internet by surfing nordvpn.com/DATASCIENCE or use coupon code DATASCIENCE and get a massive discount. and by Amethix Technologies Amethix use advanced Artificial Intelligence and Machine Learning to build data platfo...

Mar 25, 202236 minEp. 192

Bayesian Machine Learning with Ravin Kumar (Ep. 191)

This is one episode where passion for math, statistics and computers are merged. I have a very interesting conversation with Ravin, data scientist at Google where he uses data to inform decisions. He has previously worked at Sweetgreen, designing systems that would benefit team members and communities through sustainable and healthy food, and SpaceX, creating tools that would ultimately launch rocket ships. All opinions in this episode are his own and none of the companies he has worked for are ...

Mar 19, 202231 minEp. 191

What is spatial data science? With Matt Forest from Carto (Ep. 190)

In this episode I am with Matt Forrest, VP of Solutions Engineering at Carto. We speak about machine learning applied to spatial data, spatial SQL and GIS (Geographic Information System). Enjoy the show! This episode is brought to you by RailzAI The Railz API connects to major accounting platforms to provide you with quick access to normalized and analyzed financial data. Get free access to their API and more. Just tell them you came through Data Science at Home podcast. and by Amethix Technolog...

Mar 02, 202226 minEp. 190

Connect. Collect. Normalize. Analyze. An interview with the people from Railz AI (Ep. 189)

In this episode I am with Pasha Zavari - Director of Data Science and Derek Manuge - Co-founder and CTO at Railz. Railz is a very interesting company with an incredible mission: normalizing and extracting insights from the most tedious data out there, financial data. Guess what technology stack are they on? Enjoy the show! This episode is brought to you by RailzAI The Railz API connects to major accounting platforms to provide you with quick access to normalized and analyzed financial data. Spon...

Feb 22, 202247 minSeason 5Ep. 189
For the best experience, listen in Metacast app for iOS or Android