Data Science at Home

Francesco Gadaleta | datascienceathome.podbean.com

Cutting through AI bullsh*t.
Come join the discussion on Discord!
https://discord.gg/4UNKGf3

Episodes

Pandas vs Rust [RB] (Ep. 158)

Sponsors: Get one of the best VPNs at a massive discount with coupon code DATASCIENCE. It gives you an 83% discount, which unlocks the best price on the market, plus 3 extra months for free. Here is the link: https://surfshark.deals/DATASCIENCE

Jul 01, 2021 | 32 min | Ep. 160

A simple trick for very unbalanced data (Ep. 157)

Data from the real world are never perfectly balanced. In this episode I explain a simple yet effective trick to train models with very unbalanced data. Enjoy the show! Sponsors: Get one of the best VPNs at a massive discount with coupon code DATASCIENCE. It gives you an 83% discount, which unlocks the best price on the market, plus 3 extra months for free. Here is the link: https://surfshark.deals/DATASCIENCE References: Leo Breiman, Random Forests, 2001; C. Chen, A. Liaw, L. Breiman, Using R...

Jun 22, 2021 | 22 min | Ep. 157

Time to take your data back with Tapmydata (Ep. 156)

In this episode I am with Gilbert Hill, head of strategy at Tapmydata (https://tapmydata.com/). We speak about personal data, blockchain, and the ability to control and monetize that data with another simple yet effective app in the ecosystem. References: https://tapmydata.com/ https://medium.com/@tholder/we-dont-want-your-data-pushing-boundaries-in-data-collection-and-end-to-end-encryption-for-apps-ebd1d5f79df5...

Jun 15, 2021 | 41 min | Ep. 156

True Machine Intelligence just like the human brain (Ep. 155)

In this episode I have a really interesting conversation with Karan Grewal, member of the research staff at Numenta where he investigates how biological principles of intelligence can be translated into silicon. We speak about the thousand brains theory and why neural networks forget. References Main paper on the Thousand Brains Theory: https://www.frontiersin.org/articles/10.3389/fncir.2018.00121/full Blog post on Thousand Brains Theory: https://numenta.com/blog/2019/01/16/the-thousand-brains-t...

Jun 04, 2021 | 34 min | Ep. 155

Delivering unstoppable data with Streamr (Ep. 154)

Delivering unstoppable data to unstoppable apps is now possible with Streamr Network. Streamr is a layer zero protocol for real-time data which powers the decentralized Streamr pub/sub network. The technology works in tandem with companion blockchains, currently Ethereum and xDai chain, which are used for identity, security and payments. On top is the application layer, including the Data Union framework, Marketplace and Core, and all third-party applications. In this episode I have a very inte...

May 26, 2021 | 43 min | Ep. 154

MLOps: the good, the bad and the ugly (Ep. 153)

Our Sponsor: Amethix uses advanced Artificial Intelligence and Machine Learning to build data platforms and predictive engines in domains like finance, healthcare, pharmaceuticals, logistics, and energy. Amethix provides solutions to collect and secure data with higher transparency and disintermediation, and builds the statistical models that will support your business.

May 24, 2021 | 25 min | Ep. 153

MLOps: what is and why it is important Part 2 (Ep. 152)

Our Sponsor: Amethix uses advanced Artificial Intelligence and Machine Learning to build data platforms and predictive engines in domains like finance, healthcare, pharmaceuticals, logistics, and energy. Amethix provides solutions to collect and secure data with higher transparency and disintermediation, and builds the statistical models that will support your business.

May 19, 2021 | 31 min | Ep. 152

MLOps: what is and why it is important (Ep. 151)

If you think that knowing Tensorflow and Scikit-learn is enough, think again. MLOps is one of those trendy terms today. What is MLOps and why is it important? In this episode I speak about the undeniable evolution of the data scientist in the last 5-10 years. Sponsors If building software is your passion, you’ll love ThoughtWorks Technology Podcast . It’s a podcast for techies by techies. Their team of experienced technologists take a deep dive into a tech topic that’s piqued their interest — it...

May 11, 2021 | 33 min | Ep. 151

Can I get paid for my data? With Mike Audi from Mytiki (Ep. 150)

Your data is worth thousands a year. Why aren’t you getting your fair share? There is a company that has a mission: they want you to take back control and get paid for your data. In this episode I speak about knowledge graphs, data confidentiality and privacy with Mike Audi, CEO of MyTiki. You can reach them on their website https://mytiki.com/ Discord official channel https://discord.com/invite/evjYQq48Be Telegram https://t.me/mytikiapp Signal https://signal.group/#CjQKIA66Eq2VHecpcCd-cu-dziozM...

Apr 28, 2021 | 39 min | Ep. 150

Building high-growth data businesses with Lillian Pierson (Ep. 149)

In this episode I have an amazing conversation with Lillian Pierson from data-mania.com. This is an action-packed episode on how data professionals can quickly convert their data expertise into high-growth data businesses by selecting optimal business models, revenue models, and pricing structures. If you want to know more or get in touch with Lillian, follow the links below: Weekly Free Trainings: We currently publish 1 free training per week on YouTube! https://www.youtube.com/channel/UCK4...

Apr 19, 2021 | 26 min | Ep. 149

Learning and training in AI times (Ep. 148)

Is there a gap between life sciences and data science? What's the situation when it comes to interdisciplinary research? In this episode I am with Laura Harris, Director of Training for the Institute of Cyber-Enabled Research (ICER) at Michigan State University (MSU), and we try to answer some of those questions. You can contact Laura at training@msu.edu or on LinkedIn...

Apr 13, 2021 | 32 min | Ep. 148

You are the product [RB] (Ep. 147)

In this episode I am with George Hosu from Cerebralab and we speak about how dangerous it is not to pay for the services you use and, as a consequence, how dangerous it is to let an algorithm decide what you like. Our Sponsors: This episode is supported by Chapman’s Schmid College of Science and Technology, where master’s and PhD students join in cutting-edge research as they prepare to take the next big leap in their professional journey. To learn more about the innovative tools and colla...

Apr 11, 2021 | 45 min | Ep. 147

Polars: the fastest dataframe crate in Rust - with Ritchie Vink (Ep. 146)

In this episode I speak with Ritchie Vink, the author of Polars, a crate that is the fastest dataframe library at the time of recording :) If you want to participate in an amazing Rust open source project, this is your chance to contribute to the official repository listed in the references. References: https://github.com/ritchie46/polars

Apr 08, 2021 | 33 min | Ep. 146

Apache Arrow, Ballista and Big Data in Rust with Andy Grove (Ep. 145)

Do you want to know the latest in big data analytics frameworks? Have you ever heard of Apache Arrow? Rust? Ballista? In this episode I speak with Andy Grove, one of the main authors of Apache Arrow and the Ballista compute engine. Andy explains some of the challenges he faced while designing the Arrow and Ballista memory models, and he describes some amazing solutions. Our Sponsors: This episode is supported by Chapman’s Schmid College of Science and Technology, where master’s and PhD students join in cutting...

Mar 26, 2021 | 30 min | Ep. 145

Pandas vs Rust (Ep. 144)

Pandas is the de-facto standard for data loading and manipulation. Python is the de-facto programming language for such operations. Rust is the underdog. Or is it? In this episode I am showing you why that is no longer the case. Our Sponsors This episode is supported by Chapman’s Schmid College of Science and Technology, where master’s and PhD students join in cutting-edge research as they prepare to take the next big leap in their professional journey. To learn more about the innovative tools a...

Mar 19, 2021 | 32 min | Ep. 144

Concurrent is not parallel - Part 2 (Ep. 143)

In plain English, concurrent and parallel are synonyms. Not for a CPU. And definitely not for programmers. In this episode I summarize the ways to parallelize on different architectures and operating systems. Rock-star data scientists must know how concurrency works and when to use it IMHO. Our Sponsors This episode is supported by Chapman’s Schmid College of Science and Technology, where master’s and PhD students join in cutting-edge research as they prepare to take the next big leap in their p...

Mar 13, 2021 | 15 min | Ep. 143

Concurrent is not parallel - Part 1 (Ep. 142)

In plain English, concurrent and parallel are synonyms. Not for a CPU. And definitely not for programmers. In this episode I summarize the ways to parallelize on different architectures and operating systems. Rock-star data scientists must know how concurrency works and when to use it IMHO. Our Sponsors This episode is supported by Chapman’s Schmid College of Science and Technology, where master’s and PhD students join in cutting-edge research as they prepare to take the next big leap in their p...

Mar 10, 2021 | 32 min | Ep. 142

Backend technologies for machine learning in production (Ep. 141)

This is one of the most dynamic and fascinating topics: API technologies for machine learning. It's always fun to build ML models. But how about serving them in the real world? In this episode I speak about three must-know technologies to place your model behind an API. Our Sponsors This episode is supported by Chapman’s Schmid College of Science and Technology, where master’s and PhD students join in cutting-edge research as they prepare to take the next big leap in their professional journey. ...

Mar 02, 2021 | 25 min | Ep. 141

You are the product (Ep. 140)

In this episode I am with George Hosu from Cerebralab and we speak about how dangerous it is not to pay for the services you use and, as a consequence, how dangerous it is to let an algorithm decide what you like. Our Sponsors: This episode is supported by Chapman’s Schmid College of Science and Technology, where master’s and PhD students join in cutting-edge research as they prepare to take the next big leap in their professional journey. To learn more about the innovative tools and colla...

Feb 22, 2021 | 45 min | Ep. 140

How to reinvent banking and finance with data and technology (Ep. 139)

The financial system is changing. It is becoming more efficient and integrated with many more services making our life more... digital. Is the old banking system doomed to fail? Or will it just be disrupted by the smaller players of the fintech industry? In this episode we answer some of these fundamental questions with Alessandro E. Hatami from Pacemakers Subscribe to the Newsletter and come chat with us on the official Discord channel Our Sponsors This episode is supported by Chapman’s Schmid ...

Feb 15, 2021 | 37 min | Ep. 139

Is Rust flexible enough for a flexible data model? (Ep. 137)

In this podcast I take inspiration from Paul Done's presentation about The Six Principles for Building Robust Yet Flexible Shared Data Applications, and show how powerful a language Rust is while still maintaining the flexibility of less strict languages. Our Sponsor: This episode is supported by Chapman’s Schmid College of Science and Technology, where master's and PhD students join in cutting-edge research as they prepare to take the next big leap in their professional journey. To learn more abou...

Feb 01, 2021 | 29 min | Ep. 137

Is Apple M1 good for machine learning? (Ep. 136)

In this episode I explain the basics of computer architecture and introduce some features of the Apple M1. Is it good for Machine Learning tasks? References: Computer architectures book https://www.amazon.com/Computer-Architecture-Quantitative-John-Hennessy/dp/012383872X Performance https://nod.ai/comparing-apple-m1-with-amx2-m1-with-neon/...

Jan 25, 2021 | 28 min | Ep. 136

Rust and deep learning with Daniel McKenna (Ep. 135)

In this episode I speak with Daniel McKenna about Rust, machine learning and artificial intelligence. You can find Daniel at http://github.com/xd009642 and https://twitter.com/xd009642 Don't forget to come join me in our Discord channel, where we speak about all things data science. Subscribe to the official Newsletter and never miss an episode...

Jan 18, 2021 | 23 min | Ep. 135

Scaling machine learning with clusters and GPUs (Ep. 134)

Let's finish this year with an amazing episode about scaling ML with clusters and GPUs. Kind of as a continuation of Episode 112 I have a terrific conversation with Aaron Richter from Saturn Cloud about, well, making ML faster and scaling it to massive infrastructure. Aaron can be reached on his website https://rikturr.com and Twitter @rikturr Our Sponsor Saturn Cloud is a data science and machine learning platform for scalable Python analytics. Users can jump into cloud-based Jupyter and Dask t...

Dec 31, 2020 | 31 min | Ep. 134

What is data ethics? (Ep. 133)

What is data ethics? In this episode I have an interesting chat with Denny Wong from FaqBot and Muna. Our Sponsor: Amethix uses advanced Artificial Intelligence and Machine Learning to build data platforms and predictive engines in domains like finance, healthcare, pharmaceuticals, logistics, and energy. Amethix provides solutions to collect and secure data with higher transparency and disintermediation, and builds the statistical models that will support your business. References: Denny's Twitter profile...

Dec 19, 2020 | 26 min | Ep. 133

A Standard for the Python Array API (Ep. 132)

Our Links Come join me in our Discord channel speaking about all things data science. Subscribe to the official Newsletter and never miss an episode Follow me on Twitch during my live coding sessions usually in Rust and Python Our Sponsors ProtonMail offers a simple and trusted solution to protect your internet connection and access blocked or restricted websites. All of ProtonMail and ProtonVPN’s apps are open source and have been inspected by cybersecurity experts, and Proton is based in Switz...

Dec 08, 2020 | 34 min | Ep. 132

What happens to data transfer after Schrems II? (Ep. 131)

In this episode Adam Leon Smith, CTO of DragonFly and an expert in data regulations, explains some of the consequences of Schrems II for data transfers from the EU to the US. For very interesting references and a practical example, subscribe to our Newsletter.

Dec 04, 2020 | 32 min | Ep. 131

Test-First Machine Learning [RB] (Ep. 130)

Our Links Come join me in our Discord channel speaking about all things data science. Subscribe to the official Newsletter and never miss an episode Follow me on Twitch during my live coding sessions usually in Rust and Python Our Sponsors ProtonMail offers a simple and trusted solution to protect your internet connection and access blocked or restricted websites. All of ProtonMail and ProtonVPN’s apps are open source and have been inspected by cybersecurity experts, and Proton is based in Switz...

Dec 01, 2020 | 21 min | Ep. 130

Similarity in Machine Learning (Ep. 129)

Come join me in our Discord channel speaking about all things data science. Follow me on Twitch during my live coding sessions usually in Rust and Python Subscribe to the official Newsletter and never miss an episode Our Sponsors ProtonMail offers a simple and trusted solution to protect your internet connection and access blocked or restricted websites. All of ProtonMail and ProtonVPN's apps are open source and have been inspected by cybersecurity experts, and Proton is based in Switzerland, ho...

Nov 24, 2020 | 30 min | Ep. 129