In this episode, we dive into the fascinating world of zero-knowledge proofs and their impact on data science. Zero-knowledge proofs allow one party to prove to another that they know a secret without revealing the secret itself. This powerful concept has numerous applications in data science, from ensuring data privacy and security, to facilitating secure transactions and identity verification. We explore the mechanics of zero-knowledge proofs, its real-world applications, and how it is revolut...
Feb 27, 2023•16 min•Ep. 220
Deep learning methods are not as effective with tabular data. Here is why, and what to do about it. Sponsors If you're ready to take your WiFi game to the next level, head over to asus.click/ZenWiFi_XD5 or check out the show notes for this episode. Trust me, with ASUS ZenWiFi XD5, you'll get the best WiFi experience ever! References https://paperswithcode.com/methods/category/deep-tabular-learning https://m-clark.github.io/posts/2022-04-01-more-dl-for-tabular/...
Feb 21, 2023•28 min•Ep. 205
In this episode I speak about online learning systems and why blindly choosing such a paradigm can lead to very unpredictable and expensive outcomes. Also in this episode, I have to deal with an intruder :) Links Birman, K.; Joseph, T. (1987). "Exploiting virtual synchrony in distributed systems". Proceedings of the Eleventh ACM Symposium on Operating Systems Principles - SOSP '87 . pp. 123–138. doi : 10.1145/41457.37515 . ISBN 089791242X . S2CID 7739589 ....
Feb 15, 2023•29 min•Ep. 219
In this episode, I'll be discussing the capabilities and limitations of ChatGPT, an advanced language AI model. I'll go over its power to understand and respond to natural language, and its applications in tasks such as language translation and text summarization. However, I'll also touch on the challenges that still need to be overcome such as bias and data privacy concerns. Tune in for a comprehensive look at the current state of advanced language AI. References https://datascienceathome.com/h...
Jan 26, 2023•31 min•Ep. 218
In this episode I am with Kevin McNamara, founder and CEO of Parallel Domain. We speak about a very effective method to generate synthetic data that is currently in production at Parallel Domain. Enjoy the show! References Parallel Domain Synthetic Data Improves Cyclist Detection (blog post): https://paralleldomain.com/parallel-domain-synthetic-data-improves-cyclist-detection/ Beating the State of the Art in Object Tracking with Synthetic Data: https://paralleldomain.com/beating-the-state-of-the...
Jan 14, 2023•42 min•Ep. 217
Our Sponsors NordPass Business has developed a password manager, that will save you a lot of time and energy whenever you need access to business accounts, work across devices, even with the other members of your team, or whenever you need to share sensitive data with your colleagues, or make payments efficiently. All this with the highest standard of cyber secure technology. See NordPass Business in action now with a 3-month free trial here https://nordpass.com/DATASCIENCE with code DATASCIENCE...
Dec 13, 2022•21 min•Ep. 216
Is it possible to reconstruct a 3D model from a simple image? Under certain constraints, it is! In this episode I tell you how. Our Sponsors Explore the Complex World of Regulations. Compliance can be overwhelming. Multiple frameworks. Overlapping requirements. Let Arctic Wolf be your guide. Check it out at https://arcticwolf.com/datascience Amethix works to create and maximize the impact of the world’s leading corporations and startups, so they can create a better future for everyone they serve...
Dec 08, 2022•23 min•Ep. 215
What if we borrowed from physics some theories that would interpret deep learning and machine learning in general? Here is a list of plausible ways to interpret our beloved ML models and understand why they works, or they don't. Enjoy the show! Our Sponsors NordPass Business has developed a password manager, that will save you a lot of time and energy whenever you need access to business accounts, work across devices, even with the other members of your team, or whenever you need to share sensit...
Dec 02, 2022•24 min•Ep. 214
If you think that the problem of self-driving cars has been solved, think twice. As a matter of fact, the problem of self-driving cars cannot be solved with the technical solutions that companies are currently considering. Don't get fooled by marketing and PR on social media. Whoever is telling you they solved the problem of driving a vehicle fully autonomously, they are lying. Here is why. Our Sponsors Explore the Complex World of Regulations. Compliance can be overwhelming. Multiple frameworks...
Nov 21, 2022•35 min•Ep. 213
Let's look at the history of data platforms. How did they evolve? Why? Shall I switch to the latest architecture? Enjoy the show! Our Sponsors Explore the Complex World of Regulations. Compliance can be overwhelming. Multiple frameworks. Overlapping requirements. Let Arctic Wolf be your guide. Check it out at https://arcticwolf.com/datascience Amethix works to create and maximize the impact of the world’s leading corporations and startups, so they can create a better future for everyone they ser...
Nov 08, 2022•18 min•Ep. 212
Companies and other business entities are actively involved in defining data products and applied research every year. Academia has always played a role in creating new methods and solutions/algorithms in the fields of machine learning and artificial intelligence. However, there is doubt about how powerful and effective such research efforts are. Is studying AI in academia a waste of time? Our Sponsors Ready to advance your career in data science? University of Cincinnati Online offers nationall...
Nov 02, 2022•20 min•Ep. 211
There are many solutions to private machine learning. I am pretty confident when I say that the one we are speaking in this episode is probably one of the most feasible and reliable. I am with Daniel Huynh, CEO of Mithril Security, a graduate from Ecole Polytechnique with a specialisation in AI and data science. He worked at Microsoft on Privacy Enhancing Technologies under the office of the CTO of Microsoft France. He has written articles on Homomorphic Encryptions with the CKKS explained serie...
Oct 25, 2022•27 min•Ep. 206
Our Sponsors Ready to advance your career in data science? University of Cincinnati Online offers nationally recognized educational programs in business analytics and information systems. Predictive Analytics Today named UC as the No.1 MS Data Science school in the country and is nationally recognized with a proven track record of placing students at high-profile companies such as Google, Amazon and P&G. Discover more about the University of Cincinnati’s 100% online master’s degree programs ...
Oct 15, 2022•21 min•Ep. 210
That deep learning alone is not sufficient to solve artificial general intelligence, is more and more accepted statement. Generalist agents have great properties that can overcome some of the limitations of single-task deep learning models. Be aware, we are still far from AGI, though. So what are generalist agents? References https://arxiv.org/pdf/2205.06175
Oct 05, 2022•21 min•Ep. 209
How does an autonomous vehicle see? How does it sense the road? They are equipped of many sensors, of course. Are they all powerful enough? Small enough to hide them and make your car look beautiful? In this episode I speak about LIDAR, high resolution cameras and some machine learning methods adapted to a minimal number of sensors. Our Sponsors Ready to advance your career in data science? University of Cincinnati Online offers nationally recognized educational programs in business analytics an...
Sep 28, 2022•20 min•Ep. 202
Sometimes applications crash. Some other times applications crash because memory is exhausted. Such issues exist because of bugs in the code, or heavy memory usage for reasons that were not expected during design and implementation. Can we use machine learning to predict and eventually detect out of memory kills from the operating system? Apparently, the Netflix app many of us use on a daily basis leverage ML and time series analysis to prevent OOM-kills. Enjoy the show! Our Sponsors Explore the...
Sep 20, 2022•20 min•Ep. 203
Companies and other business entities are actively involved in defining data products and applied research every year. Academia has always played a role in creating new methods and solutions/algorithms in the fields of machine learning and artificial intelligence. However, there is doubt about how powerful and effective such research efforts are. Is studying AI in academia a waste of time? Our Sponsors Explore the Complex World of Regulations. Compliance can be overwhelming. Multiple frameworks....
Sep 13, 2022•18 min•Ep. 204
Neural networks are becoming massive monsters that are hard to train (without the "regular" 12 last-generation GPUs). Is there a way to skip that? Let me introduce you to Zero-Cost proxies References https://www.technologyreview.com/2022/08/05/1056814/automation-ai-machine-learning-automl/ https://iclr-blog-track.github.io/2022/03/25/zero-cost-proxies/...
Sep 07, 2022•20 min•Ep. 201
In this episode I speak about online learning systems and why blindly choosing such a paradigm can lead to very unpredictable and expensive outcomes. Also in this episode, I have to deal with an intruder :) Links Birman, K.; Joseph, T. (1987). "Exploiting virtual synchrony in distributed systems". Proceedings of the Eleventh ACM Symposium on Operating Systems Principles - SOSP '87 . pp. 123–138. doi : 10.1145/41457.37515 . ISBN 089791242X . S2CID 7739589 ....
Jun 13, 2022•29 min•Ep. 200
That deep learning alone is not sufficient to solve artificial general intelligence, is more and more accepted statement. Generalist agents have great properties that can overcome some of the limitations of single-task deep learning models. Be aware, we are still far from AGI, though. So what are generalist agents? References https://arxiv.org/pdf/2205.06175
Jun 03, 2022•21 min•Ep. 199
In this episode, I am with Chip Kent, chief data scientist at Deephaven Data Labs . We speak about streaming data, real-time, and other powerful tools part of the Deephaven platform. Links Deephaven - https://deephaven.io Deephaven Community Core Documentation - https://deephaven.io/core/docs/ Deephaven Community Slack - https://join.slack.com/t/deephavencommunity/shared_invite/zt-11x3hiufp-DmOMWDAvXv_pNDUlVkagLQ GitHub: Deephaven Community Core - https://github.com/deephaven/deephaven-core B...
May 27, 2022•24 min•Ep. 198
In this episode I speak with Matt Swalley, Chief Business Officer of Omneky , an AI platform that generates, analyzes and optimizes personalized ad creatives at scale. We speak about the way AI is used for generating customized recommendation and creating experiences with data aggregation and analytics. And yes! respecting the privacy of individuals. Links Grow your business with personalized ads https://www.omneky.com/ Data Science at Home Podcast (Live) https://www.twitch.tv/datascienceathome...
May 16, 2022•25 min•Ep. 197
Let's take a break and think about the state of AI in 2022. In this episode I summarize the long report from the Stanford Institute for Human-Centered Artificial Intelligence (HAI) Enjoy! References https://spectrum.ieee.org/artificial-intelligence-index
May 06, 2022•20 min•Ep. 196
In this episode I have a conversation with, Itai Bar-Sinai, CPO & Cofounder of Mona. We speak about several interesting points about data and monitoring. Why is AI monitoring so different from monitoring classic software? How to reduce the gap between data science and business? What is the role of MLOps in the data monitoring field? With over 10 years of experience with AI and as the CPO and head of customer success at Mona, the leading AI monitoring intelligence company, Itai has a unique v...
Apr 21, 2022•33 min•Ep. 195
I am with Ander Steele, data scientist and mathematician with a passion for privacy and Shannon Bayatpur, product manager with a background in technical writing and computer science, from Tonic.ai. We speak about data. Fake data. But all we say is authentic. Links Tonic website Career page Neural networks for synthetic data...
Apr 13, 2022•26 min•Ep. 194
In this episode my friend and I speak about AI, batteries and automotive. Dennis Berner, founder of Digitlabs has been operating in the field of automotive and batteries for a long time. His point of views are absolutely a must to listen to. Below a list of the links he mentioned in the show. https://amethix.com https://digitlabs.com https://www.moia.io https://www.elli.eco https://www.uber.com https://www.didiglobal.com/ https://waymo.com/ https://group.mercedes-benz.com/ https://www.fakultaet7...
Apr 01, 2022•37 min•Ep. 193
In this episode I speak with Manavalan Krishnan from Tsecond about capturing massive amounts of data at the edge with security and reliability in mind. This episode is brought to you by NordVPN NordVPN protects your privacy while you are online. Get secure and private access to the internet by surfing nordvpn.com/DATASCIENCE or use coupon code DATASCIENCE and get a massive discount. and by Amethix Technologies Amethix use advanced Artificial Intelligence and Machine Learning to build data platfo...
Mar 25, 2022•36 min•Ep. 192
This is one episode where passion for math, statistics and computers are merged. I have a very interesting conversation with Ravin, data scientist at Google where he uses data to inform decisions. He has previously worked at Sweetgreen, designing systems that would benefit team members and communities through sustainable and healthy food, and SpaceX, creating tools that would ultimately launch rocket ships. All opinions in this episode are his own and none of the companies he has worked for are ...
Mar 19, 2022•31 min•Ep. 191
In this episode I am with Matt Forrest, VP of Solutions Engineering at Carto. We speak about machine learning applied to spatial data, spatial SQL and GIS (Geographic Information System). Enjoy the show! This episode is brought to you by RailzAI The Railz API connects to major accounting platforms to provide you with quick access to normalized and analyzed financial data. Get free access to their API and more. Just tell them you came through Data Science at Home podcast. and by Amethix Technolog...
Mar 02, 2022•26 min•Ep. 190
In this episode I am with Pasha Zavari - Director of Data Science and Derek Manuge - Co-founder and CTO at Railz. Railz is a very interesting company with an incredible mission: normalizing and extracting insights from the most tedious data out there, financial data. Guess what technology stack are they on? Enjoy the show! This episode is brought to you by RailzAI The Railz API connects to major accounting platforms to provide you with quick access to normalized and analyzed financial data. Spon...
Feb 22, 2022•47 min•Season 5Ep. 189