MLOps Coffee Sessions #110 with David Bayliss, Chief Data Scientist of LexisNexis Risk Solutions, Just Fetch the Data and then... co-hosted by Vishnu Rachakonda. // Abstract Composing data to extract features can be a significant problem. Key factors are the data size, compliance restrictions, and real-time data. Ethics (and law) can drive extremely complex audit requirements. In the cloud, you can do anything - at a price. // Bio One of the creators of the world's first big data platform (HPCC)...
Jul 29, 2022•52 min•Season 1Ep. 110
MLOps Coffee Sessions #109 with Ketan Umare, Co-founder and CEO of Union.ai, Why You Need More Than Airflow co-hosted by George Pearse. // Abstract Airflow is a beloved tool by data engineers and Machine Learning Engineers alike. But when doing ML what are the shortcomings and why is an orchestration tool like that not always the best developer experience? In this episode, we break down what some key drivers are for using an ML-specific orchestration tool. // Bio Ketan Umare is the CEO and co-fo...
Jul 23, 2022•1 hr 12 min•Season 1Ep. 109
MLOps Coffee Sessions #108 with Byron Allen, AI & ML Practice Lead at Contino, ML Flow vs Kubeflow 2022 co-hosted by George Pearse. // Abstract The amazing Byron Allen talks to us about why MLflow and Kubeflow are not playing the same game! ML flow vs Kubeflow is more like comparing apples to oranges or as he likes to make the analogy they are both cheese but one is an all-rounder and the other a high-class delicacy. This can be quite deceiving when analyzing the two. We do a deep div...
Jul 19, 2022•1 hr 6 min•Season 1Ep. 108
MLOps Coffee Sessions #107 with Ryan Russon, Manager, MLOps and Data Science of Maven Wave Partners, Why and When to Use Kubeflow for MLOps co-hosted by Mihail Eric. // Abstract Kubeflow is an excellent platform if your team is already leveraging Kubernetes and allows for a truly collaborative experience. Let’s take a deep dive into the pros and cons of using Kubeflow in your MLOps. // Bio From serving as an officer in the US Navy to Consulting for some of America's largest corpora...
Jul 11, 2022•59 min•Season 1Ep. 107
MLOps Coffee Sessions #106 with Delina Ivanova, Associate Director, Data of HelloFresh, Building a Culture of Experimentation to Speed Up Data-Driven Value co-hosted by Vishnu Rachakonda. // Abstract Supply chain/manufacturing are prime areas where the use of data science/analytics/ ML is underdeveloped, and experimentation is required to collect data and enable data-driven solutions. This talk encourages companies to conduct experiments and collect data over time in order to build accurate/scal...
Jul 05, 2022•54 min•Season 1Ep. 106
MLOps Coffee Sessions #106 with Curtis Northcutt, CEO & Co-Founder of Cleanlab, Cleanlab: Labeled Datasets that Correct Themselves Automatically co-hosted by Vishnu Rachakonda. // Abstract Pioneered at MIT by 3 Ph.D. Co-Founders, Cleanlab is an open-source/SaaS company building the premier data-centric AI tools workflows for (1) automatically correcting messy data and labels, (2) auto-tracking of dataset quality over time, (3) automatically finding classes to merge and delete, (4) auto ml fo...
Jul 01, 2022•1 hr 6 min•Season 1Ep. 105
MLOps Coffee Sessions #104 with the creator of Apache Airflow and Apache Superset Maxime Beauchemin, Future of BI co-hosted by Vishnu Rachakonda. // Abstract // Bio Maxime Beauchemin is the founder and CEO of Preset. Original creator of Apache Superset. Max has worked at the leading edge of data and analytics his entire career, helping shape the discipline in influential roles at data-dependent companies like Yahoo!, Lyft, Airbnb, Facebook, and Ubisoft. // MLOps Jobs board https://m...
Jun 24, 2022•52 min•Season 1Ep. 104
MLOps Coffee Sessions #103 with Corey Zumar, MLOps Podcast on Making MLflow co-hosted by Mihail Eric. // Abstract Because MLOps is a broad ecosystem of rapidly evolving tools and techniques, it creates several requirements and challenges for platform developers: - To serve the needs of many practitioners and organizations, it's important for MLOps platforms to support a variety of tools in the ecosystem. This necessitates extra scrutiny when designing APIs, as well as rigorous testing str...
Jun 17, 2022•1 hr 5 min•Season 1Ep. 103
MLOps Coffee Sessions #102 with Yash Sheth, Fixing Your ML Data Blindspots co-hosted by Adam Sroka. // Abstract Improving your dataset quality is absolutely critical for effective ML. Finding errors in your datasets is generally a slow, iterative, and painstaking process. Data scientists should be proactively fixing their model’s blindspots by improving their training data. In this talk, Yash discusses how Galileo helps data scientists identify, fix, and track data across th...
Jun 10, 2022•52 min•Season 1Ep. 102
MLOps Coffee Sessions #101 with Piero Molino, Declarative Machine Learning Systems: Big Tech Level ML Without a Big Tech Team co-hosted by Vishnu Rachakonda. // Abstract Declarative Machine Learning Systems are the next step in the evolution of Machine Learning infrastructure. With such systems, organizations can marry the flexibility of low-level APIs with the simplicity of AutoML. Companies adopting such systems can increase the speed of machine learning development, reaching the quality and s...
Jun 03, 2022•59 min•Season 1Ep. 101
Lightning Sessions #1 with Peeyush Agarwal, Scaling Real-time Machine Learning at Chime. // Abstract In this Lighting Talk, Peeyush Agarwal explains 2 key pieces of the ML infrastructure at Chime. Peeyush goes into detail about the current feature store design and feature monitoring process along with the ML monitoring setup. This Lighting Talk is brought to you by arize.com reach out to them for all of your ML monitoring needs. // Bio Peeyush Agarwal is the Lead Software Engineer, ML Platform a...
May 27, 2022•24 min•Season 1Ep. 1
MLOps Coffee Sessions #100 with Matthijs Brouns, MLOps Critiques co-hosted by David Aponte. // Abstract MLOps is too tool-driven, don't let FOMO drive you to pick the latest feature/model/evaluation/ store but pay closer attention to what you actually need to release more safely and reliably. // Bio Matthijs is a Machine Learning Engineer, active in Amsterdam, The Netherlands. His current work involves training MLEs at Xccelerated.io. This means Matthijs divides his time between building new tra...
May 27, 2022•50 min•Season 1Ep. 100
MLOps Coffee Sessions #99 with Ronen Dar and Gijsbert Janssen van Doorn, Getting the Most Out of your AI Infrastructure co-hosted by Vishnu Rachakonda. // Abstract Run:AI is building a cloud-based platform for building with AI. In this talk, we hear all about why this need exists, how this works, and what value it creates. // Bio Ronen Dar Run:AI Co-founder and CTO Ronen was previously a research scientist at Bell Labs and has worked at Apple and Intel in multiple R&D roles. As CTO, R...
May 20, 2022•1 hr 4 min•Season 1Ep. 99
MLOps Coffee Sessions #98 with Brannon Dorsey, Racing the Playhead: Real-time Model Inference in a Video Streaming Environment co-hosted by Vishnu Rachakonda. // Abstract Runway ML is doing an incredibly cool workaround applying machine learning to video editing. Brannon is a software engineer there and he’s here to tell us all about machine learning in video and how Runway maintains their machine learning infrastructure. // Bio Brannon Dorsey is an early employee at Runway, where he leads the B...
May 12, 2022•58 min•Season 1Ep. 98
MLOps Coffee Sessions #97 with Jacob Tsafatinos, Real-Time Exactly-Once Event Processing with Apache Flink, Kafka, and Pinot co-hosted by Mihail Eric. // Abstract A few years ago Uber set out to create an ads platform for the Uber Eats app that relied heavily on three pillars; Speed, Reliability, and Accuracy. Some of the technical challenges they were faced with included exactly-once semantics in real-time. To accomplish this goal, they created the architecture diagram above with lots of love f...
May 05, 2022•54 min•Season 1Ep. 97
MLOps Coffee Sessions #96 with Sebastián Ramírez, FastAPI for Machine Learning co-hosted by Adam Sroka. // Abstract Fast API almost never happened. Sebastián Ramírez, the creator of FastAPI, tried as hard as possible not to build something new. After many failed attempts at finding what he was looking for he decided to scratch his own itch and build a new product. The conversation goes over what Fast API is, how Sebastián built it, what the next big problems to tackle in ML are, and...
May 02, 2022•53 min•Season 1Ep. 96
MLOps Coffee Sessions #95 with Ciro Greco, MLOps as Tool to Shape Team and Culture. // Abstract Good MLOps practices are a way to operationalize a more “vertical” practice and blur the boundaries between different stages of “production-ready”. Sometimes you have this idea that production-ready means global availability but with ML products that need to be constantly tested against real-world data, we believe production-ready should be a continuum and that the key person that drives that needs to...
Apr 25, 2022•43 min•Season 1Ep. 95
MLOps Coffee Sessions #94 with Mark Freeman, Traversing the Data Maturity Spectrum: A Startup Perspective. // Abstract A lot of companies talk about having ML and being data-driven, but few are there currently and doing it well. If anything, many companies are on the cusp of implementing ML rather than being ML mature. As a startup, what decisions are we making today to drive data maturity and set us up for success when we further implement ML in the near future. What business cases are w...
Apr 21, 2022•46 min•Season 1Ep. 94
MLOps Coffee Sessions #93 with Krishnaram Kenthapadi, Model Monitoring in Practice: Top Trends co-hosted by Mihail Eric // Abstract We first motivate the need for ML model monitoring, as part of a broader AI model governance and responsible AI framework, and provide a roadmap for thinking about model monitoring in practice. We then present findings and insights on model monitoring in practice based on interviews with various ML practitioners spanning domains such as financial services, healthcar...
Apr 14, 2022•52 min•Season 1Ep. 93
MLOps Coffee Sessions #92 with Pete Soderling, Building the World's First Data Engineering Conference. // Abstract Keep things centered around community building and what he looks for in teams. Folks that are building their community around their tool, what advice do you have for that? What's worth turning into a company? // Bio Pete Soderling is the founder of Data Council and the Data Community Fund. As a former software engineer, repeat founder, and investor in more than 40 data-oriented star...
Apr 11, 2022•42 min•Season 1Ep. 92
MLOps Coffee Sessions #91 with Joseph Haaga, The Shipyard: Lessons Learned While Building an ML Platform / Automating Adherence. // Abstract Joseph Haaga and the Interos team walk us through their design decisions in building an internal data platform. Joseph talks about why their use case wasn't a fit for off the self solutions, what their internal tool snitch does, and how they use git as a model registry. Shipyard blogpost series: https://medium.com/interos-engineering . // Bio Joseph ...
Apr 07, 2022•40 min•Season 1Ep. 91
MLOps Coffee Sessions #90 with Valerio Velardo, Bringing Audio ML Models into Production. // Abstract The majority of audio/music tech companies that employ ML still don’t use MLOps regularly. In these companies, you rarely find audio ML pipelines which take care of the whole ML lifecycle in a reliable and scalable manner. Audio ML probably pays the price of being a small sub-discipline of ML. It’s dwarfed by ML applications in image processing and NLP. In audio ML, novelties tend to travel slow...
Apr 04, 2022•51 min•Season 1Ep. 90
MLOps Coffee Sessions #89 with Gabriel Straub, A Journey in Scaling AI. // Abstract Gabriel talks to us about the difficulties of scaling ML products across an organization. He speaks about differences in profiles of data consumers and data producers, and the challenges of educating engineers so they have greater insights into the effects that their changes to the system may have. // Bio Gabriel joined Ocado Technology in 2020 as Chief Data Officer, bringing over 10 years of experience in...
Mar 31, 2022•53 min•Season 1Ep. 89
MLOps Coffee Sessions #88 with Javier Andres Mansilla, ML Platform Tradeoffs and Wondering Why to Use Them. // Abstract Javier runs ML Platform at Mercado Libre. We’re here with Javier because he’s going to tell us about what the ML platform at Mercado Libre looks like granularly, talk about its purpose, lessons, wins, and future improvements, and share with us some of the most challenging use cases they’ve had to engineer around. // Bio During the last 3 years building the internal ML platform ...
Mar 28, 2022•54 min•Season 1Ep. 88
MLOps Coffee Sessions #87 with Kyle Morris, Don't Listen Unless You Are Going to Do ML in Production. // Abstract Companies wanting to leverage ML specializes in model quality (architecture, training method, dataset), but face the same set of undifferentiated work they need to productionize the model. They must find machines to deploy their model on, set it up behind an API, make the inferences fast, cheap, reliable by optimizing hardware, load-balancing, autoscaling, clustering launches per reg...
Mar 17, 2022•52 min•Season 1Ep. 87
MLOps Coffee Sessions #86 with Julien Bisconti, Building ML/Data Platform on Top of Kubernetes. // Abstract When building a platform, a good start would be to define the goals and features of that platform, knowing it will evolve. Kubernetes is established as the de facto standard for scalable platforms but it is not a fully-fledged data platform. Do ML engineers have to learn and use Kubernetes directly? They probably shouldn't. So it is up to the data engineering team to p...
Mar 12, 2022•48 min•Season 1Ep. 86
MLOps Coffee Sessions #85 with Emmanuel Ameisen, Continuous Deployment of Critical ML Applications. // Abstract Finding an ML model that solves a business problem can feel like winning the lottery, but it can also be a curse. Once a model is embedded at the core of an application and used by real users, the real work begins. That's when you need to make sure that it works for everyone, that it keeps working every day, and that it can improve as time goes on. Just like building a model is all abo...
Mar 10, 2022•45 min•Season 1Ep. 85
MLOps Coffee Sessions #84 with Ernest Chan, Lessons from Studying FAANG ML Systems. // Abstract Large tech companies invest in ML platforms to accelerate their ML efforts. Become better prepared to solve your own MLOps problems by learning from their technology and design decisions. Tune in to learn about ML platform components, capabilities, and design considerations. // Bio Ernest is a Data Scientist at Duo Security. As part of the core team that built Duo's first ML-powered product, Duo Trust...
Mar 02, 2022•46 min•Season 1Ep. 84
MLOps Coffee Sessions #83 with Vincent Warmerdam, Better Use cases for Text Embeddings. // Abstract Text embeddings are very popular, but there are plenty of reasons to be concerned about their applications. There's algorithmic fairness, compute requirements as well as issues with datasets that they're typically trained on. In this session, Vincent gives an overview of some of these properties while also talking about an underappreciated use-case for the embeddings: labeling! // Bio Vincent D. W...
Feb 28, 2022•48 min•Season 1Ep. 83
MLOps Reading Group meeting on February 11, 2022 Reading Group Session about Feature Stores with Matt Delacour and Mike Moran --------------- ✌️Connect With Us ✌️ ------------- Join our slack community: https://go.mlops.community/slack Follow us on Twitter: @mlopscommunity Connect with us on LinkedIn: https://www.linkedin.com/company/mlopscommunity/ Sign up for the next meetup: https://go.mlops.community/register Catch all episodes, Feature Store, Machine Learning Monitoring and Bl...
Feb 23, 2022•50 min