MLOps Community Meetup #93! Two weeks ago, we talked to Chad Sanderson, Trustworthy Data for Machine Learning. //Abstract The most common challenge for ML teams operating at scale is data quality. In this talk, Chad discusses how Convoy invested in a large-scale data quality effort to treat data as an API and provide a data change management surface to enable trustworthy machine learning. // Bio Chad Sanderson is the Product Lead for Convoy's Data Platform team, which includes the data warehouse...
Feb 21, 2022•51 min•Season 1Ep. 93
MLOps Coffee Sessions #82 with Donna Schut and Christos Aniftos, Practitioners Guide to MLOps. // Abstract The "Practitioners Guide to MLOps" introduced excellent frameworks for how to think about the field. Can we talk about how you've seen the advice in that guide applied to real-world systems? Is there additional advice you'd add to that paper based on what you've seen since its publication and with new tools being introduced? Your article about selecting the right capabilities has a lot of g...
Feb 15, 2022•47 min•Season 1Ep. 82
MLOps Coffee Sessions #81 with Davis Treybig and Leigh Marie Braswell, Machine Learning from the Viewpoint of Investors. // Abstract Machine learning is a rapidly evolving space that can be hard to keep track of. Every year, thousands of research papers are published in the space, and hundreds of new companies are built both in applied machine learning as well as in machine learning tooling. In this podcast, we interview two investors who focus heavily on machine learning to get their take on th...
Feb 14, 2022•49 min•Season 1Ep. 81
MLOps Coffee Sessions #80 with Ale Solano, The Journey from Data Scientist to MLOps Engineer. // Abstract After years of failed POCs then all of a sudden one of our models is accepted and will be used in production. The next morning we are part of the main scrum stand-up meeting and a DevOps guy is assisting us. A strange feeling, unknown to us until then, starts growing on the AI team: we are useful! Deploying models to production is challenging, but MLOps is more than that. MLOps is about maki...
Feb 08, 2022•42 min•Season 1Ep. 80
MLOps Coffee Sessions #79 with Orr Shilon, Platform Thinking: A Lemonade Case Study. // Abstract This episode is the epitome of why people listen to our podcast. It’s a complete discussion of the technical, organizational, and cultural challenges of building a high-velocity, machine learning platform that impacts core business outcomes. Orr tells us about the focus on automation and platform thinking that’s uniquely allowed Lemonade’s engineers to make long-term investments t...
Feb 04, 2022•52 min•Season 1Ep. 79
MLOps Coffee Sessions #78 with Erica Greene and Seoyoon Park, Calibration for ML at Etsy - apply() special. // Abstract This is a special conversation about Machine Learning calibration at Etsy. Demetrios sat down with Erica Greene and Seoyoon Park to hear about how they implemented Calibration into the Etsy Machine Learning workflow. The conversation is a pre-chat with these two before their presentation at the apply() conference on February 10th. Register here: applyconf.com // Bio Erica Geen ...
Jan 31, 2022•50 min•Season 1Ep. 78
MLOps Coffee Sessions #77 with Scott Hirleman, Data Mesh - The Data Quality Control Mechanism for MLOps? // Abstract Scott covers what is a data mesh at a high level for those not familiar. Data mesh is potentially a great win for ML/MLOps as there is very clear guidance on creating useful, clean, well-documented/described and interoperable data for "unexpected use". So instead of data spelunking being a harrowing task, it can be a very fruitful one. And that one data set that was so awesome? We...
Jan 28, 2022•57 min•Season 1Ep. 77
MLOps Coffee Sessions #76 with Mohamed Elgendy, Build a Culture of ML Testing and Model Quality. // Abstract Machine learning engineers and data scientists spend most of their time testing and validating their models’ performance. But as machine learning products become more integral to our daily lives, the importance of rigorously testing model behavior will only increase. Current ML evaluation techniques are falling short in their attempts to describe the full picture of model performance. Eva...
Jan 25, 2022•51 min•Season 1Ep. 76
MLOps Coffee Sessions #75 with Shreya Shankar, Towards Observability for ML Pipelines. // Abstract Achieving observability in ML pipelines is a mess right now. We are tracking thousands of means, percentiles, and KL divergences of features and outputs in a haphazard attempt to figure out when and how to retrain models. In this session, we break down current unsuccessful approaches and discuss the path towards effectively maintaining ML models in production. Along the way, we introduce mltrace --...
Jan 21, 2022•57 min•Season 1Ep. 75
MLOps Coffee Sessions #74 with Jesse Johnson, Scaling Biotech. // Abstract Scaling a biotech research platform requires managing organization complexity - teams, functions, projects - rather than just the traditional volume, velocity, and variety. By examining the processes and experiments that drive the platform, you can focus your work where it matters the most by finding the ideal balance for each type of experiment along with a number of common trade-offs. // Bio Jesse Johnson is head of Dat...
Jan 19, 2022•51 min•Season 1Ep. 74
MLOps Coffee Sessions #73 with Breno Costa and Matheus Frata, On Structuring an ML Platform 1 Pizza Team. // Abstract Breno and Matheus were part of an organizational change at Neoway in recent years. With the creation of cross-functional and platform teams in order to improve the value stream generated by these. They share their experience in creating a machine learning platform team. The challenges they faced along the way, how they approached using product thinking and the results achieved so...
Jan 07, 2022•53 min•Season 1Ep. 73
MLOps Coffee Sessions #72 with Vishnu Rachakonda and Demetrios Brinkmann, 2021 MLOps Year in Review. // Abstract Vishnu and Demetrios sit down to reflect on some of the biggest news and learnings from 2021 from the biggest funding rounds to best insights. The two finish out the chat by talking about what to expect in 2022. // Bio Demetrios Brinkmann At the moment Demetrios is immersing himself in Machine Learning by interviewing experts from around the world in the weekly MLOps.community meetups...
Jan 03, 2022•52 min•Season 1Ep. 72
Loblaws is one of Canada’s largest grocery store chains, Mefta's team at Loblaw Digital runs several ML systems such as search, recommendations, inventory, and labor prediction on production. In this conversation, he shares his experience setting up their ML platform on GCP using Vertex AI and open-source tools. The goal of this platform is to help all the data science teams within their organization to take ML projects from EDA to production rapidly while ensuring end-to-end tracking of t...
Dec 28, 2021•40 min•Season 1Ep. 71
MLOps Coffee Sessions #70 with Reah Miyara, 2022 Predictions for MLOps and the Industry. // Abstract MLOps has moved fast in the last year. What will 2022 be like in the MLOps ecosystem? Raeh from Arize AI comes on to talk to us about what he expects for the new year. Arize is kindly offering 20 free subscriptions to their tool. No marketing BS these are design partners. First come first serve https://arize.com/mlops-signup/ ! // Bio Reah Miyara is a Senior Product Manager at Arize AI, a ...
Dec 23, 2021•36 min•Season 1Ep. 70
MLOps Coffee Sessions #69 with James Lamb, Building for Small Data Science Teams co-hosted by Adam Sroka. // Abstract In this conversation, James shares some hard-won lessons on how to effectively use technology to create applications powered by machine learning models. James also talks about how making the "right" architecture decisions is as much about org structure and hiring plans as it is about technological features. // Bio James Lamb is a machine learning engineer at SpotHero, a Chicago-b...
Dec 20, 2021•53 min•Season 1Ep. 69
MLOps Coffee Sessions #68 with Chris Albon, Wikimedia MLOps co-hosted by Neal Lathia. // Abstract // Bio Chris spent over a decade applying statistical learning, artificial intelligence, and software engineering to political, social, and humanitarian efforts. He is the Director of Machine Learning at the Wikimedia Foundation. Previously, Chris was the Director of Data Science at Devoted Health, Director of Data Science at the Kenyan startup BRCK, cofounded the AI startup Yonder, created the data...
Dec 13, 2021•1 hr 6 min•Season 1Ep. 68
MLOps Coffee Sessions #67 with John Crousse, ML Stepping Stones: Challenges & Opportunities for Companies co-hosted by Adam Sroka. // Abstract In this coffee session, John shares his observations after working with multiple companies which were in the process of scaling up their ML capabilities. John's observations are mostly around changes in practices, successes, failures, and bottlenecks identified when building ML products and teams from scratch. John shares a few thoughts on building lo...
Dec 09, 2021•48 min•Season 1Ep. 67
MLOps Coffee Sessions #66 with Jacopo Tagliabue, Machine Learning at Reasonable Scale. // Abstract We believe that immature data pipelines are preventing a large portion of industry practitioners from leveraging the latest research on ML: truth is, outside of Big Tech and advanced startups, ML systems are still far from producing the promised ROI. The good news is that times are changing: thanks to a growing ecosystem of tools and shared best practices, even small teams can be incredibly product...
Dec 08, 2021•1 hr 5 min•Season 1Ep. 66
MLOps Coffee Sessions #65 with Skylar Payne, The Future of Data Science Platforms is Accessibility. // Abstract The machine learning and data science space is blowing up -- new tools are popping up every day. While we seem to have every type of "Flow" and "Store" you could imagine, few people really understand how to glue this stuff together. Despite all the tools we have available, we still see companies failing to leverage data science effectively to drive business results. Instead of spending...
Nov 30, 2021•52 min•Season 1Ep. 65
MLOps Reading Group meeting on November 20, 2021 --------------- ✌️Connect With Us ✌️ ------------- Join our slack community: https://go.mlops.community/slack Follow us on Twitter: @mlopscommunity Connect with us on LinkedIn: https://www.linkedin.com/company/mlopscommunity/ Sign up for the next meetup: https://go.mlops.community/register Catch all episodes, Feature Store, Machine Learning Monitoring and Blogs: https://mlops.community/
Nov 29, 2021•56 min
MLOps Coffee Sessions #64 with Slater Victoroff, The Future of AI and ML in Process Automation. // Abstract The Unstructured Imperative Recent advances in AI have dramatically advanced the state of the art around unstructured data, especially in the spaces of NLP and computer vision. Despite this, the adoption of unstructured technologies has remained low. Why do you think that is? How have the dynamics changed in the last five years? Multimodal AI Historic AI approaches have generally be...
Nov 23, 2021•58 min•Season 1Ep. 64
Dmytro Dzhulgakov, PyTorch: Bridging AI Research and Production. Talking PyTorch is always interesting, as the Facebook ML OSS project is one of the most important parts of the machine learning tooling ecosystem. This week, we talked to Dmytro Dzhulgakov, a tech lead for PyTorch. We started off talking about Dmytro's journey to being an engineer and tech lead at Facebook, and what his role entails. Dmytro has been at Facebook for 10+ years, so he gave some very interesting advice on how t...
Nov 16, 2021•53 min•Season 1Ep. 63
MLOps Coffee Sessions #62 with Joel Grus, MLOps from Scratch. // Abstract In this talk, Joel Grus of “I don’t like notebooks” fame shares with us his 2021 perspective on notebooks, where he thinks MLOps is now, and what his hot takes in the data space are now. // Bio Joel Grus is a Principal Engineer at Capital Group, where he leads a team that builds search, data, and machine learning products for the investment group. He is the author of the bestselling O'Reilly book *Data Science from Scratch...
Nov 09, 2021•56 min•Season 1Ep. 62
MLOps Coffee Sessions #60 with Svet Penkov, ML Tests. // Abstract How confident do you feel when you deploy a new model? Does improving an ML model feel like a game of "whack-a-mole"? ML is taking over all sorts of industries and yet ML testing tools are virtually non-existent. Drawing parallels from software engineering and electronic circuit board design to the aviation and semiconductor industries, the need for principled quality assurance (QA) step in the MLOps pipeline is long overdue. Let'...
Nov 02, 2021•41 min•Season 1Ep. 61
Coffee Sessions #60 with Alexandre Patry, Path to Productivity in Job Search and Job Recommendation AI at LinkedIn. // Abstract A year ago, LinkedIn job search and recommendation AI teams were at the end of a growth cycle. They were fighting many fires at once: a high number of user complaints, engineers spending a significant amount of their time keeping our machine learning pipelines running, online infrastructure that wasn't supporting their growth, and challenges ramping new models to experi...
Oct 25, 2021•52 min•Season 1Ep. 60
Coffee Sessions #59 with Cody Coleman, Data Quality Over Quantity or Data Selection for Data-Centric AI. // Abstract Big data has been critical to many of the successes in ML, but it brings its own problems. Working with massive datasets is cumbersome and expensive, especially with unstructured data like images, videos, and speech. Careful data selection can mitigate the pains of big data by focusing computational and labeling resources on the most valuable examples. Cody Coleman, a ...
Oct 11, 2021•1 hr 11 min•Season 1Ep. 59
Coffee Sessions #58 with Anne Cocos, 10 Types of Features your Location ML Model is Missing. // Abstract Machine learning on geographic data is relatively under-studied in comparison to ML on other formats like images or graphs. But geographic data is prevalent across a wide variety of domains (although many practitioners may not think of it that way). Clearly, any dataset with `latitude` and `longitude` columns can be viewed as geographic data, but also any dataset with a `zipcode`, `city`, `ad...
Oct 07, 2021•56 min•Season 1Ep. 58
Coffee Sessions #57 with Michael Del Balso and Erik Bernhardsson, The Future of ML and Data Platforms. // Abstract Machine learning, data analytics, and software engineering are converging as data-intensive systems become more ubiquitous. Erik Bernhardsson, ex-CTO at Better and former Spotify machine learning lead, and Mike Del Balso, CEO at Tecton and former Uber machine learning lead and co-creator of Michelangelo sit down to chat with us today. These two jammed with us abou...
Oct 01, 2021•55 min•Season 1Ep. 57
Soumanta wouldn't claim they've reached where they want to and they're still learning, so he's happy sharing successes as well as failures at Yugen.ai. // Abstract Determining Minimum Achievable Goals helps Yugen.ai ensure a significant amount of focus on value-added and impact before diving deep into solutions & building ML Systems. In this episode, Soumanta discusses Balancing ML Development vs Ops and Monitoring efforts while scaling plus their focus on improvements in small sprints. Soum...
Sep 27, 2021•52 min•Season 1Ep. 56
Coffee Sessions #55 with Salwa Muhammad, Learning and Teaching MLOps Applications. //Abstract Salwa shared her perspective on how FourthBrain and all learners can keep their education strategy fresh enough for the current zeitgeist. Furthermore, Salwa, Demetrios, and Vishnu talked about principles of effective learning that are important to keep in mind while embarking on any educational journey. This was a great conversation with a lot of practical tips that we hope you all listen...
Sep 21, 2021•48 min•Season 1Ep. 55