MLOps.community - podcast cover

MLOps.community

Demetrios mlops.community
Relaxed Conversations around getting AI into production, whatever shape that may come in (agentic, traditional ML, LLMs, Vibes, etc)

Episodes

Data Mesh - The Data Quality Control Mechanism for MLOps? // Scott Hirleman // MLOps Coffee Sessions #77

MLOps Coffee Sessions #77 with Scott Hirleman, Data Mesh - The Data Quality Control Mechanism for MLOps? // Abstract Scott covers what is a data mesh at a high level for those not familiar. Data mesh is potentially a great win for ML/MLOps as there is very clear guidance on creating useful, clean, well-documented/described and interoperable data for "unexpected use". So instead of data spelunking being a harrowing task, it can be a very fruitful one. And that one data set that was so awesome? We...

Jan 28, 202257 minSeason 1Ep. 77

Build a Culture of ML Testing and Model Quality // Mohamed Elgendy // MLOps Coffee Sessions #76

MLOps Coffee Sessions #76 with Mohamed Elgendy, Build a Culture of ML Testing and Model Quality. // Abstract Machine learning engineers and data scientists spend most of their time testing and validating their models’ performance. But as machine learning products become more integral to our daily lives, the importance of rigorously testing model behavior will only increase. Current ML evaluation techniques are falling short in their attempts to describe the full picture of model performance. Eva...

Jan 25, 202251 minSeason 1Ep. 76

Towards Observability for ML Pipelines // Shreya Shankar // MLOps Coffee Sessions #75

MLOps Coffee Sessions #75 with Shreya Shankar, Towards Observability for ML Pipelines. // Abstract Achieving observability in ML pipelines is a mess right now. We are tracking thousands of means, percentiles, and KL divergences of features and outputs in a haphazard attempt to figure out when and how to retrain models. In this session, we break down current unsuccessful approaches and discuss the path towards effectively maintaining ML models in production. Along the way, we introduce mltrace --...

Jan 21, 202257 minSeason 1Ep. 75

Scaling Biotech // Jesse Johnson // MLOps Coffee Sessions #74

MLOps Coffee Sessions #74 with Jesse Johnson, Scaling Biotech. // Abstract Scaling a biotech research platform requires managing organization complexity - teams, functions, projects - rather than just the traditional volume, velocity, and variety. By examining the processes and experiments that drive the platform, you can focus your work where it matters the most by finding the ideal balance for each type of experiment along with a number of common trade-offs. // Bio Jesse Johnson is head of Dat...

Jan 19, 202251 minSeason 1Ep. 74

On Structuring an ML Platform 1 Pizza Team //Breno Costa & Matheus Frata //MLOps Coffee Sessions #73

MLOps Coffee Sessions #73 with Breno Costa and Matheus Frata, On Structuring an ML Platform 1 Pizza Team. // Abstract Breno and Matheus were part of an organizational change at Neoway in recent years. With the creation of cross-functional and platform teams in order to improve the value stream generated by these. They share their experience in creating a machine learning platform team. The challenges they faced along the way, how they approached using product thinking and the results achieved so...

Jan 07, 202253 minSeason 1Ep. 73

2021 MLOps Year in Review // Vishnu Rachakonda and Demetrios Brinkmann // MLOps Coffee Sessions #72

MLOps Coffee Sessions #72 with Vishnu Rachakonda and Demetrios Brinkmann, 2021 MLOps Year in Review. // Abstract Vishnu and Demetrios sit down to reflect on some of the biggest news and learnings from 2021 from the biggest funding rounds to best insights. The two finish out the chat by talking about what to expect in 2022. // Bio Demetrios Brinkmann At the moment Demetrios is immersing himself in Machine Learning by interviewing experts from around the world in the weekly MLOps.community meetups...

Jan 03, 202252 minSeason 1Ep. 72

Setting up an ML Platform on GCP: Lessons Learned // Mefta Sadat // MLOps Coffee Sessions #71

Loblaws is one of Canada’s largest grocery store chains, Mefta's team at Loblaw Digital runs several ML systems such as search, recommendations, inventory, and labor prediction on production. In this conversation, he shares his experience setting up their ML platform on GCP using Vertex AI and open-source tools.  The goal of this platform is to help all the data science teams within their organization to take ML projects from EDA to production rapidly while ensuring end-to-end tracking of t...

Dec 28, 202140 minSeason 1Ep. 71

2022 Predictions for MLOps and the Industry // Reah Miyara // MLOps Coffee Sessions #70

MLOps Coffee Sessions #70 with Reah Miyara, 2022 Predictions for MLOps and the Industry. // Abstract MLOps has moved fast in the last year. What will 2022 be like in the MLOps ecosystem? Raeh from Arize AI comes on to talk to us about what he expects for the new year.   Arize is kindly offering 20 free subscriptions to their tool. No marketing BS these are design partners. First come first serve https://arize.com/mlops-signup/ ! // Bio Reah Miyara is a Senior Product Manager at Arize AI, a ...

Dec 23, 202136 minSeason 1Ep. 70

Building for Small Data Science Teams // James Lamb // MLOps Coffee Sessions #69

MLOps Coffee Sessions #69 with James Lamb, Building for Small Data Science Teams co-hosted by Adam Sroka. // Abstract In this conversation, James shares some hard-won lessons on how to effectively use technology to create applications powered by machine learning models. James also talks about how making the "right" architecture decisions is as much about org structure and hiring plans as it is about technological features. // Bio James Lamb is a machine learning engineer at SpotHero, a Chicago-b...

Dec 20, 202153 minSeason 1Ep. 69

Wikimedia MLOps // Chris Albon // Coffee Sessions #68

MLOps Coffee Sessions #68 with Chris Albon, Wikimedia MLOps co-hosted by Neal Lathia. // Abstract // Bio Chris spent over a decade applying statistical learning, artificial intelligence, and software engineering to political, social, and humanitarian efforts. He is the Director of Machine Learning at the Wikimedia Foundation. Previously, Chris was the Director of Data Science at Devoted Health, Director of Data Science at the Kenyan startup BRCK, cofounded the AI startup Yonder, created the data...

Dec 13, 20211 hr 6 minSeason 1Ep. 68

ML Stepping Stones: Challenges & Opportunities for Companies // John Crousse // Coffee Sessions #67

MLOps Coffee Sessions #67 with John Crousse, ML Stepping Stones: Challenges & Opportunities for Companies co-hosted by Adam Sroka. // Abstract In this coffee session, John shares his observations after working with multiple companies which were in the process of scaling up their ML capabilities. John's observations are mostly around changes in practices, successes, failures, and bottlenecks identified when building ML products and teams from scratch. John shares a few thoughts on building lo...

Dec 09, 202148 minSeason 1Ep. 67

Machine Learning at Reasonable Scale // Jacopo Tagliabue // MLOps Coffee Sessions #66

MLOps Coffee Sessions #66 with Jacopo Tagliabue, Machine Learning at Reasonable Scale. // Abstract We believe that immature data pipelines are preventing a large portion of industry practitioners from leveraging the latest research on ML: truth is, outside of Big Tech and advanced startups, ML systems are still far from producing the promised ROI. The good news is that times are changing: thanks to a growing ecosystem of tools and shared best practices, even small teams can be incredibly product...

Dec 08, 20211 hr 5 minSeason 1Ep. 66

The Future of Data Science Platforms is Accessibility // Skylar Payne // Coffee Session #65

MLOps Coffee Sessions #65 with Skylar Payne, The Future of Data Science Platforms is Accessibility. // Abstract The machine learning and data science space is blowing up -- new tools are popping up every day. While we seem to have every type of "Flow" and "Store" you could imagine, few people really understand how to glue this stuff together. Despite all the tools we have available, we still see companies failing to leverage data science effectively to drive business results. Instead of spending...

Nov 30, 202152 minSeason 1Ep. 65

Impact of SWE in ML Projects // Laszlo Sragner and Tim Blazina // MLOps Reading Group

MLOps Reading Group meeting on November 20, 2021   --------------- ✌️Connect With Us ✌️ ------------- Join our slack community: https://go.mlops.community/slack Follow us on Twitter: @mlopscommunity Connect with us on LinkedIn: https://www.linkedin.com/company/mlopscommunity/ Sign up for the next meetup: https://go.mlops.community/register Catch all episodes, Feature Store, Machine Learning Monitoring and Blogs: https://mlops.community/

Nov 29, 202156 min

The Future of AI and ML in Process Automation // Slater Victoroff // MLOps Coffee Sessions #64

MLOps Coffee Sessions #64 with Slater Victoroff, The Future of AI and ML in Process Automation. // Abstract The Unstructured Imperative Recent advances in AI have dramatically advanced the state of the art around unstructured data, especially in the spaces of NLP and computer vision. Despite this, the adoption of unstructured technologies has remained low. Why do you think that is? How have the dynamics changed in the last five years? Multimodal AI   Historic AI approaches have generally be...

Nov 23, 202158 minSeason 1Ep. 64

PyTorch: Bridging AI Research and Production // Dmytro Dzhulgakov // Coffee Sessions #63

Dmytro Dzhulgakov, PyTorch: Bridging AI Research and Production.   Talking PyTorch is always interesting, as the Facebook ML OSS project is one of the most important parts of the machine learning tooling ecosystem. This week, we talked to Dmytro Dzhulgakov, a tech lead for PyTorch. We started off talking about Dmytro's journey to being an engineer and tech lead at Facebook, and what his role entails. Dmytro has been at Facebook for 10+ years, so he gave some very interesting advice on how t...

Nov 16, 202153 minSeason 1Ep. 63

I Don't Like Jupyter Notebooks // Joel Grus // Coffee Sessions #62

MLOps Coffee Sessions #62 with Joel Grus, MLOps from Scratch. // Abstract In this talk, Joel Grus of “I don’t like notebooks” fame shares with us his 2021 perspective on notebooks, where he thinks MLOps is now, and what his hot takes in the data space are now. // Bio Joel Grus is a Principal Engineer at Capital Group, where he leads a team that builds search, data, and machine learning products for the investment group. He is the author of the bestselling O'Reilly book *Data Science from Scratch...

Nov 09, 202156 minSeason 1Ep. 62

ML Tests // Svet Penkov // Coffee Sessions #61

MLOps Coffee Sessions #60 with Svet Penkov, ML Tests. // Abstract How confident do you feel when you deploy a new model? Does improving an ML model feel like a game of "whack-a-mole"? ML is taking over all sorts of industries and yet ML testing tools are virtually non-existent. Drawing parallels from software engineering and electronic circuit board design to the aviation and semiconductor industries, the need for principled quality assurance (QA) step in the MLOps pipeline is long overdue. Let'...

Nov 02, 202141 minSeason 1Ep. 61

Linkedin Job Recommendations // Alexandre Patry // Coffee Sessions #60

Coffee Sessions #60 with Alexandre Patry, Path to Productivity in Job Search and Job Recommendation AI at LinkedIn. // Abstract A year ago, LinkedIn job search and recommendation AI teams were at the end of a growth cycle. They were fighting many fires at once: a high number of user complaints, engineers spending a significant amount of their time keeping our machine learning pipelines running, online infrastructure that wasn't supporting their growth, and challenges ramping new models to experi...

Oct 25, 202152 minSeason 1Ep. 60

Data Selection for Data-Centric AI: Data Quality Over Quantity // Cody Coleman // Coffee Sessions #59

Coffee Sessions #59 with Cody Coleman, Data Quality Over Quantity or Data Selection for Data-Centric AI. // Abstract Big data has been critical to many of the successes in ML, but it brings its own problems. Working with massive datasets is cumbersome and expensive, especially with unstructured data like images, videos, and speech. Careful data selection can mitigate the pains of big data by focusing computational and labeling resources on the most valuable examples.   Cody Coleman, a ...

Oct 11, 20211 hr 11 minSeason 1Ep. 59

10 Types of Features your Location ML Model is Missing // Anne Cocos // Coffee Sessions #58

Coffee Sessions #58 with Anne Cocos, 10 Types of Features your Location ML Model is Missing. // Abstract Machine learning on geographic data is relatively under-studied in comparison to ML on other formats like images or graphs. But geographic data is prevalent across a wide variety of domains (although many practitioners may not think of it that way). Clearly, any dataset with `latitude` and `longitude` columns can be viewed as geographic data, but also any dataset with a `zipcode`, `city`, `ad...

Oct 07, 202156 minSeason 1Ep. 58

The Future of ML and Data Platforms // Michael Del Balso - Erik Bernhardsson // Coffee Sessions #57

Coffee Sessions #57 with Michael Del Balso and Erik Bernhardsson, The Future of ML and Data Platforms. // Abstract Machine learning, data analytics, and software engineering are converging as data-intensive systems become more ubiquitous.  Erik Bernhardsson, ex-CTO at Better and former Spotify machine learning lead, and Mike Del Balso, CEO at Tecton and former Uber machine learning lead and co-creator of Michelangelo sit down to chat with us today.    These two jammed with us abou...

Oct 01, 202155 minSeason 1Ep. 57

A Few Learnings from Building a Bootstrapped MLOps Services Startup //Soumanta Das// Coffee Sessions #56

Soumanta wouldn't claim they've reached where they want to and they're still learning, so he's happy sharing successes as well as failures at Yugen.ai. // Abstract Determining Minimum Achievable Goals helps Yugen.ai ensure a significant amount of focus on value-added and impact before diving deep into solutions & building ML Systems. In this episode, Soumanta discusses Balancing ML Development vs Ops and Monitoring efforts while scaling plus their focus on improvements in small sprints. Soum...

Sep 27, 202152 minSeason 1Ep. 56

Learning and Teaching MLOps Applications // Salwa Muhammad // MLOps Coffee Sessions #55

Coffee Sessions #55 with Salwa Muhammad, Learning and Teaching MLOps Applications.   //Abstract Salwa shared her perspective on how FourthBrain and all learners can keep their education strategy fresh enough for the current zeitgeist. Furthermore, Salwa, Demetrios, and Vishnu talked about principles of effective learning that are important to keep in mind while embarking on any educational journey.   This was a great conversation with a lot of practical tips that we hope you all listen...

Sep 21, 202148 minSeason 1Ep. 55

Machine Learning SRE // Niall Murphy // MLOps Coffee Sessions #54

Coffee Sessions #54 with Niall Murphy, Machine Learning SRE. //Abstract SRE is making its way into the machine learning world. Software engineering for machine learning requires reliability, performance, and maintainability. Site reliability engineering is the field that deals with reliability and ensuring constant, real-time performance. Niall Murphy, most recently Global Head of SRE at Microsoft Azure, helps us understand what SRE can do for modern ML products and teams. Building machine learn...

Sep 10, 202149 minSeason 1Ep. 54

MLOps Insights // David Aponte-Demetrios Brinkmann-Vishnu Rachakonda // MLOps Coffee Sessions #53

Coffee Sessions #53 with David Aponte, Demetrios Brinkmann, and Vishnu Rachakonda, MLOps Insights. //Abstract MLOps Insights from MLOps community core organizers Demetrios Brinkmann, Vishnu Rachakonda, and David Aponte. In this conversation the guys do a deep dive on testing with respect to MLOps, talk about what they have learned recently around the ML field, and what new things are happening with the MLOps community. //Bio David Aponte David is one of the organizers of the MLOps Community. He ...

Sep 07, 202138 minSeason 1Ep. 53

Vector Similarity Search at Scale // Dave Bergstein // MLOps Coffee Sessions #52

Coffee Sessions #52 with Dave Bergstein, Vector Similarity Search at Scale. //Abstract Ever wonder how Facebook and Spotify now seem to know you better than your friends? Or why the search feature in some products really “gets” you while in other products it feels stuck in the '90s? The difference is vector search— a method of indexing and searching through large volumes of vector embeddings to find more relevant search results and recommendations. Dave Bergstein, the Director of Product at Pine...

Aug 31, 202150 minSeason 1Ep. 52

ML Security: Why should you care? // Sahbi Chaieb // MLOps Coffee Sessions #51

Coffee Sessions #51 with Sahbi Chaieb, ML security: Why should you care? //Abstract Sahbi, a senior data scientist at SAS, joined us to discuss the various security challenges in MLOps. We went deep into the research he found describing various threats as part of a recent paper he wrote. We also discussed tooling options for this problem that is emerging from companies like Microsoft and Google. // Bio Sahbi Chaieb is a Senior Data Scientist at SAS, he has been working on designing, implementing...

Aug 17, 202153 minSeason 1Ep. 51

Creating MLOps Standards // Alex Chung and Srivathsan Canchi // MLOps Coffee Sessions #50

Coffee Sessions #50 with Alex Chung and Srivathsan Canchi, Creating MLOps Standards. // Abstract With the explosion in tools and opinionated frameworks for machine learning, it's very hard to define standards and best practices for MLOps and ML platforms. Based on their building AWS SageMaker and Intuit's ML Platform respectively, Alex Chung and Srivathsan Canchi talk with Demetrios and Vishnu about their experience navigating "tooling sprawl". They discuss their efforts to solve this problem or...

Aug 12, 202148 minSeason 1Ep. 50

Aggressively Helpful Platform Teams // Stefan Krawczyk // MLOps Coffee Sessions #49

Coffee Sessions #49 with Stefan Krawczyk, Aggressively Helpful Platform Teams. //Abstract At Stitch Fix there are 130+ “Full Stack Data Scientists” who in addition to doing data science work, are also expected to engineer and own data pipelines for their production models. One data science team, the Forecasting, Estimation, and Demand team were in a bind. Their data generation process was causing them iteration & operational frustrations in delivering time-series forecasts for the business. ...

Aug 10, 202152 minSeason 1Ep. 49