MLOps.community

Relaxed Conversations around getting AI into production, whatever shape that may come in (agentic, traditional ML, LLMs, Vibes, etc)

Follow on

Episodes

Building Trust Through Technology: Responsible AI in Practice // Allegra Guinan // #298

Building Trust Through Technology: Responsible AI in Practice // MLOps Podcast #298 with Allegra Guinan, Co-founder of Lumiera. Join the Community: https://go.mlops.community/YTJoinIn Get the newsletter: https://go.mlops.community/YTNewsletter // AbstractAllegra joins the podcast to discuss how Responsible AI (RAI) extends beyond traditional pillars like transparency and privacy. While these foundational elements are crucial, true RAI success requires deeply embedding responsible practices into ...

Mar 25, 2025•47 min

Claude Plays Pokémon - A Conversation with the Creator // David Hershey // #294

I Let An AI Play Pokémon! - Claude plays Pokémon Creator // MLOps Podcast #295 with David Hershey, Member of Technical Staff at Anthropic. Join the Community: https://go.mlops.community/YTJoinIn Get the newsletter: https://go.mlops.community/YTNewsletter // AbstractDemetrios chats with David Hershey from Anthropic's Applied AI team about his agent-powered Pokémon project using Claude. They explore agent frameworks, prompt optimization vs. fine-tuning, and AI's growing role in software, legal, an...

Mar 21, 2025•47 min

From Rules to Reasoning Engines // George Mathew // #296

From Rules to Reasoning Engines // MLOps Podcast #297 with George Mathew, Managing Director at Insight Partners. Join the Community: https://go.mlops.community/YTJoinIn Get the newsletter: https://go.mlops.community/YTNewsletter // AbstractGeorge Mathew (Insight Partners) joins Demetrios to break down how AI and ML have evolved over the past few years and where they’re headed. He reflects on the major shifts since his last chat with Demetrios, especially how models like ChatGPT have changed the ...

Mar 18, 2025•1 hr 5 min

GenAI Traffic: Why API Infrastructure Must Evolve... Again // Erica Hughberg // #296

GenAI Traffic: Why API Infrastructure Must Evolve... Again // MLOps Podcast #295 with Erica Hughberg, Community Advocate at Tetrate.Join the Community: https://go.mlops.community/YTJoinIn Get the newsletter: https://go.mlops.community/YTNewsletter

Mar 14, 2025•1 hr 6 min

The Unbearable Lightness of Data // Rohit Krishnan // #295

The Unbearable Lightness of Data // MLOps Podcast #295 with Rohit Krishnan, Chief Product Officer at bodo.ai.Join the Community: https://go.mlops.community/YTJoinIn Get the newsletter: https://go.mlops.community/YTNewsletter // AbstractRohit Krishnan, Chief Product Officer at Bodo.AI, joins Demetrios to discuss AI's evolving landscape. They explore interactive reasoning models, AI's impact on jobs, scalability challenges, and the path to AGI. Rohit also shares insights on Bodo.AI’s open-source m...

Mar 11, 2025•54 min

Kubernetes, AI Gateways, and the Future of MLOps // Alexa Griffith // #294

Kubernetes, AI Gateways, and the Future of MLOps // MLOps Podcast #294 with Alexa Griffith, Senior Software Engineer at Bloomberg. Join the Community: https://go.mlops.community/YTJoinIn Get the newsletter: https://go.mlops.community/YTNewsletter // Abstract Alexa shares her journey into software engineering, from early struggles with Airflow and Kubernetes to leading open-source projects like the Envoy AI Gateway. She and Demetrios discuss AI model deployment, tooling differences across tech ro...

Mar 07, 2025•52 min

Future of Software, Agents in the Enterprise, and Inception Stage Company Building // Eliot Durbin // #293

Future of Software, Agents in the Enterprise, and Inception Stage Company Building // MLOps Podcast 293 with Eliot Durbin, General Partner at Boldstart Ventures.Join the Community: https://go.mlops.community/YTJoinIn Get the newsletter: https://go.mlops.community/YTNewsletter // AbstractKey lessons for founders that are thinking about or starting their companies. 15 years of inception stage investing from how data science companies like Yhat went to market in 2013-14 and how that's evolved, to b...

Mar 04, 2025•54 min

The Agent Exchange: Practitioner Insights

Agents in Production [Podcast Limited Series] - Episode Five, Dmitri Jarnikov, Chiara Caratelli, and Steven Vester join Demetrios to explore AI agents in e-commerce. They discuss the trade-offs between generic and specialized agents, with Dmitri noting the need for a balance between scalability and precision. Chiara highlights how agents can dynamically blend both approaches, while Steven predicts specialized agents will dominate initially before trust in generic agents grows. The panel also exa...

Mar 03, 2025•48 min

Talk to Your Data: The SQL Data Analyst

In Agents in Production [Podcast Limited Series] - Episode Four , Donné Stevenson and Paul van der Boor break down the deployment of a Token Data Analyst agent at Prosus—why, how, and what worked. They discuss the challenges of productionizing the agent, from architecture to mitigating LLM overconfidence, key design choices, the role of pre-checks for clarity, and why they opted for simpler text-based processes over complex recursive methods. Guest speakers: Paul van der Boor - VP AI at Prosus G...

Feb 28, 2025•54 min

Getting to Grips with Web Agents

Agents in Production [Podcast Limited Series] - Episode Three explores the concept of web agents—AI-powered systems that interact with the web as humans do, navigating browsers instead of relying solely on APIs. The discussion covers why web agents emerge as a natural step in AI evolution, their advantages over API-based systems, and their potential impact on e-commerce and automation. The conversation also highlights challenges in making websites agent-friendly and envisions a future where agen...

Feb 26, 2025•46 min

The Challenge with Voice Agents

In Agents in Production Series - Episode Two , Demetrios, Paul, and Floris explore the latest in Voice AI agents. They discuss real-time voice interactions, OpenAI's real-time Voice API, and real-world deployment challenges. Paul shares insights from iFood’s voice AI tests in Brazil, while Floris highlights technical hurdles like turn detection and language processing. The episode covers broader applications in healthcare and customer service, emphasizing continuous learning and open-source inno...

Feb 22, 2025•48 min

The Agent Landscape - Lessons Learned Putting Agents Into Production

In Agents in Production Series - Episode One , Demetrios chats with Paul van der Boor and Floris Fok about the real-world challenges of deploying AI agents across @ProsusGroup of companies. They break down the evolution from simple LLMs to fully interactive systems, tackling scale, UX, and the harsh lessons from failed projects. Packed with insights on what works (and what doesn’t), this episode is a must-listen for anyone serious about AI in production. Guest speakers: Paul van der Boor - VP AI...

Feb 20, 2025•1 hr 9 min

Evolving Workflow Orchestration // Alex Milowski // #291

Alex Milowski is a researcher, developer, entrepreneur , mathematician, and computer scientist .Evolving Workflow Orchestration // MLOps Podcast #291 with Alex Milowski, Entrepreneur and Computer Scientist.// AbstractThere seems to be a shift from workflow languages to code - mostly annotation pythons - happening and getting us. It is a symptom of how complex workflow orchestration has gotten. Is it a dominant trend or will we cycle back to “DAG specifications”? At Stitchfix, we had our own DSL ...

Feb 14, 2025•1 hr 15 min

Insights from Cleric: Building an Autonomous AI SRE // Willem Pienaar // #290

Willem Pienaar is the Co-Founder and CTO of Cleric . He previously worked at Tecton as a Principal Engineer. Willem Pienaar attended the Georgia Institute of Technology. Insights from Cleric: Building an Autonomous AI SRE // MLOps Podcast #289 with Willem Pienaar, CTO & Co-Founder of Cleric.// AbstractIn this MLOps Community Podcast episode, Willem Pienaar, CTO of Cleric, breaks down how they built an autonomous AI SRE that helps engineering teams diagnose production issues. We explore how C...

Feb 11, 2025•56 min

Robustness, Detectability, and Data Privacy in AI // Vinu Sankar Sadasivan // #289

Vinu Sankar Sadasivan is a CS PhD ... Currently, I am working as a full-time Student Researcher at Google DeepMind on jailbreaking multimodal AI models. Robustness, Detectability, and Data Privacy in AI // MLOps Podcast #289 with Vinu Sankar Sadasivan, Student Researcher at Google DeepMind. // Abstract Recent rapid advancements in Artificial Intelligence (AI) have made it widely applicable across various domains, from autonomous systems to multimodal content generation. However, these models rem...

Feb 07, 2025•53 min

AI & Aliens: New Eyes on Ancient Questions // Richard Cloete // #288

Richard Cloete is a computer scientist and a Laukien-Oumuamua Postdoctoral Research Fellow at the Center for Astrophysics, Harvard University. He is a member of the Galileo Project working under the supervision of Professor Avi, having recently held a postdoctoral position at the University of Cambridge, UK. AI & Aliens: New Eyes on Ancient Questions // MLOps Podcast #288 with Richard Cloete, Laukien-Oumuamua Postdoctoral Research Fellow at Harvard University. // Abstract Demetrios speaks wi...

Feb 04, 2025•48 min

Real LLM Success Stories: How They Actually Work // Alex Strick van Linschoten // #287

A software engineer based in Delft, Alex Strick van Linschoten recently built Ekko, an open-source framework for adding real-time infrastructure and in-transit message processing to web applications. With years of experience in Ruby, JavaScript, Go, PostgreSQL, AWS, and Docker, I bring a versatile skill set to the table. I hold a PhD in History, have authored books on Afghanistan, and currently work as an ML Engineer at ZenML . Real LLM Success Stories: How They Actually Work // MLOps Podcast #2...

Jan 31, 2025•50 min

Navigating Machine Learning Careers: Insights from Meta to Consulting // Ilya Reznik // #286

In his 13 years of software engineering, Ilya Reznik has specialized in commercializing machine learning solutions and building robust ML platforms. He's held technical lead and staff engineering roles at premier firms like Adobe, Twitter, and Meta. Currently, Ilya channels his expertise into his travel startup, Jaunt, while consulting and advising emerging startups. Navigating Machine Learning Careers: Insights from Meta to Consulting // MLOps Podcast #286 with Ilya Reznik, ML Engineering Thoug...

Jan 27, 2025•1 hr 1 min

Collective Memory for AI on Decentralized Knowledge Graph // Tomaž Levak // #285

Tomaž Levak is the Co-founder and CEO of Trace Labs – OriginTrail core developers. OriginTrail is a web3 infrastructure project combining a decentralized knowledge graph (DKG) and blockchain technologies to create a neutral, inclusive ecosystem. Collective Memory for AI on Decentralized Knowledge Graph // MLOps Podcast #285 with Tomaz Levak, Founder of Trace Labs, Core Developers of OriginTrail. // Abstract The talk focuses on how OriginTrail Decentralized Knowledge Graph serves as a collective ...

Jan 24, 2025•53 min

Efficient Deployment of Models at the Edge // Krishna Sridhar // #284

Krishna Sridhar is an experienced engineering leader passionate about building wonderful products powered by machine learning. Efficient Deployment of Models at the Edge // MLOps Podcast #284 with Krishna Sridhar, Vice President of Qualcomm. Big shout out to Qualcomm for sponsoring this episode! // Abstract Qualcomm® AI Hub helps to optimize, validate, and deploy machine learning models on-device for vision, audio, and speech use cases. With Qualcomm® AI Hub, you can: Convert trained models from...

Jan 17, 2025•52 min

Real World AI Agent Stories // Zach Wallace // #283

Machine Learning, AI Agents, and Autonomy // MLOps Podcast #283 with Zach Wallace, Staff Software Engineer at Nearpod Inc. // Abstract Demetrios chats with Zach Wallace, engineering manager at Nearpod, about integrating AI agents in e-commerce and edtech. They discuss using agents for personalized user targeting, adapting AI models with real-time data, and ensuring efficiency through clear task definitions. Zach shares how Nearpod streamlined data integration with tools like Redshift and DBT, en...

Jan 15, 2025•47 min

Machine Learning, AI Agents, and Autonomy // Egor Kraev // #282

Since three years, Egor is bringing the power of AI to bear at Wise , across domains as varied as trading algorithms for Treasury, fraud detection, experiment analysis and causal inference, and recently the numerous applications unlocked by large language models. Open-source projects initiated and guided by Egor include wise-pizza, causaltune, and neural-lifetimes, with more on the way. Machine Learning, AI Agents, and Autonomy // MLOps Podcast #282 with Egor Kraev, Head of AI at Wise Plc. // Ab...

Jan 08, 2025•1 hr 5 min

Re-Platforming Your Tech Stack // Michelle Marie Conway & Andrew Baker // #281

Re-Platforming Your Tech Stack // MLOps Podcast #281 with Michelle Marie Conway, Lead Data Scientist at Lloyds Banking Group and Andrew Baker, Data Science Delivery Lead at Lloyds Banking Group. // Abstract Lloyds Banking Group is on a mission to embrace the power of cloud and unlock the opportunities that it provides. Andrew, Michelle, and their MLOps team have been on a journey over the last 12 months to take their portfolio of circa 10 Machine Learning models in production and migrate them fr...

Jan 03, 2025•51 min

Holistic Evaluation of Generative AI Systems // Jineet Doshi // #280

Jineet Doshi is an award-winning Scientist, Machine Learning Engineer, and Leader at Intuit with over 7 years of experience. He has a proven track record of leading successful AI projects and building machine-learning models from design to production across various domains which have impacted 100 million customers and significantly improved business metrics, leading to millions of dollars of impact. Holistic Evaluation of Generative AI Systems // MLOps Podcast #280 with Jineet Doshi, Staff AI Sc...

Dec 23, 2024•58 min

Unleashing Unconstrained News Knowledge Graphs to Combat Misinformation // Robert Caulk // #279

Robert Caulk is responsible for directing software development, enabling research, coordinating company projects, quality control, proposing external collaborations, and securing funding. He believes firmly in open-source, having spent 12 years accruing over 1000 academic citations building open-source software in domains such as machine learning, image analysis, and coupled physical processes. He received his Ph.D. from Université Grenoble Alpes, France, in computational mechanics. Unleashing U...

Dec 20, 2024•1 hr 15 min

LLM Distillation and Compression // Guanhua Wang // #278

Guanhua Wang is a Senior Researcher in DeepSpeed Team at Microsoft . Before Microsoft , Guanhua earned his Computer Science PhD from UC Berkeley. Domino: Communication-Free LLM Training Engine // MLOps Podcast #278 with Guanhua "Alex" Wang, Senior Researcher at Microsoft. // Abstract Given the popularity of generative AI, Large Language Models (LLMs) often consume hundreds or thousands of GPUs to parallelize and accelerate the training process. Communication overhead becomes more pronounced when...

Dec 17, 2024•50 min

AI's Next Frontier // Aditya Naganath // #277

Thanks to the High Signal Podcast by Delphina: https://go.mlops.community/HighSignalPodcast Aditya Naganath is an experienced investor currently working with Kleiner Perkins . He has a passion for connecting with people over coffee and discussing various topics related to tech, products, ideas, and markets. AI's Next Frontier // MLOps Podcast #277 with Aditya Naganath, Principal at Kleiner Perkins. // Abstract LLMs have ushered in an unmistakable supercycle in the world of technology. The low-ha...

Dec 11, 2024•58 min

PyTorch for Control Systems and Decision Making // Vincent Moens // #276

Dr Vincent Moens is an Applied Machine Learning Research Scientist at Meta and an author of TorchRL and TensorDict in Pytorch. PyTorch for Control Systems and Decision Making // MLOps Podcast #276 with Vincent Moens, Research Engineer at Meta. // Abstract PyTorch is widely adopted across the machine learning community for its flexibility and ease of use in applications such as computer vision and natural language processing. However, supporting reinforcement learning, decision-making, and contro...

Dec 04, 2024•57 min

AI-Driven Code: Navigating Due Diligence & Transparency in MLOps // Matt van Itallie // #275

Matt Van Itallie is the founder and CEO of Sema . Prior to this, they were the Vice President of Customer Support and Customer Operations at Social Solutions. AI-Driven Code: Navigating Due Diligence & Transparency in MLOps // MLOps Podcast #275 with Matt van Itallie, Founder and CEO of Sema. // Abstract Matt Van Itallie, founder and CEO of Sema, discusses how comprehensive codebase evaluations play a crucial role in MLOps and technical due diligence. He highlights the impact of Generative A...

Nov 29, 2024•57 min

PyTorch's Combined Effort in Large Model Optimization // Michael Gschwind // #274

Dr. Michael Gschwind is a Director / Principal Engineer for PyTorch at Meta Platforms . At Meta , he led the rollout of GPU Inference for production services. // MLOps Podcast #274 with Michael Gschwind, Software Engineer, Software Executive at Meta Platforms. // Abstract Explore the role in boosting model performance, on-device AI processing, and collaborations with tech giants like ARM and Apple. Michael shares his journey from gaming console accelerators to AI, emphasizing the power of commun...

Nov 26, 2024•58 min

← Prev Next →

For the best experience, listen in Metacast app for iOS or Android

Open in Metacast