ITOps, DevOps, AIOps - All Things Ops - podcast cover

ITOps, DevOps, AIOps - All Things Ops

Elias Voelkercheckmk.com

The “ITOps, DevOps, AIOps - All Things Ops” podcast is dedicated to operating and managing modern large-scale IT infrastructures. If you want to learn best practices from other Leaders in IT operations, this show is for you. Each episode features an interview with a senior IT executive or Thought Leader, discussing topics like: 1. How to manage the increasing complexity of hybrid IT infrastructures 2. How to effectively leverage automation to “do more with less” 3. Getting the most out of monitoring and observability for hybrid IT infrastructures 4. Managing ITOps and DevOps teams

Last refreshed:
Follow this podcast in the Metacast mobile app to refresh it and see new episodes.
Download Metacast podcast app
Podcasts are better in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episodes

Ep. 54 - How Documentation Can Make or Break Your Team – with Connor Clark-Lindh

The best teams don’t chase perfection—they reduce friction. Ever been told you can’t improve a system because “the documentation isn’t up to date”? In this episode, Elias Voelker sits down with Connor Clark-Lindh , VP of Consulting and Operations at OrionW, to unpack what really drives operational excellence—and what gets in the way. Connor shares how over-documentation can become a blocker instead of a support system, especially as teams scale. He also told me how lately he's seen teams push fo...

Jul 01, 20251 hr 3 min

Ep. 53 - From 5,000 Alerts to AI-Ready Ops: Inside Acrisure’s Observability Overhaul - with Jordon Peeple

Jordon Peeple is Head of IT Infrastructure Operations at Acrisure—the fast-growing fintech powerhouse you’ve probably used without even knowing it. In this episode, Jordon shares how his team turned 5,000 ignored alerts into a focused, AI-ready monitoring system. He explains how they cut through the noise, rebuilt escalation chains, and shifting from reactive ops towards a proactive, business-aligned observability—supported by a complete IT org restructure. You'll learn: 1. How to reorganize you...

May 27, 202535 min

Ep. 52 - Keeping the Lights On at Visa: How to Engineer for Reliability at Scale - with Divya Veerapandian

How do you ensure reliability across hybrid infrastructure when the cloud half isn’t fully in your control? In this episode, host Elias Voelker talks with Divya Veerapandian , Senior Director for Infrastructure Reliability Engineering and Global Head of Infrastructure Network Services at Visa. From network automation and hybrid observability to AI-driven operations and inclusive engineering teams—Divya shares what it takes to deliver reliable infrastructure in one of the world’s most mission-cri...

Apr 15, 202537 min

Ep. 51 - IT for Human Rights: Scaling Secure Infrastructure for a Global Nonprofit – with Lydia Nicola

For Lydia Nicola’s IT team, a security breach isn’t just a risk—it could mean life or death for the people they protect. In this episode, Elias Voelker speaks with Lydia Nicola , Head of IT Operations at Amnesty International, about the unique challenges of running IT for a global nonprofit. Lydia shares how her team secures infrastructure in politically volatile regions, optimizes cloud costs, and combats shadow IT—all while defending against cyber threats from nation-states. She also reveals h...

Mar 25, 202546 min

Ep. 50 - Data Center Efficiency: Monitoring, AI, and Decarbonization - with Martin Casaulta & Martin Hirschvogel

How can you drive data center efficiency without compromising performance? In this episode, host Elias Voelker sits down with Martin Casaulta , Chief Technologist at Hewlett Packard Enterprise Switzerland, and Martin Hirschvogel , Chief Product Officer at Checkmk. They discuss data center efficiency—from traditional metrics like PUE to the impact of AI workloads—and share strategies for optimizing IT operations. Martin Casaulta highlights the Swiss Data Center Efficiency Association’s work on de...

Feb 17, 202545 min

Ep. 49 – 90% First-Contact Resolution: How to Build a Secure, Efficient IT Helpdesk– with Peg Kearney

How do you hit 90% first-contact resolution—and keep it there? In this episode, Elias speaks with Peg Kearney , Director of IT Operations at the University of Arizona College of Nursing, about how her helpdesk team maintains a 90% first-contact resolution rate by hiring top talent and providing them with the right tools and system access. Peg also highlights the importance of treating endpoints as critical infrastructure, managing IT in a HIPAA-compliant environment, and using VR labs to transfo...

Jan 27, 202542 min

Ep. 48 - From Surviving to Thriving: How a Major Cyberattack Sparked a Full IT Transformation - with Thomas Klithav Hansen

A major cyberattack led Blue Water Shipping to completely transform its IT infrastructure—and now, they're stronger than ever. In this episode, host Elias Voelker sits down with Thomas Klithav Hansen , Head of IT Operations at Blue Water Shipping, to discuss how the attack became the catalyst for a transformative IT journey. You’ll learn: 1. The role of enterprise architecture in driving IT success 2. Why product-based IT organizations foster agility and clarity 3. How cost and value transparenc...

Jan 05, 202534 min

Ep. 47 - 35,000 to 130,000 Students in 5 Years: Scaling IT at Internationale Hochschule - with Thomas Singbartl

Since 2019, Thomas Singbartl , Head of Global IT Operations at Internationale Hochschule (IU), has supported the university's astonishing growth journey from 35,000 to 130,000 students. Join host Elias Voelker in this episode as Thomas shares how IT fueled IU’s exponential growth, why the term “digital transformation” no longer applies, and how Syntea, IU’s AI learning companion, is revolutionizing education. He also outlines his vision for AI in IT operations, from process optimization to empow...

Dec 09, 202431 min

Ep. 46 - IT for the City of Atlanta: Building Scalable and Resilient Systems - with Tameka Neely-Dudley

What can we learn about resilience, scalability, and workforce development from the IT organization of the 6th-largest metro in the US? In this episode, Tameka Neely-Dudley , Director of IT Infrastructure Operations and Service Delivery for the City of Atlanta , shares insights from her nearly 25-year career, beginning as an intern and growing into a key leader managing Atlanta's complex IT landscape. Tameka reveals how her early experience preparing for Y2K laid the groundwork for building resi...

Nov 19, 202431 min

Ep. 45 - The Metrics That Matter: Optimizing ITSM by Focusing on Customer Effort – with Huseyin Uysal

Which KPIs really matter in IT Service Management? In this episode, Elias sits down with Huseyin Uysal , Head of Global Service Desk at ISS , to uncover what separates successful IT service management from the rest. With a wealth of experience managing global teams and optimizing IT processes, Huseyin reveals the metrics that really matter, how customer effort is often overlooked, and the strategies his team used to slash ticket resolution times in half despite a surge in ticket volumes. You'll ...

Oct 29, 202437 min

Ep. 44 - Mature ITSM: How to Drive Top-Down Change and Build Well-Oiled IT Operations - With Haroon Hasan

What separates a well-oiled IT operation from one constantly putting out fires? In this episode, we dive deep into the world of IT Service Management (ITSM) with Haroon Hasan , author of "Choose to Lead" and Director of IT Service Management and Governance at Computacenter . With 20+ years of experience, Haroon shares insights on optimizing ITSM for operational excellence. He covers quick ITSM assessments, the benefits of a mature ITSM model, the role of leadership in driving change, and more. Y...

Oct 15, 202439 min

Ep. 43 - Scaling Without the Cloud: How Sofascore Manages Millions of Real-Time Requests with Clever Caching - with Josip Stuhli

What happens when your infrastructure faces a live peak of millions of users worldwide—without cloud scalability? In this episode, Sofascore ’s CTO Josip Stuhli breaks down how his team navigates massive traffic surges, optimizes caching, and saves big by ditching the cloud while still delivering real-time updates to 25 million monthly users. You'll learn: 1. How caching flattens traffic peaks and handles massive live events without backend strain. 2. Why Sofascore left the cloud and the financi...

Oct 01, 202455 min

Ep. 42 - Integrating Cybersecurity with Operations: Ensuring Impact and Efficiency at UNICEF USA - with Andrew Nuxoll

Successful cybersecurity isn’t about heroics, it’s about preventing disasters you’ll never hear about. In this episode, Andrew Nuxoll , Managing Director of IT Operations and Cybersecurity at UNICEF USA , shares his journey from working at various managed service providers to leading cybersecurity efforts at a global NGO. Andrew offers insights into why cybersecurity is more than just keeping the lights on, how purpose-driven work changes the stakes, and the strategies he employs to manage cyber...

Sep 17, 202441 min

Ep. 41 - IT Leadership in Higher Education: Strategies for Service Management, Optimal Customer Experiences, and Employee Growth - with Mark Katsouros

How can universities navigate the complexities of service delivery while pursuing growth and innovation? Mark Katsouros , Senior Director for IT Engineering and Operations at Duquesne University , brings nearly 40 years of higher education IT experience. From the University of Maryland to pivotal roles at the University of Iowa and Penn State, Mark has driven significant IT transformations. Tune in to hear his journey, the unique challenges in higher education, and strategies for successful serv...

Sep 03, 20241 hr 9 min

Ep. 40 - Service Management in IT and Beyond - with Martijn Adams

Martijn Adams , General Manager at 4me , brings a lifetime of expertise in IT service management, having worked with leading companies such as Philips, Deloitte, and Danone. This episode delves into his journey and the unique approaches that 4me employs to streamline service management across IT, HR, and facilities. You'll discover how service management principles can impact different departments, why user experience is crucial, and the potential of AI in enhancing service delivery. You'll lear...

Jul 16, 202452 min

Ep. 39 - Scaling Cyware: Lessons From Growing the Company Headcount Fivefold - With Joe Aurilia

How can you scale your tech company while maintaining rigorous operational standards? Senior VP of Operations at Cyware Joe Aurilia shares what he learned while 5x-ing the company. In this episode, Joe shares how he's building operations from the ground up, handling the complexities of international teams, and embedding a culture of security and compliance in a rapidly growing company. You'll learn: 1. How to balance immediate operational needs with long-term growth strategies 2. The challenges ...

Jun 25, 202442 min

Ep. 38 - Transparency, Credibility, and Connection: Hard-Earned Lessons From 25 Years in IT - with Paul Teodorescu

Learn from Paul Teodorescu 's 25 years of IT experience as he shares the importance of transparency, credibility, and connecting with people in the tech industry. In this episode, Paul shares his journey from crawling under desks at Merrill Lynch to advising top firms like Morgan & Morgan. Explore the nuances of interim management versus advisory roles, and discover how IT challenges remain consistent across industries. Paul emphasizes the importance of connecting with people, attending and ...

Jun 10, 202456 min

Ep. 37 - How GenAI Is Reshaping the Way We Do ITOps - with Nathanial Smalley

The end of the traditional SRE? How do you see the future unfolding as AI's role in IT operations grows? In this episode, we welcome Nathanial Smalley , Principal Sales Engineer at Transposit . He brings his rich experience from over a decade at Splunk and his current role at Transposit to discuss the impact of AI on IT operations. He delves into practical AI applications in the present, its promising future in IT operations and SRE, and the crucial lessons from his transition from military oper...

May 21, 202448 min

Ep 36 - Hyper-Converged Infrastructures: The Answer to the Complexity of IT Systems? - with Lee Caswell

Over the next 3 years, more than 750 million new applications will hit the market... and nobody can predict what those applications will look like. In this episode, Lee Caswell , SVP of Product and Solutions Marketing at Nutanix , introduces Hyper-Converged Infrastructures: a groundbreaking solution that integrates computing, storage, and networking into a single system to reduce complexity and improve scalability. Reflecting on his tenure at VMware, Caswell also offers insights into VMware's cu...

Mar 26, 202444 min

Ep 35 - Cybersecurity Masterclass: Compliance and Breach Prevention in the Era of Cloud and AI - with Jason Ford

Cybersecurity as we know it today is still in its infancy, which begs the question: how will it mature in the wake of rapid cloud and AI innovations? In this episode, Dalarie is joined by Jason Ford , CEO and CISO at Steel Patriot Partners , who shares his in-depth insights into the evolving world of IT operations. From the Wild West of early cybersecurity days to the cutting-edge advancements in cloud computing and AI, Jason guides us through the critical shifts and strategies that are shaping ...

Mar 10, 202445 min

Ep 34 - The Future of Real Estate: Fathom Realty on Cloud Transition, Kubernetes, and AI - with David Almeida

How is technology shaping the future of real estate? In this episode, David Almeida , VP of DevOps and BI at Fathom Realty delves into the nuances of cloud transformation in real estate, the strategic implementation of Kubernetes for operational efficiency, and the evolving role of AI in this sector. David also shares his first-hand experiences, challenges, and triumphs in navigating these technological advancements at Fathom Realty. You'll learn: 1. Insights into cloud-based brokerage models 2....

Dec 21, 202348 min

Ep 33 - From Manufacturing Excellence to IT Excellence: Inside Toyota’s DevOps Playbook - with Kumar Singirikonda

Discover the synergies of DevOps, DataOps, and cloud technologies that are driving Toyota's IT revolution as we delve into the cutting-edge IT strategies powering one of the world's leading automakers. In this episode, Kumar Singirikonda , Director of DevOps Engineering at Toyota North America, shares his insights into Toyota's IT methodologies, reflecting on his role in orchestrating these strategies. He discusses the practicalities and challenges in achieving high levels of DevOps maturity, hi...

Dec 11, 202337 min

EP 32 - The Future of Healthcare Tech: Cloud, AI, and Conversational Interfaces - with Steven Michaels

Dive into the transformative world of healthcare technology with Steven Michaels , VP of Technology and CTO at Baylor Scott & White Health . Explore how cloud computing, AI, and conversational interfaces are reshaping patient care and IT operations in one of the largest healthcare systems in Texas. With Baylor Scott & White Health encompassing over 50 hospitals and around 700 clinics, Michaels brings a wealth of experience in integrating advanced technology in healthcare settings. This e...

Nov 28, 20231 hr 1 min

KubeCon Special: Opinions, Learnings, and Booth Breakdowns With Checkmk’s Elias and Dalarie

Dive into the vibrant world of KubeCon with hosts Dalarie Gonzales and Elias Voelker of Checkmk as they share their personal experiences of the event. In this episode, they unpack everything from insightful tech trends to the most captivating booths. Get ready to hear first-hand experiences and learnings that are shaping the future of IT operations, cloud transformation, and data management. What to expect: 1. What were Elias' and Dalarie's general impressions of Kubecon? 2. What were the top-th...

Nov 16, 202337 min

Ep 30 - Balancing Speed and Security in Software Development: Navigating the Software Supply Chain - with Paul Karsten and Alexander Kalinovsky

Alexander Kalinowsky and Paul Karsten , Founders of Idea Harbor , navigate the complexities of the software supply chain in a world where speed and security are paramount. With their rich backgrounds in IT consultancy and DevOps, they join host Elias Voelker to dissect the balancing act between rapid software development and robust security. They delve into the intricacies of the software supply chain, exploring its vulnerabilities and the art of managing risks, and cover the global variations i...

Nov 14, 202340 min

Ep 29 - The Truth About Application Availability in the Cloud: What You Need to Know - with Blaize Stewart

Prepare to discover the secrets to high availability and resiliency of applications in the cloud with Blaize Stewart , Architect at Xpirit . In this episode, Blaize delves into crucial topics surrounding high availability, resiliency, and multicloud strategies, and offers valuable insights into building robust infrastructures and applications that can withstand any challenge. He also explores how Microsoft's shift towards open source could benefit your organization and revolutionize your approac...

Nov 07, 202353 min

Ep 28 - Infrastructure as Code and the Pivotal Role It Plays in Cloud Computing - with Nora Schoener

Early in her career, Nora Schöner fell in love with infrastructure as code. In this episode, Nora Schöner , Senior Cloud Consultant at superluminar , explains why she's so passionate about tools like Terraform, AWS CDK, and Pulumi, how they lower the threshold for developers entering the industry, and how they simplify the way developers define and make changes to infrastructure. You'll learn: 1. How infrastructure as code helped Nora enter the programming industry 2. What impact will the HashiC...

Oct 10, 202347 min

Ep 27 - Enterprise-Grade Cloud Transformations: Essential Advice for Big Companies Moving to the Cloud - with Kamlesh Moliya

Having held leadership positions at large insurance and financial institutions like Prudential, Citigroup, Bank of America, and UBS, Kamlesh Moliya is the go-to expert for all things cloud transformation. In this episode, he shares how you can drive your company's cloud transformation initiatives with enterprise-grade DevOps solutions and application migration strategies. You'll learn: 1. The applicability of DORA and other DevOps KPIs 2. The role of monitoring and observability in cloud transfo...

Sep 26, 202348 min

Ep 26 - The API Mindset: How to Build Products That Can Communicate and Scale - with Baiju Joseph Thalupadath

Baiju 's first job selling push-button phones earned him just $12 per month. Today, he has amassed an impressive portfolio of work at companies like Yahoo and Verizon and is now on the lookout for a new challenging endeavor. In this interview, Baiju casts his expert eye on hot topics like quality engineering, agile development, APIs, and more. You'll learn: 1. Why APIs have evolved from an afterthought to a necessity 2. The challenges of distributed agile development 3. The value of mentorship f...

Sep 12, 202349 min

Ep 25 - Inside the Giant: The Innovations Powering Microsoft's Hyperscale Data Centers - with Adam Morton

Microsoft owns some of the biggest data centers in the world. And as the demand for computing capacity continues to grow, so do the challenges. Adam Morton , Senior Director at Microsoft , is responsible for introducing new technology at Microsoft's hyperscale data centers. From liquid cooling systems to power supplies, emission regulations to robots, Adam and his team work tirelessly to keep up with Microsoft's insatiable need for more capacity. In this enlightening episode, you'll learn: 1. Wh...

Sep 05, 202352 min
For the best experience, listen in Metacast app for iOS or Android