On-Call Nightmares Podcast - podcast cover

On-Call Nightmares Podcast

Jay Gordonwww.podomatic.com
Being on-call in a tech team can lead to some interesting stories. On this podcast we'll talk to a variety of people from the world of technology, discuss their experiences in on-call and find out some nightmares they survived. Hosted by Jay Gordon - Twitter @jaydestro
Download Metacast podcast app
Podcasts are better in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episodes

Episode 18 - Phoummala Schmitt - Microsoft

You get opportunities in tech to work with some of the best people in the world. I got that opportunity when I joined Microsoft, that's where I met the Exchange Goddess! We discuss family, work and how it all comes together when you're on-call. We also discuss the Microsoft Create Startups Event Phoummala will be taking part in, https://www.createstartups.io/ You can register now, for free! Sr Cloud Advocate @ Microsoft, with a background focus on messaging and collaboration, virtualization, and...

Apr 04, 201939 min

Episode 17 - Andy Fleener - SportsEngine

Get your playbook and have the stats ready, we're talking with Andy Fleener of SportsEngine this week. Andy is a Humanist, Systems Thinker, New View Safety Nerd, Sr. Platform Operations Manager at SportsEngine, DevOps Days MSP Co-Organizer. Twitter: @andyfleener

Mar 28, 201942 min

Episode 16 - Eric Sorenson - Puppet

Ever wonder what it was like to do dial-up support hosting in Hawaii? Well this is the damn episode you've waited for your whole life. After 16 years working as a systems/network administrator in the Bay Area, Eric relocated to Portland in 2012 to further develop his passion for awesome configuration management tools. As Puppet's product manager, he worked on extending and improving it for modern infrastructure; his current project is Lyra, a cloud-native workflow engine. Outside of work he enjo...

Mar 20, 201930 min

Episode 15 - Andrew Clay Shafer - Pivotal

The Conscientious Developer There are great ways to think of how to attack the on-call situation even if you aren't in an on-call rotation. By being a conscientious developer and taking that extra interest in your software after deployment you're adding incredible valuable. Your co-workers may also really end up appreciating your time a little bit more as well. Some people are born to on call, and some people have on call thrust up on them. Andrew Clay Shafer stole good ideas from wherever he co...

Mar 14, 201943 min

Episode 14 - JD Trask - Raygun

Welcome back to OCN! I this time I chat with CEO of Raygun, JD Trask. One of the cool parts of this podcast is meeting people from all over the world who have had some experience on-call, JD does his thing in New Zealand! John-Daniel is the CEO and co-founder of Raygun.com, an application monitoring company that helps teams identify hidden performance bottlenecks and software bugs. With over 25 years of experience in software development, JD is a programmer at heart with unique insights into sca...

Mar 07, 201935 min

Episode 13 - Damian Schenkelman - Auth0

Welcome back to another podcast about downtime! Once again we meet with another technologist who's building a new product and getting it out to the world. This time we meet Damian of Auth0 who's been working with his team to ensure identity services. Damian is an Software Engineer that loves to solve hard problems of any type, especially those related to making software and teams scale. He is a Director of Engineering at Auth0 helping make identity simple for developers. Before Auth0, Damian spe...

Feb 28, 201929 min

Episode 12 - Baron Schwartz - VividCortex

Content Warning: This episode does contain some graphic description of the work done by an EMT - if you find this troubling you may want to check out another episode! On this episode, I speak with the CTO and founder of VividCortex on his life down on the farm and as an EMT. Baron gives us some insight into how that prepared him for his time on-call in different roles to ensure databases are fast and reliable. Baron is the CTO and founder of VividCortex, the best way to see what your production ...

Feb 21, 201941 min

Episode 11 - Sam Phippen - Google

On this edition, Sam shares with me some scary moments from his time at DigitalOcean. Sam tells the tale of a database table that was dropped. https://blog.digitalocean.com/update-on-the-april-5th-2017-outage/ Sam Phippen is a Developer Advocate at Google, and previously an Engineering Manager at DigitalOcean. He's seen his fair share of deep, complex, incidents. He has strong opinions about incident management, postmortem culture, and on call practises. He's sad that he can't hug every cat. Twi...

Feb 14, 201939 min

Episode 10 - J. Paul Reed - Everywhere and Nowhere ;-)

In this episode, Jay and J. Paul Reed discuss the need for on-call practices and incident response in the world of software release engineering. Paul shares some great stories, including how the World Series can depend on a single line of code. J. Paul Reed has over twenty years experience in the trenches as a build/release engineer, working with such companies as VMware, Mozilla, Postbox, Symantec, and Salesforce. In 2012, he founded Release Engineering Approaches, a consultancy incorporating a...

Feb 07, 201944 min

Episode 9 - Charity Majors - Honeycomb.io

Infrastructure Week, Episode 2! Charity and Jay sit down for a discussion on her career and a deep dive into a database incident. You'll get some interesting thoughts on how monitoring has changed in operations. Charity is cofounder and CEO of Honeycomb.io, a startup aimed at debugging complex systems. (“It’s like strace for systems!”) Previously, Charity ran infrastructure at Parse and was an engineering manager at Facebook. She also worked with the RocksDB team to build and deploy the world’s ...

Jan 31, 201922 min

Episode 8 - Melissa Palmer - Veeam

Does this VM bring me joy? Melissa is Product Strategy Technologist at Veeam and an information technology infrastructure enthusiast, with a focus on virtualization, security, and emerging technologies. Melissa is a VMware Certified Design Expert (VCDX #236), and has held roles such as VMware Engineer, Systems Engineer, Solutions Architect, and Technical Marketing Engineer prior to joining Veeam. You can find Melissa on twitter @vMiss33 or at her blog https://vMiss.net.

Jan 28, 201933 min

Episode 7 - Jamesha "Jam" Fisher - Splice

Jamesha "Jam" Fisher is an infrastructure engineer at Splice. Jamesha has worked in the tech industry for over 15(!) years, with a special interest in security. Graduating with a degree in information assurance and security engineering, they lent their experience to operations and systems engineering at companies like Google and GitHub. In their spare time, Jamesha queers it up, along with being a maker of things musical or delicious and objects that use binary numbers.

Jan 24, 201936 min

Episode 6 - Adam Jacob - Board Member at Chef Software

Ride The On-Call Lightning with Adam Jacob Adam Jacob is a Board Member, CTO and founder of Chef. Adam joins us this week to discuss his world as an on-call engineer. Find out what happens when they call in the "Mr. Wolf" of Oracle on a private jet to get the database back online. Learn about Adam's passion for Open Source while we interject our mutual interest in heavy metal.

Jan 17, 201940 min

Episode - 5 - Kolton Andrus - Gremlin Inc

Fear, Chaos and Pain Common subjects in the Christopher Nolan Batman films, especially when the Joker appears. How do we avoid the moments of fear, chaos and pain in real time? By preparing for it. Today we talk with Gremlin Inc founder and CEO Kolton Andrus. Kolton is co-founder and CEO of Gremlin. Previously, he was a Chaos Engineer at Netflix improving streaming reliability and operating the Edge services. He designed and built F.I.T., Netflix's failure injection service. Prior he improved th...

Jan 10, 201938 min

Episode 4 - Tanya Janca - Microsoft

There's on-call in nearly every aspect of the tech industry, in this episode we will focus on Security. Tanya Janca is a senior cloud advocate for Microsoft, specializing in application and cloud security; evangelizing software security and advocating for developers and operations folks alike through public speaking, her open source project OWASP DevSlop, and various forms of teaching via workshops, blogs and community events. As an ethical hacker, OWASP Project and Chapter Leader, Women in Secu...

Jan 03, 201932 min

Episode 3 - Chris Short - Red Hat

Chris Short has been a proponent of open source solutions throughout his over two decades in various IT disciplines including systems, security, networks, and DevOps engineering and advocacy across the public and private sectors. He currently works on the Ansible team at Red Hat. Chris is a partially disabled US Air Force veteran living with his wife and son in Greater Metro Detroit. Chris writes about DevOps and other topics at chrisshort.net. He also runs the DevOps, Cloud Native, and open sou...

Dec 27, 201837 min

On-Call Nightmares Podcast - Episode 2 - Dan Maher - Datadog

Welcome to the first full-length episode of The On-Call Nightmares Podcast. Dan is a veteran of the original dotcom bubble and has since worked in a variety of environments from start-ups to global corporations, including a stints as a founder, university lecturer, and a day labourer. Today, Dan is a member of the Devopsdays Global team, and a Developer Advocate at Datadog. Twitter: @phrawzty

Dec 20, 201838 min
For the best experience, listen in Metacast app for iOS or Android
Open in Metacast