You get opportunities in tech to work with some of the best people in the world. I got that opportunity when I joined Microsoft; that's where I met the Exchange Goddess! We discuss family, work and how it all comes together when you're on-call. We also discuss the Microsoft Create Startups Event Phoummala will be taking part in: https://www.createstartups.io/ You can register now, for free! Sr Cloud Advocate @ Microsoft, with a background focus on messaging and collaboration, virtualization, and...
Apr 04, 2019•39 min
Get your playbook and have the stats ready, we're talking with Andy Fleener of SportsEngine this week. Andy is a Humanist, Systems Thinker, New View Safety Nerd, Sr. Platform Operations Manager at SportsEngine, DevOps Days MSP Co-Organizer. Twitter: @andyfleener
Mar 28, 2019•42 min
Ever wonder what it was like to do dial-up support hosting in Hawaii? Well this is the damn episode you've waited for your whole life. After 16 years working as a systems/network administrator in the Bay Area, Eric relocated to Portland in 2012 to further develop his passion for awesome configuration management tools. As Puppet's product manager, he worked on extending and improving it for modern infrastructure; his current project is Lyra, a cloud-native workflow engine. Outside of work he enjo...
Mar 20, 2019•30 min
The Conscientious Developer There are great ways to think of how to attack the on-call situation even if you aren't in an on-call rotation. By being a conscientious developer and taking that extra interest in your software after deployment you're adding incredible value. Your co-workers may also really end up appreciating your time a little bit more as well. Some people are born to on-call, and some people have on-call thrust upon them. Andrew Clay Shafer stole good ideas from wherever he co...
Mar 14, 2019•43 min
Welcome back to OCN! This time I chat with the CEO of Raygun, JD Trask. One of the cool parts of this podcast is meeting people from all over the world who have had some experience on-call; JD does his thing in New Zealand! John-Daniel is the CEO and co-founder of Raygun.com, an application monitoring company that helps teams identify hidden performance bottlenecks and software bugs. With over 25 years of experience in software development, JD is a programmer at heart with unique insights into sca...
Mar 07, 2019•35 min
Welcome back to another podcast about downtime! Once again we meet with another technologist who's building a new product and getting it out to the world. This time we meet Damian of Auth0 who's been working with his team to ensure identity services. Damian is a Software Engineer who loves to solve hard problems of any type, especially those related to making software and teams scale. He is a Director of Engineering at Auth0 helping make identity simple for developers. Before Auth0, Damian spe...
Feb 28, 2019•29 min
Content Warning: This episode does contain some graphic description of the work done by an EMT - if you find this troubling you may want to check out another episode! On this episode, I speak with the CTO and founder of VividCortex on his life down on the farm and as an EMT. Baron gives us some insight into how that prepared him for his time on-call in different roles to ensure databases are fast and reliable. Baron is the CTO and founder of VividCortex, the best way to see what your production ...
Feb 21, 2019•41 min
On this edition, Sam shares with me some scary moments from his time at DigitalOcean. Sam tells the tale of a database table that was dropped. https://blog.digitalocean.com/update-on-the-april-5th-2017-outage/ Sam Phippen is a Developer Advocate at Google, and previously an Engineering Manager at DigitalOcean. He's seen his fair share of deep, complex incidents. He has strong opinions about incident management, postmortem culture, and on call practises. He's sad that he can't hug every cat. Twi...
Feb 14, 2019•39 min
In this episode, Jay and J. Paul Reed discuss the need for on-call practices and incident response in the world of software release engineering. Paul shares some great stories, including how the World Series can depend on a single line of code. J. Paul Reed has over twenty years of experience in the trenches as a build/release engineer, working with such companies as VMware, Mozilla, Postbox, Symantec, and Salesforce. In 2012, he founded Release Engineering Approaches, a consultancy incorporating a...
Feb 07, 2019•44 min
Infrastructure Week, Episode 2! Charity and Jay sit down for a discussion on her career and a deep dive into a database incident. You'll get some interesting thoughts on how monitoring has changed in operations. Charity is cofounder and CEO of Honeycomb.io, a startup aimed at debugging complex systems. (“It’s like strace for systems!”) Previously, Charity ran infrastructure at Parse and was an engineering manager at Facebook. She also worked with the RocksDB team to build and deploy the world’s ...
Jan 31, 2019•22 min
Does this VM bring me joy? Melissa is Product Strategy Technologist at Veeam and an information technology infrastructure enthusiast, with a focus on virtualization, security, and emerging technologies. Melissa is a VMware Certified Design Expert (VCDX #236), and has held roles such as VMware Engineer, Systems Engineer, Solutions Architect, and Technical Marketing Engineer prior to joining Veeam. You can find Melissa on twitter @vMiss33 or at her blog https://vMiss.net.
Jan 28, 2019•33 min
Jamesha "Jam" Fisher is an infrastructure engineer at Splice. Jamesha has worked in the tech industry for over 15(!) years, with a special interest in security. Graduating with a degree in information assurance and security engineering, they lent their experience to operations and systems engineering at companies like Google and GitHub. In their spare time, Jamesha queers it up, along with being a maker of things musical or delicious and objects that use binary numbers.
Jan 24, 2019•36 min
Ride The On-Call Lightning with Adam Jacob Adam Jacob is a Board Member, CTO and founder of Chef. Adam joins us this week to discuss his world as an on-call engineer. Find out what happens when they call in the "Mr. Wolf" of Oracle on a private jet to get the database back online. Learn about Adam's passion for Open Source while we interject our mutual interest in heavy metal.
Jan 17, 2019•40 min
Fear, Chaos and Pain Common subjects in the Christopher Nolan Batman films, especially when the Joker appears. How do we avoid the moments of fear, chaos and pain in real time? By preparing for it. Today we talk with Gremlin Inc founder and CEO Kolton Andrus. Kolton is co-founder and CEO of Gremlin. Previously, he was a Chaos Engineer at Netflix improving streaming reliability and operating the Edge services. He designed and built F.I.T., Netflix's failure injection service. Prior he improved th...
Jan 10, 2019•38 min
There's on-call in nearly every aspect of the tech industry; in this episode we will focus on Security. Tanya Janca is a senior cloud advocate for Microsoft, specializing in application and cloud security; evangelizing software security and advocating for developers and operations folks alike through public speaking, her open source project OWASP DevSlop, and various forms of teaching via workshops, blogs and community events. As an ethical hacker, OWASP Project and Chapter Leader, Women in Secu...
Jan 03, 2019•32 min
Chris Short has been a proponent of open source solutions throughout his over two decades in various IT disciplines including systems, security, networks, and DevOps engineering and advocacy across the public and private sectors. He currently works on the Ansible team at Red Hat. Chris is a partially disabled US Air Force veteran living with his wife and son in Greater Metro Detroit. Chris writes about DevOps and other topics at chrisshort.net. He also runs the DevOps, Cloud Native, and open sou...
Dec 27, 2018•37 min
Welcome to the first full-length episode of The On-Call Nightmares Podcast. Dan is a veteran of the original dotcom bubble and has since worked in a variety of environments from start-ups to global corporations, including stints as a founder, university lecturer, and a day labourer. Today, Dan is a member of the Devopsdays Global team, and a Developer Advocate at Datadog. Twitter: @phrawzty
Dec 20, 2018•38 min
A quick preview of what's to come!
Dec 19, 2018•2 min