Plan and PRACTICE for better incident response with insights from Tim Armandpour, CTO of PagerDuty. Learn the secrets to resilience from the team that mitigated the impact of a major outage—handling a 250% traffic surge while delivering on their SLA.
Listen to find out:
- 🛠️ Why planning AND practice are both critical for incident response.
- 🚧 How to practice for incident response (e.g Failure Fridays with Chaos Engineering)
- 🧑🤝🧑 Ownership: Why tech AND business teams must join post-mortems.
- ☁️ How to mitigate the impact of your cloud provider’s lower SLA.
- ⚓ Which architectural patterns are more resilient?
- ⚖️ WARNING: “bend” the CAP theorem at your own risk
Listen here: https://alphalist.com/podcast/113-tim-armandpour-cto-pagerduty
#113 - Faster Incident Response feat. Tim Armandpour // CTO @ PagerDuty | alphalist.CTO Podcast - For CTOs and Technical Leaders - Listen or read transcript on Metacast