DevOpsDays Chicago 2017 - Getting Good At System Failure Analysis by Paul Hinze

Channel:
Subscribers:
42,400
Published on ● Video Link: https://www.youtube.com/watch?v=4UFxCegosFc



Duration: 30:53
182 views
4


DevOpsDays Chicago 2017 - Getting Good At System Failure Analysis by Paul Hinze

Every failure is a mystery to be solved. Solving those mysteries is a skill that can be honed. Let’s talk about how to get better at figuring out what’s up when things go wrong! This is a talk full of both high level advice and concrete tips from somebody who loves fixing weird production issues.

What does it mean to be good at debugging production issues? That’s the question we’ll explore in this talk! I’ll be sharing a grab bag of the postures, practices, tips, and tricks I’ve learned from years hanging out near production.

Running production systems are not always designed for operability, and yet we still need to fix them. Thusly, my goal is to share techniques that apply across a range of operational maturity levels. This breaks down into a few sections:

Adopting a productive attitude towards failures
Learning to love logs, wherever you may find them
Guerrilla systems thinking and domain modeling
Code reading for failure analysis
Collaborating to remediate and solve production issues
Production failure analysis has been one of the most rewarding skills that I’ve built up in my career. I hope that after this talk you’ll have a few tools to walk away with, but - more importantly - you’ll be inspired to get better at responding to failures.




Other Videos By Confreaks


2017-09-26DevOpsDays Chicago 2017 - Ignites- Containers, Virtual Machines... by Nell Shamrell-Harrington
2017-09-26DevOpsDays Chicago 2017 - Ignites- How to DevOpsDays by Joe Nuspl
2017-09-26DevOpsDays Chicago 2017 - Delivering Continuous Security with Docker by Matthew Schlue
2017-09-26DevOpsDays Chicago 2017 - You Have A Data Lake, Now What? by Alison Stanton
2017-09-26DevOpsDays Chicago 2017 - Burnout: Community Problem & Community Solution by Jason Yee
2017-09-26DevOpsDays Chicago 2017 - Graphs: The Fabric of DevOps by Ashley Sun
2017-09-26DevOpsDays Chicago 2017 - DevOps Practices for the Database Team by Pramod Sadalage
2017-09-26DevOpsDays Chicago 2017 - Devaluing Hard Work by Katie Prizy
2017-09-26DevOpsDays Chicago 2017 - Automating myself out of a job... by Jahmel Harris
2017-09-26DevOpsDays Chicago 2017 - Serverless Architecture in Azure by Rob Richardson
2017-09-26DevOpsDays Chicago 2017 - Getting Good At System Failure Analysis by Paul Hinze
2017-09-26DevOpsDays Chicago 2017 - Diversity is Not Just a Checklist by Rhea Ghosh
2017-09-26DevOpsDays Chicago 2017 - Security, Don't Fear the DevOps by Bill Weiss
2017-09-26DevOpsDays Chicago 2017 - Hacking Human Systems by Jeff Smith
2017-09-01RustConf 2017 - Closing Keynote: Safe Systems Software and the Future of Computing by Joe Duffy
2017-09-01RustConf 2017 - Fast, Safe, Pure-Rust Elliptic Curve Cryptography
2017-09-01RustConf 2017 - Improving Rust Performance Through Profiling and Benchmarking by Steve Jenson
2017-09-01RustConf 2017 - Type System Tips for the Real World by Sean Griffin
2017-09-01RustConf 2017 - Menhir and Friends: the State of the Art of Parsing in Rust by Naomi Testard
2017-09-01RustConf 2017 - Shipping a Solid Rust Crate by Michael Gattozzi
2017-09-01RustConf 2017 - Building Rocket by Sergio Benitez