The Art of Embracing Failure at Scale - Adrian Hornsby - REdeploy 2019

Channel: Confreaks
Subscribers: 42,400
Video Link: https://www.youtube.com/watch?v=iE3qSY82FIY



Duration: 40:26
85 views


Mistakes. Bad judgment. Errors. Failures. They are all part of our engineering lives. While many regard them as undesirable aspects of engineering, failures are important and even beneficial. Failures will certainly happen, and they will come in many forms, some expected and some unexpected. It’s therefore important to embrace failure. The question is how to limit its blast radius. In this talk, I will discuss a range of blast-radius reduction design techniques used at AWS and by our customers, including isolation, bulkheads, cells, and sharding. I will also discuss how embracing failure impacts our operational practices.

Adrian is a principal technical evangelist at Amazon Web Services and is based in the Nordics. He has over 15 years of experience in the IT industry, having worked as a software and systems engineer; a backend, web, and mobile developer; and part of DevOps teams where his focus has been on cloud infrastructure and site reliability, writing application software, deploying servers, and managing large-scale architectures. The truth is that Adrian loves breaking stuff—controlled chaos and resiliency is his thing. Adrian frequently speaks at conferences and community meetups and blogs at https://medium.com/@adhorn.



To engage in the conversation on Resilience Engineering in the tech industry, tweet with the hashtag #REdeployConf.

For more information about REdeploy, check out https://re-deploy.io




Other Videos By Confreaks


2022-08-25 Approaching Overload: Automation as Fellow Responders - Marisa Grayson - REdeploy 2019
2022-08-25 Beyond Blameless - Rein Henrichs - REdeploy 2019
2022-08-25 Collaboration, Coordination, Co-Design CoEvolution - Jabe Bloom - REdeploy 2019
2022-08-25 Trajectory of Chaos - Casey Rosenthal - REdeploy 2019
2022-08-25 REdeploy 2019 Speaker Panel - Day 1
2022-08-25 The Meat of It - Ryan Kitchens - REdeploy 2019
2022-08-25 Resilience Engineering Mythbusting - Will Gallego - REdeploy 2019
2022-08-25 Document Yourself: A Framework for Career Advancement - Michelle Brenner - REdeploy 2019
2022-08-25 The Practice of Practice: Teamwork in Complexity - Matt Davis - REdeploy 2019
2022-08-25 Getting Comfortable With Being Under Water - Ronnie Chen - REdeploy 2019
2022-08-25 The Art of Embracing Failure at Scale - Adrian Hornsby - REdeploy 2019
2022-08-25 A Few Observations on the Marvelous Resilience of Bone & Resilience Engineering - Dr. Richard Cook
2022-08-25 REdeploy 2019 Welcome & Opening
2022-08-25 GRCon19 - Prototyping LTE-WiFi Interworking on a Single SDR Platform by Walter Nitzold
2022-08-25 GRCon19 - Demonstration of GNU Radio Compatibility with a NASA Space... by David Miller
2022-08-25 GRCon19 - UAS Community Testbed Architecture for Advanced Wireless Research with... by Vuk Marojevic
2022-08-25 GRCon19 - Performance Evaluation of MIMO Techniques With an SDR-Based Prototype by Evariste Some
2022-08-25 GRCon19 - VLBI with GNU Radio and White Rabbit by Paul Boven
2022-08-25 GRCon 2019 - Thursday Lightning Talks
2022-08-25 GRCon19 - Managing Latency in Continuous GNU Radio Flowgraphs by Matt Ettus
2022-08-25 GRCon19 - Enabling Precise Timing Control in SDRs by Srikanth Pagadarai



Tags:
REdeploy