Facebook's BIG Software Failure
The Facebook crash was a big software failure, and headline news, but what caused it? On Monday Facebook, Instagram and WhatsApp all went down. The Facebook outage affected people all around the world. This was widely reported as a configuration mistake, and it was, but it was also a more complex failure than only that. Big systems are complex dynamic things, and fail in ways that are difficult to predict, but we can learn a lot from this, and other failures at Facebook.
In this episode well known software developer Dave Farley looks at the Facebook software issues at the root of two different outages in 2021 and analyses what went wrong and what lessons Facebook, and other software development teams, can learn from these incidents.
-------------------------------------------------------------------------------------
📚 BOOKS:
📖 Dave’s NEW BOOK "Modern Software Engineering" is now available on
Amazon ➡️ https://amzn.to/3DwdwT3
In this book, Dave brings together his ideas and proven techniques to describe a durable, coherent and foundational approach to effective software development, for programmers, managers and technical leads, at all levels of experience.
📖 "Continuous Delivery Pipelines" by Dave Farley
paperback ➡️ https://amzn.to/3gIULlA
ebook version ➡️ https://leanpub.com/cd-pipelines
📖 The original, award-winning "Continuous Delivery" book by Dave Farley and Jez Humble
➡️ https://amzn.to/2WxRYmx
NOTE: If you click on one of the Amazon Affiliate links and buy the book, Continuous Delivery Ltd. will get a small fee for the recommendation with NO increase in cost to you.
-------------------------------------------------------------------------------------
Also from Dave:
🎓 CD TRAINING COURSES
If you want to learn Continuous Delivery and DevOps skills, check out Dave Farley's courses
➡️ https://bit.ly/DFTraining
📧 JOIN CD MAIL LIST 📧
Keep up to date with the latest discussions, free "How To..." guides, events and online courses. ➡️ https://bit.ly/MailListCD
-------------------------------------------------------------------------------------
Links:
Facebooks’ Statement on the Outage ➡️ https://bit.ly/3oEzEqi
Development Approach at Facebook ➡️ https://research.fb.com/wp-content/uploads/2016/11/development-and-deployment-at-facebook.pdf
Hysteresis in Control Systems Defined ➡️ https://en.wikipedia.org/wiki/Hysteresis#Control_systems
Common Causes of Production Failure “Simple Testing can Prevent Production Failure” ➡️ https://www.usenix.org/system/files/conference/osdi14/osdi14-paper-yuan.pdf
-------------------------------------------------------------------------------------
CHANNEL SPONSORS:
Equal Experts is a product software development consultancy with a network of over 1,000 experienced technology consultants globally. They increase the pace of innovation by using modern software engineering practices that embrace Continuous Delivery, Security, and Operability from the outset ➡️ https://bit.ly/3ASy8n0
Harness helps engineers and developers simplify and scale CI/CD, Feature Flags and Cloud Cost Management with an AI-powered platform for software delivery. ➡️ https://bit.ly/3Cfx3qI
Octopus are the makers of Octopus Deploy the single place for your team to manage releases, automate deployments, and automate the runbooks that keep your software operating. ➡️ https://octopus.com/
SpecFlow Behavior Driven Development for .NET SpecFlow helps teams bind automation to feature files and share the resulting examples as Living Documentation across the team and stakeholders. ➡️ https://go.specflow.org/dave_farley
Other Videos By Modern Software Engineering
2021-12-08 | Software Crimes That Send Developers To JAIL |
2021-12-05 | The Fundamentals Of Software Development | Martin Fowler In The Engineering Room Ep. 1 |
2021-12-01 | What It Takes To Be A Software Engineer |
2021-11-26 | What is DevOps? | SHORTS |
2021-11-24 | How To Be A GREAT Programmer |
2021-11-17 | The BEST and WORST Places To Work For Developers |
2021-11-10 | Is DevOps Good Or Bad? |
2021-11-03 | Is This Why You’re Bad At Programming? |
2021-10-27 | Getting Started With Microservices |
2021-10-20 | How Did Twitch Get Hacked? |
2021-10-13 | Facebook's BIG Software Failure |
2021-10-06 | Getting Started with TDD - Part 2 |
2021-09-29 | Is AGILE Better Than KANBAN? |
2021-09-22 | Git Flow Is A Bad Idea |
2021-09-19 | Defining Microservices | SHORTS |
2021-09-15 | Is Software Development The Most Difficult Job ITW |
2021-09-12 | What Is BDD? | SHORTS |
2021-09-08 | Test Driven Development Tutorial For Beginners |
2021-09-01 | How To Test Software Performance |
2021-08-25 | Breaking Bad Software Development Habits |
2021-08-18 | 3 Ways OOP Changed The World |