Using Statistical Monitoring to Detect Failures in Internet Services [1/8]

Video Link: https://www.youtube.com/watch?v=UQbBLMTeX4Y



Duration: 1:03:08


Today, we are increasingly building large and complex systems whose workings we do not understand, and this lack of understanding translates into systems that are hard to manage and have low availability. The problem is a disconnect between our high-level goals for the system and the low-level visibility and control we have into and over it. To keep a system running, operators must wade through the minutiae of its low-level architecture and implementation. This is not unlike driving a car while looking through a magnifying glass: the driver is both overwhelmed by the details immediately in front of him and unable to focus on more important items on the horizon. A concrete example of this problem is fault detection in Internet services. Current surveys find that over 60) is the time required to simply realize that a service has failed. The challenge is that these Internet services are complex, poorly understood systems, and the correct operation of the application is only defined at a human-layer (I







Tags:
microsoft research