Managing Large-scale Probabilistic Databases

Channel:

Subscribers:

351,000

Published on September 6, 2016 6:40:39 PM ● Video Link: https://www.youtube.com/watch?v=53rSLXGeslc

Duration: 1:06:51

231 views

For the next generation of data-management applications, such as sensor-based monitoring, data integration, and information extraction, data processing is the dominant cost. Often, the data driving these applications are uncertain, for example, due to missed, inconsistent, or imprecise sensor readings. Unfortunately, traditional data-management systems provide little or no support for managing uncertainty. To remedy this, my dissertation advocates an approach for data management in which uncertainty is modeled using probabilities. The cost of modeling imprecision using probabilities is that basic data-management tasks, such as querying, become theoretically and practically more difficult. Thus, the key challenge in managing large-scale probabilistic data is efficiency. In this talk, I will discuss the fundamental techniques that I developed in my dissertation to build a probabilistic database capable of handling large, imprecise datasets: these techniques include top-k processing with probabilities, materialized views, approximate lineage, and extensional processing for complex analytic queries. This work resulted in two systems: Mystiq, the first system to support complex queries on gigabytes of probabilistic relational data, and Lahar, the first system to support rich event-style queries on large, probabilistic streams.

Other Videos By Microsoft Research

2016-09-06	Securing the Web Platform
2016-09-06	Volumetric Light Transport for Vision and Graphics
2016-09-06	Automatic Workload Evaluation (AWE): Predicting Web 2.0 Workload Behavior
2016-09-06	Using Wireless Sensor Data to Enable Intelligent Cooling Control in Data Centers - Case Studies
2016-09-06	Play: How it Shapes the Brain, Opens the Imagination, and Invigorates the Soul
2016-09-06	Modular verification of concurrent programs with heap
2016-09-06	Designing Robust Enterprise Wireless Networks
2016-09-06	Enabling Easily Learnable Eyes-free Interaction by Exploiting Human Experience
2016-09-06	A Dirty-Slate Approach to Routing Scalability
2016-09-06	Towards Contextual Text Mining
2016-09-06	Managing Large-scale Probabilistic Databases
2016-09-06	Genus-2 curves with a given number of points
2016-09-06	Building a Safer Web
2016-09-06	Network Coded Wireless Architecture
2016-09-06	Virtex-6 and Spartan-6 Overview
2016-09-06	Making ISP (Dynamic Verification for MPI) Practical
2016-09-06	Mobile Personal Sensing Systems: Applications and Architecture
2016-09-06	Enlightened Trial and Error - Gaining Design Insight Through New Prototyping Tools
2016-09-06	Behind the Code with Eric Horvitz
2016-09-06	Seeding Bugs to Find Bugs - Mutation Testing Revisited
2016-09-06	Relevance Heuristics for Program Analysis

Tags:

microsoft research

Channel	Latest
Simple Gamer	6 hours ago
RedCaio	6 hours ago
A TUTTO CALCIO⚽	6 hours ago
Zaxx Gaming	6 hours ago
LEO DESANDE E ANA CLÁUDIA	6 hours ago
Starzkil1z	6 hours ago
rickX lods official	6 hours ago
WraggyTheGamer	6 hours ago
Böröcz "DeadFox" Bence	6 hours ago
Joey Fernandez	6 hours ago
Drachinifel	6 hours ago
UmmeBlox	6 hours ago
Hutton	6 hours ago
CANAL DO MARCIO 🎮🕹	6 hours ago
なすななし	6 hours ago
COSEF NASTYA	6 hours ago
จุ่มค่ะ มากับนุ่นแล้วก็มากับโบว์	6 hours ago
ADIT DIAMOND	6 hours ago
D R P O O - FF	6 hours ago
Ini Guru Budi	6 hours ago
HaDDGamer YT	6 hours ago
Gamer of Andhra	6 hours ago
WBG LEADER	6 hours ago
ちょぶり【eFootball解説】	6 hours ago
AB Sujeet	6 hours ago