Defining and Enforcing Privacy in Data Publishing

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=fwP7UfDsExQ



Duration: 1:04:46
1,941 views
27


Many organizations, like the Census, hospitals, and search engine companies, wish to altruistically publish unaggregated data about individuals in order to support research. Such data usually contains personal information about the individuals. The challenge is to anonymize this data such that the sensitive information about individuals is not disclosed, while useful aggregate information is preserved. However, such altruistic data releases can lead to egregious leaks of personal information, like in the case of the well publicized AOL data release fiasco in August 2006. In the first part of this talk, I will motivate the need for formally defining privacy by showing attacks on a very popular anonymization technique called k-Anonymity. I will then present my work on L-Diversity, a formal definition of privacy, that provably limits privacy breaches against bounded adversaries. In the second part of my talk, I will present some of the challenges I faced in applying formal privacy definitions to a real Census data publishing application, called OnTheMap. I will also describe the techniques I developed to combat data sparsity and to ensure that useful information was published by OnTheMap. I will conclude by briefly describing a potential application of my work in the development of a privacy-aware platform that may allow web applications to exploit personal data (search & browsing histories, social networks, tags, etc.) to enhance the users' web experience, while provably guaranteeing their privacy.




Other Videos By Microsoft Research


2016-09-06Dependable and Sustainable Cyber-Physical Computing - An Overview of IMPACT Lab's Research
2016-09-06General Theorem Proving for Satisfiability Modulo Theories: An Overview
2016-09-06Real-Time Concurrent Garbage Collection
2016-09-06Virtual Earth Summit - Session 2
2016-09-06Concept Lexicon Construction and Affective Analysis: From Photos to MTV
2016-09-06Cloud Computing for e-Science
2016-09-06Thread-saft dynamic binary translation using transactional memory
2016-09-06Scheduling for multi-carrier wireless systems
2016-09-06Reachability Under Uncertainty & Bayesian Inverse Reinforcement Learning
2016-09-06Exploring large social networks with matrix-based representations
2016-09-06Defining and Enforcing Privacy in Data Publishing
2016-09-06Zero Overhead Verification of Software Programs & On Range Search in Distributed Sensor Networks
2016-09-06XNA Game Studio Workshop - Session One
2016-09-06Reconfigurable Computing: Architectural and Design Tool Challenges
2016-09-06Knowledge sharing and awareness in collaborative computing: Experimental research methods
2016-09-06Multimodal Processing of Human Behavior in Intelligent Instrumented Spaces
2016-09-06Energy Based Models: From Relational Regression to Similarity Metric Learning
2016-09-06Multi-layer architectures for secure communication: information theoretic perspectives
2016-09-06Virtual Earth Summit - Session 4
2016-09-06The Drunkard's Walk: How Randomness Rules our Lives
2016-09-06Hiding global invariants by local reasoning in region logic



Tags:
microsoft research