Extracting Knowledge from Networks: Rumors, Superstars, and Communities

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=KXRO5cZA4p8



Duration: 1:05:35
69 views
1


As social networks such as Facebook and Twitter become larger and more complex, it becomes more and more difficult to extract useful information from them efficiently. However, there is a great deal of knowledge encoded in the structure of these networks. In this talk I will focus on techniques for extracting knowledge from networks using their structure in three situations: finding sources of rumors in networks, predicting the emergence of superstars in networks, and learning community structure in networks. In all three problems I will use the notion of network centrality to extract the relevant knowledge. For the rumor source detection problem, I develop a new centrality notion which I call 'rumor centrality' which is an exact maximum likelihood estimator for the rumor source in certain networks. Rumor centrality counts the number of rumor spreading sequences from a fixed rumor source. I show how the network structure affects one's ability to detect rumor sources and prove asymptotic limits for the detection probability on different network topologies. Simulations performed on real network topologies such as the Internet, US powergrid, and Facebook, show that rumor centrality is an effective rumor source estimator. Superstars are observed in empirical data from Twitter networks. These are nodes with degree on the order of the network size. Twitter networks for specific topics are found to have both a power-law degree distribution and a single superstar node. Common network growth models such as preferential attachment fail to predict the emergence of superstars. To remedy this, I propose a new topological network growth model in which new nodes connect to existing nodes with probability proportional to their ΓÇ£importanceΓÇ¥. If node degree is used as the importance, then this model reduces to preferential attachment. However, if rumor centrality is used as the importance, then this model predicts not only the superstar nodes seen in the Twitter data, but also the power-law degree distribution. This model works for Twitter networks on topics ranging from Wimbledon and the World Cup to music award shows, indicating that there may be a common, topic independent mechanism driving the growth of these networks and the emergence of these superstars. Finally, I propose the 'leader-follower' algorithm for learning community structure in networks. The algorithm is based upon the idea that a community should have leaders which connect it to other communities, and followers, which have no neighbors outside of the community. The algorithm uses network centrality in a differential manner to detect leaders and followers, and then assigns followers to leaders to form communities. The algorithm learns the number and size of communities naturally from the network structure and runs in O(|E|) time for a network with edgeset E. Also, the algorithm is guaranteed to find any communities which are cliques and contain at least one loyal follower. Experiments run on Facebook networks and human brain networks show that the algorithm can find relevant communities and resolve a more detailed community structure than other more common methods such as spectral clustering.




Other Videos By Microsoft Research


2016-08-16On Users' Mental Models of Security Controls
2016-08-16Why Don't Software Developers Use their Tools?
2016-08-16The Mathematics of Side-Channel Attacks
2016-08-16PyPy's Approach to Implementing Dynamic Languages Using a Tracing JIT Compiler
2016-08-16Fine-Grained Power Modeling for Smartphones Using System Call Tracing
2016-08-16Reputational Bargaining Under Knowledge of Rationality
2016-08-16We Will be Right With You: Managing Customers Expectations with Vague Promises and Cheap Talk
2016-08-16Information That Matters: Investigating Relevance of Entities in Social Media Networks
2016-08-16Efficient Bayesian Algorithmic Mechanism Design
2016-08-16Extreme Learning Machine: Learning Without Iterative Tuning
2016-08-16Extracting Knowledge from Networks: Rumors, Superstars, and Communities
2016-08-16Electrical Flows and Laplacian Systems: A New Tool for Graph Algorithms
2016-08-16Limit Theorems in Pseudorandomness and Learning Theory
2016-08-16Empirical Software Engineering, Version 2.0
2016-08-16A Master Bijection for Planar Maps, and Its Applications
2016-08-16Password-based Authenticated Key Exchange at the Cost of Diffie-Hellman
2016-08-16Generalized Identity-Based Encryption
2016-08-16Cryptography with Weak, Noisy, Leaky and Tempered Keys
2016-08-16Electrical Flows and Laplacian Systems: A New Tool for Graph Algorithms
2016-08-16Broadcast Based Peer Review in Open Source Software Development Projects
2016-08-16Optimization under Uncertainty: Understanding the Correlation Gap



Tags:
microsoft research