Extracting Knowledge from Networks: Rumors, Superstars, and Communities

Channel:

Subscribers:

351,000

Published on August 17, 2016 12:37:44 AM ● Video Link: https://www.youtube.com/watch?v=KXRO5cZA4p8

Duration: 1:05:35

69 views

As social networks such as Facebook and Twitter become larger and more complex, it becomes more and more difficult to extract useful information from them efficiently. However, there is a great deal of knowledge encoded in the structure of these networks. In this talk I will focus on techniques for extracting knowledge from networks using their structure in three situations: finding sources of rumors in networks, predicting the emergence of superstars in networks, and learning community structure in networks. In all three problems I will use the notion of network centrality to extract the relevant knowledge. For the rumor source detection problem, I develop a new centrality notion which I call 'rumor centrality' which is an exact maximum likelihood estimator for the rumor source in certain networks. Rumor centrality counts the number of rumor spreading sequences from a fixed rumor source. I show how the network structure affects one's ability to detect rumor sources and prove asymptotic limits for the detection probability on different network topologies. Simulations performed on real network topologies such as the Internet, US powergrid, and Facebook, show that rumor centrality is an effective rumor source estimator. Superstars are observed in empirical data from Twitter networks. These are nodes with degree on the order of the network size. Twitter networks for specific topics are found to have both a power-law degree distribution and a single superstar node. Common network growth models such as preferential attachment fail to predict the emergence of superstars. To remedy this, I propose a new topological network growth model in which new nodes connect to existing nodes with probability proportional to their ΓÇ£importanceΓÇ¥. If node degree is used as the importance, then this model reduces to preferential attachment. However, if rumor centrality is used as the importance, then this model predicts not only the superstar nodes seen in the Twitter data, but also the power-law degree distribution. This model works for Twitter networks on topics ranging from Wimbledon and the World Cup to music award shows, indicating that there may be a common, topic independent mechanism driving the growth of these networks and the emergence of these superstars. Finally, I propose the 'leader-follower' algorithm for learning community structure in networks. The algorithm is based upon the idea that a community should have leaders which connect it to other communities, and followers, which have no neighbors outside of the community. The algorithm uses network centrality in a differential manner to detect leaders and followers, and then assigns followers to leaders to form communities. The algorithm learns the number and size of communities naturally from the network structure and runs in O(|E|) time for a network with edgeset E. Also, the algorithm is guaranteed to find any communities which are cliques and contain at least one loyal follower. Experiments run on Facebook networks and human brain networks show that the algorithm can find relevant communities and resolve a more detailed community structure than other more common methods such as spectral clustering.

Other Videos By Microsoft Research

2016-08-16	On Users' Mental Models of Security Controls
2016-08-16	Why Don't Software Developers Use their Tools?
2016-08-16	The Mathematics of Side-Channel Attacks
2016-08-16	PyPy's Approach to Implementing Dynamic Languages Using a Tracing JIT Compiler
2016-08-16	Fine-Grained Power Modeling for Smartphones Using System Call Tracing
2016-08-16	Reputational Bargaining Under Knowledge of Rationality
2016-08-16	We Will be Right With You: Managing Customers Expectations with Vague Promises and Cheap Talk
2016-08-16	Information That Matters: Investigating Relevance of Entities in Social Media Networks
2016-08-16	Efficient Bayesian Algorithmic Mechanism Design
2016-08-16	Extreme Learning Machine: Learning Without Iterative Tuning
2016-08-16	Extracting Knowledge from Networks: Rumors, Superstars, and Communities
2016-08-16	Electrical Flows and Laplacian Systems: A New Tool for Graph Algorithms
2016-08-16	Limit Theorems in Pseudorandomness and Learning Theory
2016-08-16	Empirical Software Engineering, Version 2.0
2016-08-16	A Master Bijection for Planar Maps, and Its Applications
2016-08-16	Password-based Authenticated Key Exchange at the Cost of Diffie-Hellman
2016-08-16	Generalized Identity-Based Encryption
2016-08-16	Cryptography with Weak, Noisy, Leaky and Tempered Keys
2016-08-16	Electrical Flows and Laplacian Systems: A New Tool for Graph Algorithms
2016-08-16	Broadcast Based Peer Review in Open Source Software Development Projects
2016-08-16	Optimization under Uncertainty: Understanding the Correlation Gap

Tags:

microsoft research

Channel	Latest
Ahmad Faqih Productions	6 hours ago
Dominador Soltes	6 hours ago
asepfahadsultana	6 hours ago
OST Archive	6 hours ago
RESULT CHANNEL	6 hours ago
Tiandee	6 hours ago
Amani Zahira	6 hours ago
justF	6 hours ago
Chavez Channel	6 hours ago
The Gaming Door	6 hours ago
Still One	6 hours ago
Rukesan	6 hours ago
Villain Ki Haveli	6 hours ago
Cyber Crumbs	6 hours ago
grim's Lab tv	6 hours ago
Ggyu tube	6 hours ago
Power_diplomacy	6 hours ago
Zeemira Putri	6 hours ago
Ranaji Gaming	6 hours ago
MatrixAndrAIa	6 hours ago
FBN BOOM	6 hours ago
Shey Joki	6 hours ago
DA REAL FRANCA ADVENTURES	6 hours ago
John Christian Mateo	6 hours ago
Immortal Suyou	6 hours ago