Understanding Oversmoothing in Graph Neural Networks (GNNs): Insights from Two Theoretical Studies

Video Link: https://www.youtube.com/watch?v=MLiEoJOhXJA
Duration: 59:46


A Google TechTalk, presented by Xinyi Wu, 2024-01-18
A Google Algorithm Seminar. ABSTRACT: Oversmoothing in Graph Neural Networks (GNNs) refers to the phenomenon where increasing network depth leads to increasingly homogeneous node representations. Over the last few years, it has remained one of the central challenges in building more powerful GNNs. In this talk, I will discuss two recent papers on this phenomenon and provide some new insights.

The first work studies why oversmoothing happens at a relatively shallow depth in GNNs. By carefully analyzing the oversmoothing mechanisms in a stylized formulation, we distinguish between adverse mixing, which homogenizes nodes across different classes, and beneficial denoising within the same class. We quantify these two effects on random graphs sampled from the Contextual Stochastic Block Model (CSBM) and show that oversmoothing occurs once the mixing effect starts to dominate the denoising effect. We establish that the number of layers required for this transition is O(log N / log log N) for sufficiently dense graphs with N nodes. We also extend our analysis to study the effects of Personalized PageRank (PPR), or equivalently, of initial residual connections, on oversmoothing, and shed light on when and why they might not be an ideal solution to the problem.
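The mixing-versus-denoising trade-off described above can be illustrated with a small numerical sketch. The snippet below is not from the paper: it simulates repeated row-normalized neighborhood averaging on a toy CSBM-style graph (parameters `p_in`, `p_out`, and the feature model are illustrative assumptions) and tracks the between-class gap (signal, destroyed by mixing) against the within-class spread (noise, reduced by denoising).

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy CSBM-style graph: two classes of 50 nodes each, denser within a class
# (p_in) than across classes (p_out). Parameters are illustrative only.
n, p_in, p_out = 50, 0.3, 0.05
labels = np.repeat([0, 1], n)
P = np.where(labels[:, None] == labels[None, :], p_in, p_out)
U = rng.random((2 * n, 2 * n))
A = np.triu((U < P).astype(float), 1)
A = A + A.T + np.eye(2 * n)               # symmetric adjacency with self-loops
W = A / A.sum(axis=1, keepdims=True)      # row-normalized aggregation operator

# Scalar node features: class mean (+/-1) plus unit Gaussian noise.
X = np.where(labels == 0, -1.0, 1.0) + rng.normal(0.0, 1.0, 2 * n)

for k in [0, 2, 10, 50]:
    H = np.linalg.matrix_power(W, k) @ X
    gap = abs(H[labels == 0].mean() - H[labels == 1].mean())   # signal
    noise = H[labels == 0].std() + H[labels == 1].std()        # within-class spread
    print(f"{k:2d} rounds: between-class gap {gap:.3f}, within-class spread {noise:.3f}")
```

At small depth the within-class spread shrinks quickly while the between-class gap is largely preserved (denoising dominates); at large depth the gap collapses as well (mixing dominates), which is the oversmoothing transition the talk quantifies.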

In the second work, we study oversmoothing in attention-based GNNs, such as Graph Attention Networks (GATs) and transformers. Treating attention-based GNNs as dynamical systems, our study demonstrates that the graph attention mechanism cannot prevent oversmoothing, and that these models lose expressive power exponentially in depth. From a technical point of view, the proposed framework significantly extends existing results on oversmoothing: it can account for asymmetric, state-dependent, and time-varying aggregation operators, as well as a wide range of common nonlinear activation functions, such as ReLU, LeakyReLU, GELU, and SiLU.
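The key observation that attention produces a row-stochastic, state-dependent aggregation operator can also be sketched numerically. The toy below is not the GAT formulation from the paper (real GAT scores use a learned LeakyReLU of concatenated projected features); `a` and `attention_layer` are hypothetical simplifications. Even so, because each layer applies a row-stochastic matrix with strictly positive weights on a connected neighborhood structure, repeated application still drives the features toward consensus.

```python
import numpy as np

rng = np.random.default_rng(1)

# A ring of 8 nodes; each node attends to itself and its two neighbors.
n = 8
A = np.zeros((n, n))
for i in range(n):
    A[i, [i, (i + 1) % n, (i - 1) % n]] = 1.0

X = rng.normal(size=(n, 2))   # 2-dimensional node features
a = rng.normal(size=2)        # toy attention parameter (hypothetical)

def attention_layer(H):
    """One round of softmax aggregation whose weights depend on the state."""
    scores = H @ a                                    # per-node scalar score
    logits = np.where(A > 0, scores[None, :], -np.inf)
    P = np.exp(logits - logits.max(axis=1, keepdims=True))
    P = P / P.sum(axis=1, keepdims=True)              # row-stochastic, GAT-like
    return P @ H

spread = lambda H: H.std(axis=0).sum()
H = X.copy()
for _ in range(60):
    H = attention_layer(H)
print(f"feature spread: initial {spread(X):.3f}, after 60 layers {spread(H):.3f}")
```

The feature spread collapses despite the aggregation weights being asymmetric and changing with the state at every layer, consistent with the talk's claim that attention alone cannot prevent oversmoothing.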

The talk is based on the following papers: https://arxiv.org/abs/2212.10701, https://arxiv.org/abs/2305.16102. This is joint work with Amir Ajorlou (MIT), Zhengdao Chen (NYU/Google), William Wang (MIT), Zihui Wu (Caltech), and Ali Jadbabaie (MIT).

ABOUT THE SPEAKER: Xinyi Wu is a fourth-year Ph.D. student in the Institute for Data, Systems, and Society (IDSS) at the Massachusetts Institute of Technology (MIT), advised by Professor Ali Jadbabaie. She is affiliated with the Laboratory for Information and Decision Systems (LIDS) and is a recipient of the MIT Michael Hammer Fellowship. She is interested in applied graph theory, dynamical systems, networks, and machine learning on graphs. Her work on oversmoothing in GNNs was selected as a Spotlight paper at NeurIPS 2023.



