Expressing High Performance Irregular Computations on the GPU

Subscribers:
348,000
Published on ● Video Link: https://www.youtube.com/watch?v=CV3dFkq_vHI



Duration: 56:21
1,151 views
26


A Google TechTalk, presented by Muhammad Osama, 2022/06/07
ABSTRACT: GPUs excel at data analytics problems with ample, regular parallelism. Problems with fine-grained irregular parallelism (where neighboring data elements are assigned different amounts of work), such as those in sparse machine learning and linear algebra, numerical simulation, and graph analytics, are more challenging to map to the GPU. Today's best GPU implementations of irregular-parallel problems employ sophisticated low-level primitives to map irregular amounts of work to the GPU's compute units. Generally, these implementations build application-specific load-balancing techniques that are tightly coupled with application logic. The result is complex code whose load-balance capabilities cannot easily be used in other applications. We describe our implementation of a standalone fine-grained load-balancing framework for GPUs that can address these irregular problems. In our work, we focus on two primary problems: (1) an abstraction that eases programmer complexity by separating the concerns of load balancing from work processing, and (2) interfaces that enable programmers to target load-balanced applications, load-balanced kernel launches, and/or in-kernel load-balancing collectives.

About the Speaker: Muhammad Osama is a Ph.D. candidate advised by Professor John Owens, in the Electrical and Computer Engineering department at the University of California, Davis. Muhammad's current research focuses on General Purpose GPU Computing (GPGPU), specifically GPU load-balancing for dense and sparse workloads. He is also the lead developer of Gunrock, a GPU graph analytics library, and has been a part of DARPA's HIVE (a sparse computation accelerator) and SDH (Software Defined Hardware) projects. Muhammad graduated from the University of Washington, Seattle, with a bachelor's degree in Electrical Engineering with a research focus on real-time graphics.




Other Videos By Google TechTalks


2022-10-25Brandon Tory | CEO & Co-Founder of Formless | web3 talks | Sep 8th 2022 | MC: Raphael Hyde
2022-10-25Raullen Chai | CEO & Co-founder of IoTex | web3 talks | Oct 6th 2022 | Hosted by Raphael Hyde
2022-10-25Peter Schiff | CEO & Chief global strategist of Euro Pacific Cap Inc | web3 talks | Sep 29th 2022
2022-10-25Raoul Pal | CEO of RealVision, GMI, etc. | web3 talks | Sep 29th 2022 | Hosted by Raphael Hyde
2022-10-20Robust Design Discovery and Exploration in Bayesian Optimization
2022-09-20Master Equation for Discrete-Time Stackelberg Mean Field Games
2022-09-12Graph Attention Retrospective
2022-09-09Bayesian Optimization in the Wild: Risk-Averse Decisions and Budget Constraints
2022-07-16Fast Linear Algebra for Distance Matrices
2022-07-12Deep Learning 2.0: How Bayesian Optimization May Power the Next Generation of DL by Frank Hutter
2022-06-13Expressing High Performance Irregular Computations on the GPU
2022-05-24Building Developer Assistants that Think Fast and Slow
2022-05-052022 Blockly Developers Summit: Bad Blocks
2022-05-052022 Blockly Developers Summit: Debugging in Blockly
2022-05-052022 Blockly Developers Summit: Year in Review and Roadmap
2022-05-052022 Blockly Developers Summit: Customizing Blockly
2022-05-052022 Blockly Developers Summit: Blockly at Google - Scratch for CS First
2022-05-052022 Blockly Developers Summit: Serialization
2022-05-052022 Blockly Developers Summit: Block Definitions - Past, Present, and Future
2022-05-052022 Blockly Developers Summit: TypeScript Migration
2022-05-052022 Blockly Developers Summit: Contributing to Blockly