Fast Database and Data Streaming Operations using Graphics Processors

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=AcPLFRyyVhY



Duration: 55:59
854 views
2


We present novel techniques to utilize the high computational power of graphics processing units (GPUs) to significantly accelerate many of the traditional general purpose algorithms on CPUs. As graphics processors are primarily designed to perform fast display of geometric primitives, we abstract many of the essential database and data mining algorithms using basic graphics operations. Our algorithms use efficient data representations and utilize the inherent parallelism in the single instruction multiple data (SIMD) units and the vector processing functionalities of the GPUs to efficiently evaluate the boolean combinations of predicates, aggregates, and join queries. Graphics processors are optimized for processing data streams. We present deterministic algorithms to efficiently estimate quantiles and frequencies in large data streams. We utilize the high computational power and the memory bandwidth on a GPU to perform sorting on a GPU. The sorting algorithm is used as a main computational component for the construction of epsilon-approximate quantile and frequency summaries. We have applied our algorithm to data streams consisting of more than 100 million elements on a 3.4GHz PC with a NVIDIA 6800 Ultra GPU and achieved 2-4 times performance improvement over optimized CPU-based algorithms. Our recent research focuses on using GPUs for sorting very large databases composed of hundreds of gigabytes of data using low-end commodity PCs. Experimental studies on the SortBenchmark indicate that external sorting is highly memory-intensive. As the GPUs internally have a dedicated memory interface, we present an efficient hybrid sorting algorithm to perform the computation on both the GPU and CPU, in parallel. Experimental results on a low-end PC with a NVIDIA 7800 GTX graphics co-processor indicate higher performance than optimized CPU-based algorithms on a high-end PC with 3.6 GHz Dual Xeon processors.




Other Videos By Microsoft Research


2016-09-06Secure Trusted Overlay Networks for Robust Privacy-Protecting Communication
2016-09-06Dialogue Session: Worklife Balance and the Retention of Talent
2016-09-06From textons to parts: Local image features for texture and object recognition
2016-09-06Efficient Actions in Dynamic Auction Environment
2016-09-06Two Network Coding Talks for the price of one: Security, Low Complexity
2016-09-06Some recent results in camera calibration and shape reconstruction
2016-09-06Implicit Feedback: Techniques for Deployment and Evaluation
2016-09-06Better k-best Parsing, Hypergraphs, and Dynamic Programming
2016-09-06Rock 'n Roll : Earthquake & Disaster Preparedness
2016-09-06Understanding Customers: Shaping Our Future through Understanding Social Change
2016-09-06Fast Database and Data Streaming Operations using Graphics Processors
2016-09-06Hyperparameter and Kernel Learning for Graph Based Semi-Supervised Classification
2016-09-06Multi-Engine Machine Translation Guided by Explicit Word Matching
2016-09-06Using Compression Models to Filter Spam; Exploiting Structural Information for Categorization
2016-09-06The Man Who Knew Too Much: Alan Turing and the Invention of the Computer [1/4]
2016-09-06Estimation of intrinsic dimensionality using high-rate vector quantization
2016-09-06Abducted: How People Come to Believe They Were Kidnapped by Aliens [1/11]
2016-09-06Spontaneous Speech: Challenges and Opportunities for Parsing
2016-09-06Some Recent Advances in Gaussian Mixture Modeling for Speech Recognition
2016-09-06How to Survive a Robot Uprising: Tips to Defend Yourself Against The Coming Rebellion
2016-09-06Body for Life for Women



Tags:
microsoft research