Increasing Concurrency using EDGE Architectures

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=qNUER1eUQyI



Duration: 57:56
156 views
4


For the past 15 years microprocessor performance has largely been driven by improvements in clock frequency, which has doubled every two years. However, due to microarchitectural pipelining limits and device limits, this frequency growth will significantly slow down, and in-fact has already slowed down since 2004. Hence, future processors must exploit concurrency to provide performance improvements. In this talk, I will describe novel architecture and microarchitecture mechanisms to exploit concurrency developed as part of the TRIPS project at UT-Austin. First, I describe EDGE ISAs - a new class of ISAs (Instruction Set Architecture) that efficiently express concurrency to the hardware through dataflow graphs. Second, I describe the TRIPS processor which employs a distributed microarchitecture to implement an EDGE ISA. I will explain the key design principles and insights in both the ISA and the microarchitecture. I will focus on three main aspects: (1) amortizing the overheads of bookkeeping for each instruction, (2) expressing dependences efficiently to the hardware, and (3) eliminating centralized resources. Using this approach, we support both programs with irregular concurrency and regular concurrency. We use predication and other compiler heuristics to build large blocks of instructions and express the dependences within this block explicitly in the ISA. To support irregular concurrency, we use control speculation in the hardware and exploit locality to mine concurrency using the hardware. For regular concurrency, where the compiler can identify the parallelism, we studied the fundamental properties of these programs. We discovered many similarities and some surprising differences in the memory behavior and control flow of these programs. Based on this study, we develop a small set of spanning microarchitecture mechanisms to more efficiently execute these programs. In this talk, I will also briefly describe the implementation of the prototype TRIPS chip we have built at UT-Austin.




Other Videos By Microsoft Research


2016-09-06Adaptation=Vulnerability under RoQ Attacks [1/2]
2016-09-06End-User Control in the Smart Home
2016-09-06Exiting the cleanroom: on ecological validity and ubiquitous computing
2016-09-06Memory Model = Instruction Reordering + Store Atomicity
2016-09-06From Wayback Machine to WebLab: New Opportunities for Social Research
2016-09-06Remarks by Senator Obama
2016-09-06Automating the Construction of Compiler Heuristics using Machine Learning
2016-09-06Capacity and Fairness Issues in Enterprise-class Wireless Mesh Networks
2016-09-06Towards Accurate Internet Distance Prediction
2016-09-06Guanxi (The Art of Relationships) : Microsoft, China, and Bill Gates's Plan to Win the Road Ahead
2016-09-06Increasing Concurrency using EDGE Architectures
2016-09-06Decision Procedures for Recursive Data Structures with Integer Arithmetic
2016-09-06Supporting Construction, Analysis, and Understanding of Software Models.
2016-09-06Program Verification via Three-Valued Logic Analysis
2016-09-06Efficient Data Dissemination in Bandwidth-Asymmetric P2P Networks
2016-09-06Tractable Learning of Structured Prediction Models
2016-09-06Future Hype: The Myths of Technology Change
2016-09-06Improving Packet Delivery Efficiency Using Multi-Radio Diversity in Wireless LANs
2016-09-06Algorithmic Foundations of P2P and Wireless Networks
2016-09-06Semi-unsupervised learning of taxonomic and non-taxonomic relationships from the web
2016-09-06The Weather Makers: How Man is Changing the Climate and What it Means for Life on Earth



Tags:
microsoft research