Data Access Optimization in Accelerator-oriented Heterogeneous Architecture through Decoupling and Memory Hierarchy Specialization
tarafından
Ham, Tae Jun, author.
Başlık
:
Data Access Optimization in Accelerator-oriented Heterogeneous Architecture through Decoupling and Memory Hierarchy Specialization
Yazar
:
Ham, Tae Jun, author.
ISBN
:
9780438048522
Yazar Ek Girişi
:
Ham, Tae Jun, author.
Fiziksel Tanımlama
:
1 electronic resource (161 pages)
Genel Not
:
Source: Dissertation Abstracts International, Volume: 79-10(E), Section: B.
Advisors: Margaret R. Martonosi; Juan L. Aragon Committee members: David I. August; Sharad Malik; David Wentzlaff.
Özet
:
For the past fifty years, Moore's Law and Dennard Scaling have been playing important roles in both performance and energy efficiency of computer systems. Unfortunately, they are not likely to continue, and computers no longer benefit from technology scaling as much as they did in the past. Recently, specialized hardware accelerators have emerged as a promising alternative to general-purpose computing for their potential to achieve orders of magnitude speedup and energy efficiency improvements on compute-intensive applications. However, achieving the full potential of accelerators on data-intensive applications remains a challenge since the bottlenecks of such applications do not lie on computation, but data movement. It is particularly problematic because data accesses have become large parts of today's important workloads used for data analytics and scientific computing.
To address this limitation, this thesis presents hardware and software techniques which can be utilized to design a system that can effectively accelerate data-intensive workloads. Specifically, this thesis addresses the two most important aspects in accelerating such workloads ---hiding memory latency and reducing memory bandwidth consumption. First, this thesis attacks the memory latency challenge in accelerator-oriented systems by proposing the Decoupled Supply-Compute (DeSC) framework which provides latency tolerance to accelerators without programmer effort. DeSC utilizes hardware specialization and compiler support to enable a specialized core to work as a high-performance decoupled data supplier, which supplies data to accelerators ahead-of-time to avoid exposing memory latency to them. Second, this thesis presents a way to attack the memory bandwidth challenge for accelerators through the use of customized memory hierarchy and data access optimizations. Specifically, this thesis focuses on graph analytics and presents Graphicionado, a specialized accelerator which effectively accelerates memory bandwidth-bound graph analytics and demonstrates that even such applications can benefit from customized hardware designs.
In summary, this thesis investigates the memory wall challenge in the era of specialization and presents data access optimizations which enables data-intensive workloads to benefit from specialized, heterogeneous systems without being limited by data accesses. With a trend of exponentially increasing demand for data-intensive computing, the techniques presented in this thesis will work as useful tools for acceleration of such important workloads.
Notlar
:
School code: 0181
Konu Başlığı
:
Computer engineering.
Computer science.
Electrical engineering.
Tüzel Kişi Ek Girişi
:
Princeton University. Electrical Engineering.
Elektronik Erişim
:
| Yer Numarası | Demirbaş Numarası | Shelf Location | Shelf Location | Holding Information |
|---|
| XX(681789.1) | 681789-1001 | Proquest E-Tez Koleksiyonu | Proquest E-Tez Koleksiyonu | |