Title | ||
---|---|---|
Efficient Resource Sharing in Concurrent Error Detecting Superscalar Microarchitectures |
Abstract | ||
---|---|---|
Previous proposals for soft-error tolerance have called for redundantly executing a program as two concurrent threads on a superscalar microarchitecture. In a balanced superscalar design, the extra workload from redundant execution induces a severe performance penalty due to increased contention for resources throughout the datapath. This paper identifies and analyzes four key factors that affect the performance of redundant execution, namely 1) issue bandwidth and functional unit contention, 2) issue queue and reorder buffer capacity contention, 3) decode and retirement bandwidth contention, and 4) coupling between redundant threads' dynamic resource requirements. Based on this analysis, we propose the SHREC microarchitecture for asymmetric and staggered redundant execution. This microarchitecture addresses the four factors in an integrated design without requiring prohibitive additional hardware resources. In comparison to conventional single-threaded execution on a state-of-the-art superscalar microarchitecture with comparable cost, SHREC reduces the average performance penalty to within 4% on integer and 15% on floating-point SPEC2K benchmarks by sharing resources more efficiently between the redundant threads. |
Year | DOI | Venue |
---|---|---|
2004 | 10.1109/MICRO.2004.19 | MICRO |
Keywords | Field | DocType |
staggered redundant execution,redundant thread,increased contention,retirement bandwidth contention,efficient resource,functional unit contention,shrec microarchitecture,redundant execution,reorder buffer capacity contention,conventional single-threaded execution,concurrent error detecting superscalar,state-of-the-art superscalar microarchitecture,floating point,resource sharing,bandwidth,decoding,microarchitecture,resource management,soft error,functional unit,hardware | Resource management,Datapath,Yarn,Computer science,Parallel computing,Thread (computing),Real-time computing,Bandwidth (signal processing),Shared resource,Re-order buffer,Embedded system,Microarchitecture | Conference |
ISSN | ISBN | Citations |
1072-4451 | 0-7695-2126-6 | 28 |
PageRank | References | Authors |
1.12 | 13 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Jared C. Smolens | 1 | 109 | 4.52 |
Jangwoo Kim | 2 | 447 | 35.38 |
James C. Hoe | 3 | 2048 | 141.34 |
Babak Falsafi | 4 | 4147 | 242.18 |