Performance potential of data dependence speculation & collapsing
Date
1996Author
Sazeides, YiannakisVassiliadis, Stamatis
Smith, James E.
Publisher
IEEESource
Proceedings of the Annual International Symposium on MicroarchitectureProceedings of the 1996 29th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO-29
Pages
238-247Google Scholar check
Keyword(s):
Metadata
Show full item recordAbstract
Two hardware methods for remedying the effects of true data dependences are studied. The first method, dependence speculation, is used to eliminate address generation-load dependences. This is enabled by address prediction that permits load instructions to proceed speculatively without waiting for their address operands. The second technique, dependence collapsing, is used to eliminate data dependences by combining a dependence among multiple instructions into one instruction. The potential of these techniques for improving processor performance is demonstrated via trace-driven simulation. When both techniques are used with maximum issue widths of 4, 8, 16, and 32, the overall speedups in comparison to a base instruction level parallel machine are 1.20, 1.35, 1.51, and 1.66, respectively. In general, dependence collapsing contributes the majority of the improvement in performance. Under the dependence collapsing model, 29% to 47% of the total number of instructions in a trace may be collapsed. The distance separating the collapsed instructions is nearly always less than 8. Our experimentation also suggests that further performance improvements can be achieved by incorporating mechanisms that increase the address prediction rate.