Performance potential of data dependence speculation & collapsing

Sazeides, Yiannakis; Vassiliadis, Stamatis; Smith, James E.

Conference Object

Date

1996

Author

Sazeides, Yiannakis
Vassiliadis, Stamatis
Smith, James E.

Publisher

IEEE

Source

Proceedings of the Annual International Symposium on Microarchitecture
Proceedings of the 1996 29th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO-29

Pages

238-247

Google Scholar check

Keyword(s):

Data dependence collapsing

Data dependence speculation

Metadata

Show full item record

Abstract

Two hardware methods for remedying the effects of true data dependences are studied. The first method, dependence speculation, is used to eliminate address generation-load dependences. This is enabled by address prediction that permits load instructions to proceed speculatively without waiting for their address operands. The second technique, dependence collapsing, is used to eliminate data dependences by combining a dependence among multiple instructions into one instruction. The potential of these techniques for improving processor performance is demonstrated via trace-driven simulation. When both techniques are used with maximum issue widths of 4, 8, 16, and 32, the overall speedups in comparison to a base instruction level parallel machine are 1.20, 1.35, 1.51, and 1.66, respectively. In general, dependence collapsing contributes the majority of the improvement in performance. Under the dependence collapsing model, 29% to 47% of the total number of instructions in a trace may be collapsed. The distance separating the collapsed instructions is nearly always less than 8. Our experimentation also suggests that further performance improvements can be achieved by incorporating mechanisms that increase the address prediction rate.

Links

https://www.scopus.com/inward/record.uri?eid=2-s2.0-0030409867&partnerID=40&md5=2d01bdeda9323bba1dba4339eae4738c