Streaming Dilated Convolution Engine
Date: 2023-01-09
ISSN: 1557-9999
Publisher: IEEE
Source: IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Volume: 31
Issue: 3
Pages: 401-405
Abstract
Convolution is one of the most critical operations in various application domains and its computation should combine high performance with energy efficiency. This requirement is critical both for standard convolution and for its other spatial variants, such as dilated, strided, or transposed convolutions. In this work, we focus on the design of a streaming convolution engine, called LazyDCstream, that is tuned for dilated convolution. LazyDCstream utilizes a sliding-window architecture for input data reuse and leverages the already-known decomposition of dilated convolution to: (a) maximize window buffer sharing and (b) enable “lazy” data movement that keeps data transfers per clock cycle as few as possible, and, most importantly, independent of the dilation rate. These two architectural features reduce the power consumption relative to efficient streaming convolution engines without introducing any throughput or area penalty.
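The abstract refers to an already-known decomposition of dilated convolution without spelling it out. One commonly used form, assumed here, splits a convolution with dilation rate d into d dense convolutions, each applied to one phase of the input (samples p, p+d, p+2d, ...), and interleaves their outputs. The following is a minimal 1D software sketch of that identity only; it does not model the LazyDCstream hardware, its window buffers, or its data movement, and all function names are hypothetical.

```python
def dilated_conv1d(x, w, d):
    """Direct 1D dilated convolution (cross-correlation form, valid outputs only)."""
    k = len(w)
    span = (k - 1) * d + 1  # receptive field of the dilated kernel
    return [sum(w[j] * x[i + j * d] for j in range(k))
            for i in range(len(x) - span + 1)]


def conv1d(x, w):
    """Standard dense 1D convolution, i.e. dilation rate 1."""
    return dilated_conv1d(x, w, 1)


def dilated_conv1d_decomposed(x, w, d):
    """Same result computed as d dense convolutions on phase-subsampled inputs."""
    k = len(w)
    n_out = len(x) - ((k - 1) * d + 1) + 1
    y = [0] * max(n_out, 0)
    for p in range(d):
        phase = x[p::d]          # samples p, p+d, p+2d, ...
        sub = conv1d(phase, w)   # dense convolution on this phase
        for m, v in enumerate(sub):
            i = p + m * d        # position of this result in the full output
            if i < n_out:
                y[i] = v
    return y


x = list(range(10))
w = [1, 2, 3]
direct = dilated_conv1d(x, w, 2)
decomposed = dilated_conv1d_decomposed(x, w, 2)
print(direct == decomposed)
```

Because each phase is processed as an ordinary dense convolution, the per-output work pattern does not change with d in this sketch; only the interleaving of phases does. That is the property the abstract appears to build on when it describes data movement that is independent of the dilation rate.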