dc.contributor.author | Filippas, Dionysios | en |
dc.contributor.author | Peltekis, Christodoulos | en |
dc.contributor.author | Dimitrakopoulos, Giorgos | en |
dc.contributor.author | Nicopoulos, Chrysostomos | en |
dc.creator | Filippas, Dionysios | en |
dc.creator | Peltekis, Christodoulos | en |
dc.creator | Dimitrakopoulos, Giorgos | en |
dc.creator | Nicopoulos, Chrysostomos | en |
dc.date.accessioned | 2023-12-19T09:47:26Z | |
dc.date.available | 2023-12-19T09:47:26Z | |
dc.date.issued | 2023-07-07 | |
dc.identifier.isbn | 979-8-3503-3267-4 | |
dc.identifier.issn | 2834-9857 | |
dc.identifier.uri | http://gnosis.library.ucy.ac.cy/handle/7/65817 | en |
dc.description.abstract | The acceleration of deep-learning kernels in hardware relies on matrix multiplications that are executed efficiently on Systolic Arrays (SA). To effectively trade off deep-learning training/inference quality with hardware cost, SA accelerators employ reduced-precision Floating-Point (FP) arithmetic. In this work, we demonstrate the need for new pipeline organizations to reduce latency and improve energy efficiency of reduced-precision FP operators for the chained multiply-add operation imposed by the structure of the SA. The proposed skewed pipeline design reorganizes the pipelined operation of the FP multiplyadd units to enable new forwarding paths for the exponent logic, which allow for parallel execution of the pipeline stages of consecutive PEs. As a result, the latency of the matrix multiplication operation within the SA is significantly reduced with minimal hardware cost, thereby yielding an energy reduction of 8% and 11% for the examined state-of-the-art CNNs. | en |
dc.language.iso | eng | en |
dc.publisher | IEEE | en |
dc.source | IEEE 5th International Conference on Artificial Intelligence Circuits and Systems (AICAS) 2023 | en |
dc.source.uri | https://doi.org/10.1109/AICAS57966.2023.10168556 | en |
dc.source.uri | https://ieeexplore.ieee.org/document/10168556 | en |
dc.title | Reduced-Precision Floating-Point Arithmetic in Systolic Arrays with Skewed Pipelines | en |
dc.type | info:eu-repo/semantics/article | en |
dc.identifier.doi | 10.1109/AICAS57966.2023.10168556 | en |
dc.author.faculty | 007 Πολυτεχνική Σχολή / Faculty of Engineering | |
dc.author.department | Τμήμα Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών / Department of Electrical and Computer Engineering | |
dc.type.uhtype | Article | en |
dc.contributor.orcid | Nicopoulos, Chrysostomos [0000-0001-6389-6068] | |
dc.contributor.orcid | Filippas, Dionysios [0000-0002-4729-3336] | |
dc.contributor.orcid | Dimitrakopoulos, Giorgos [0000-0003-3688-7865] | |
dc.type.subtype | CONFERENCE_PROCEEDINGS | en |
dc.gnosis.orcid | 0000-0001-6389-6068 | |
dc.gnosis.orcid | 0000-0002-4729-3336 | |
dc.gnosis.orcid | 0000-0003-3688-7865 | |