dc.contributor.author: Titopoulos, Vasileios (en)
dc.contributor.author: Alexandridis, Kosmas (en)
dc.contributor.author: Peltekis, Christodoulos (en)
dc.contributor.author: Nicopoulos, Chrysostomos (en)
dc.contributor.author: Dimitrakopoulos, Giorgos (en)
dc.creator: Titopoulos, Vasileios (en)
dc.creator: Alexandridis, Kosmas (en)
dc.creator: Peltekis, Christodoulos (en)
dc.creator: Nicopoulos, Chrysostomos (en)
dc.creator: Dimitrakopoulos, Giorgos (en)
dc.date.accessioned: 2023-12-19T16:09:38Z
dc.date.available: 2023-12-19T16:09:38Z
dc.date.issued: 2024-03
dc.identifier.uri: http://gnosis.library.ucy.ac.cy/handle/7/65822 (en)
dc.description.abstract: Structured sparsity has been proposed as an efficient way to prune the complexity of modern Machine Learning (ML) applications and to simplify the handling of sparse data in hardware. The acceleration of ML models - for both training and inference - relies primarily on equivalent matrix multiplications that can be executed efficiently on vector processors or custom matrix engines. The goal of this work is to incorporate the simplicity of structured sparsity into vector execution, thereby accelerating the corresponding matrix multiplications. Toward this objective, a new vector index-multiply-accumulate instruction is proposed, which enables the implementation of low-cost indirect reads from the vector register file. This reduces unnecessary memory traffic and increases data locality. The proposed new instruction was integrated into a decoupled RISC-V vector processor with negligible hardware cost. Extensive evaluation demonstrates significant speedups of 1.80x-2.14x, as compared to state-of-the-art vectorized kernels, when executing layers of varying sparsity from state-of-the-art Convolutional Neural Networks (CNNs). (en)
dc.language.iso: eng (en)
dc.publisher: IEEE (en)
dc.source: Design, Automation and Test in Europe Conference [DATE] 2024 (en)
dc.source.uri: https://doi.org/10.48550/arXiv.2311.07241 (en)
dc.source.uri: https://arxiv.org/abs/2311.07241 (en)
dc.title: IndexMAC: A Custom RISC-V Vector Instruction to Accelerate Structured-Sparse Matrix Multiplications (en)
dc.type: info:eu-repo/semantics/article (en)
dc.identifier.doi: 10.48550/arXiv.2311.07241
dc.author.faculty: 007 Πολυτεχνική Σχολή / Faculty of Engineering
dc.author.department: Τμήμα Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών / Department of Electrical and Computer Engineering
dc.type.uhtype: Article (en)
dc.contributor.orcid: Nicopoulos, Chrysostomos [0000-0001-6389-6068]
dc.contributor.orcid: Dimitrakopoulos, Giorgos [0000-0003-3688-7865]
dc.type.subtype: CONFERENCE_PROCEEDINGS (en)
dc.gnosis.orcid: 0000-0001-6389-6068
dc.gnosis.orcid: 0000-0003-3688-7865
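
Note on the abstract: the record does not define the semantics of the proposed instruction, so the following is only a minimal software sketch of the general idea it describes, namely multiplying a structured-sparse (N:M) matrix by a dense one while selecting operand rows by index from a small preloaded tile. The function names, the (value, index) encoding, and the use of a Python list as a stand-in for the vector register file are all assumptions made for illustration and are not taken from the paper.

```python
# Illustrative sketch only: N:M structured-sparse matrix multiply in plain Python.
# All names and the data layout are hypothetical; this is not the paper's
# instruction definition or its kernel.

def compress_n_m(row, n, m):
    """Split a row into blocks of m elements and keep, per block,
    the nonzeros as (value, index_within_block) pairs (at most n)."""
    assert len(row) % m == 0, "row length must be a multiple of m"
    blocks = []
    for start in range(0, len(row), m):
        nonzeros = [(v, i) for i, v in enumerate(row[start:start + m]) if v != 0]
        assert len(nonzeros) <= n, "row violates the N:M sparsity pattern"
        blocks.append(nonzeros)
    return blocks


def sparse_matmul(a, b, n, m):
    """Compute a @ b where every row of `a` follows an N:M sparsity pattern."""
    cols = len(b[0])
    out = []
    for row in a:
        acc = [0] * cols
        for block_id, nonzeros in enumerate(compress_n_m(row, n, m)):
            # Stand-in for a register-file tile: m rows of b loaded once per block.
            tile = b[block_id * m:(block_id + 1) * m]
            for value, idx in nonzeros:
                selected = tile[idx]  # indirect read: the index picks the operand row
                acc = [x + value * y for x, y in zip(acc, selected)]
        out.append(acc)
    return out


def dense_matmul(a, b):
    """Reference dense product, used only to check the sketch."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]
```

In this sketch, calling `sparse_matmul(a, b, 2, 4)` on a matrix `a` with at most two nonzeros in every group of four columns gives the same result as `dense_matmul(a, b)`, while touching only the rows of `b` that the stored indices select.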

