An online and real-time fault detection and localization mechanism for network-on-chip architectures
Date
2016
Author
Chrysanthou, Kypros
Englezakis, Panayiotis
Prodromou, Andreas
Panteli, Andreas

Sazeides, Yiannakis
Dimitrakopoulos, Giorgos N.
ISSN
1544-3566
Source
ACM Transactions on Architecture and Code Optimization
Volume
13
Issue
2
Abstract
Networks-on-Chip (NoC) are becoming increasingly susceptible to emerging reliability threats. The need to detect and localize the occurrence of faults at runtime is steadily becoming imperative. In this work, we propose NoCAlert, a comprehensive online and real-time fault detection and localization mechanism that demonstrates 0% false negatives within the interconnect for the fault models and stimulus set used in this study. Based on the concept of invariance checking, NoCAlert employs a group of lightweight microchecker modules that collectively implement real-time hardware assertions. The checkers operate concurrently with normal NoC operation, thus eliminating the need for periodic or trigger-based self-testing. Based on the pattern/signature of asserted checkers, NoCAlert can pinpoint the location of the fault at various granularity levels. Most important, 97% of the transient and 90% of the permanent faults are detected instantaneously, within a single clock cycle of fault manifestation. The fault localization accuracy ranges from 90% to 100%, depending on the desired localization granularity. Extensive cycle-accurate simulations in a 64-node CMP and analysis at the RTL netlist-level demonstrate the efficacy of the proposed technique. © 2016 ACM.
DOI
10.1145/2930670
Related items
Showing items related by title, author, creator and subject.
-
Book Chapter
Fault Diagnosis for Uncertain Networked Systems
Boem, Francesca; Keliris, Christodoulos; Parisini, Thomas; Polycarpou, Marios M. (Springer International Publishing, 2018)
Fault diagnosis has been at the forefront of technological developments for several decades. Recent advances in many engineering fields have led to the networked interconnection of various systems. The increased complexity ...
-
Conference Object
Modelling and Assessment of Short-Term Electromagnetic Interference on a Railway System from Pole-to-Ground Faults on HVDC Cable Networks
Charalambous, Charalambos A.; Dimitriou, A.; Gonos, I.; Papadopoulos, T. A. (2019)
This paper describes some fundamental modelling principles that are necessary to assess the impact of HVDC pole-to-ground faults on a railway system. Particular emphasis is given on the appropriate modelling of the HVDC ...
-
Article
Fault tolerant maximum likelihood event localization in sensor networks using binary data
Michaelides, M. P.; Panayiotou, Christos G. (2009)