dc.contributor.author | Farquhar, William G. | en |
dc.contributor.author | Evripidou, Paraskevas | en |
dc.creator | Farquhar, William G. | en |
dc.creator | Evripidou, Paraskevas | en |
dc.date.accessioned | 2019-11-13T10:40:01Z | |
dc.date.available | 2019-11-13T10:40:01Z | |
dc.date.issued | 1994 | |
dc.identifier.isbn | 0-8186-5602-6 | |
dc.identifier.uri | http://gnosis.library.ucy.ac.cy/handle/7/53917 | |
dc.description.abstract | This paper introduces the mechanisms required to perform fault detection and recovery in the DART multiprocessor architecture. The DART multiprocessors uses prioritized data-driven scheduling to ensure that multiple hard and soft deadlines are met. A data-driven checkpointing scheme has been developed that ensures that these deadlines are met even in the case of processor failures. The basic approach is to monitor the behavior of each computational thread by means of hardware timers. The results of a thread are released only if the thread completes before its given timeout period expires. Otherwise, the partial computation on the processor is discarded and the thread is rescheduled on a different processor. A strategy to statically predict the system performance in the event of multiple processor failures is presented and evaluated. Simulation results are provided to illustrate the fault detection and recovery response times for single processor failures on DART multiprocessor architectures with 2,3,8,16 and 32 processing elements. | en |
dc.publisher | Publ by IEEE | en |
dc.source | Proceedings of the International Conference on Parallel Processing | en |
dc.source | Proceedings of the 8th International Parallel Processing Symposium | en |
dc.source.uri | https://www.scopus.com/inward/record.uri?eid=2-s2.0-0027986011&partnerID=40&md5=63d94c59ba3991fc7e4132f1b766028f | |
dc.subject | Computers | en |
dc.subject | Computer architecture | en |
dc.subject | Multiprocessing systems | en |
dc.subject | Fault tolerant computer systems | en |
dc.subject | Fault detection | en |
dc.subject | Scheduling | en |
dc.subject | Computer hardware | en |
dc.subject | Checkpointing | en |
dc.subject | Data driven scheduling | en |
dc.title | Fault detection and recovery in a data-driven real-time multiprocessor | en |
dc.type | info:eu-repo/semantics/conferenceObject | |
dc.description.startingpage | 769 | |
dc.description.endingpage | 774 | |
dc.author.faculty | 002 Σχολή Θετικών και Εφαρμοσμένων Επιστημών / Faculty of Pure and Applied Sciences | |
dc.author.department | Τμήμα Πληροφορικής / Department of Computer Science | |
dc.type.uhtype | Conference Object | en |
dc.description.notes | <p>Sponsors: IEEE Computer Society | en |
dc.description.notes | ACM SIGARCH | en |
dc.description.notes | Conference code: 20747 | en |
dc.description.notes | Cited By :1</p> | en |
dc.contributor.orcid | Evripidou, Paraskevas [0000-0002-2335-9505] | |
dc.gnosis.orcid | 0000-0002-2335-9505 | |