dc.contributor.author: Louca, Soulla P. [en]
dc.contributor.author: Neophytou, Neophytos [en]
dc.contributor.author: Lachanas, Adrianos [en]
dc.contributor.author: Evripidou, Paraskevas [en]
dc.creator: Louca, Soulla P. [en]
dc.creator: Neophytou, Neophytos [en]
dc.creator: Lachanas, Adrianos [en]
dc.creator: Evripidou, Paraskevas [en]
dc.date.accessioned: 2019-11-13T10:41:07Z
dc.date.available: 2019-11-13T10:41:07Z
dc.date.issued: 2000
dc.identifier.uri: http://gnosis.library.ucy.ac.cy/handle/7/54458
dc.description.abstract: In this paper, we propose the design and development of a fault tolerance and recovery scheme for the Message Passing Interface (MPI). The proposed scheme consists of a detection mechanism for detecting process failures, and a recovery mechanism. Two different cases are considered, both assuming the existence of a monitoring process, the Observer, which triggers the recovery procedure in case of failure. In the first case, each process keeps a buffer with its own message traffic to be used in case of failure, while the implementor uses periodic tests for notification of failure by the Observer. The recovery function simulates all the communication of the processes with the dead one by re-sending to the replacement process all the messages destined for the dead one. In the second case, the Observer receives and stores all message traffic, and sends to the replacement all the buffered messages destined for the dead process. Solutions are provided to the dead communicator problem caused by the death of a process. A description of the prototype developed is provided, along with the results of the experiments performed for efficiency and performance. [en]
dc.source: Parallel Processing Letters [en]
dc.source.uri: https://www.scopus.com/inward/record.uri?eid=2-s2.0-0034439137&partnerID=40&md5=02b67a4315e0c52f1bc7d7f217ff1670
dc.subject: Computer simulation [en]
dc.subject: Monitoring [en]
dc.subject: Distributed computer systems [en]
dc.subject: Fault tolerant computer systems [en]
dc.subject: Data communication systems [en]
dc.subject: Computer system recovery [en]
dc.subject: Fault tolerance [en]
dc.subject: Interfaces (computer) [en]
dc.subject: Telecommunication traffic [en]
dc.subject: MPI [en]
dc.subject: Message passing interface [en]
dc.subject: Message traffic [en]
dc.title: MPI-FT: Portable fault tolerance scheme for MPI [en]
dc.type: info:eu-repo/semantics/article
dc.description.volume: 10
dc.description.issue: 4
dc.description.startingpage: 371
dc.description.endingpage: 382
dc.author.faculty: 002 Σχολή Θετικών και Εφαρμοσμένων Επιστημών / Faculty of Pure and Applied Sciences
dc.author.department: Τμήμα Πληροφορικής / Department of Computer Science
dc.type.uhtype: Article [en]
dc.description.notes: Cited By: 44 [en]
dc.source.abbreviation: Parallel Process Lett [en]
dc.contributor.orcid: Evripidou, Paraskevas [0000-0002-2335-9505]
dc.gnosis.orcid: 0000-0002-2335-9505
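
The first case described in the abstract (each process buffers its own outgoing traffic, and on a failure reported by the Observer re-sends to the replacement everything it had addressed to the dead process) can be illustrated with a small simulation. This is only a sketch under stated assumptions: it does not use MPI, and the names `Process`, `Observer`, `sent_log`, `replay_to`, and `check_and_recover` are hypothetical and are not taken from the MPI-FT prototype.

```python
from collections import defaultdict


class Process:
    """A simulated rank that logs every message it sends (hypothetical model, not MPI)."""

    def __init__(self, rank):
        self.rank = rank
        self.alive = True
        self.inbox = []                    # (source rank, payload) pairs received
        self.sent_log = defaultdict(list)  # destination rank -> payloads sent so far

    def send(self, world, dest, payload):
        # Log before delivering, so the message is kept even if dest is dead.
        self.sent_log[dest].append(payload)
        target = world[dest]
        if target.alive:
            target.inbox.append((self.rank, payload))

    def replay_to(self, world, dead_rank):
        # Re-send, in original per-sender order, everything addressed to dead_rank.
        for payload in self.sent_log[dead_rank]:
            world[dead_rank].inbox.append((self.rank, payload))


class Observer:
    """Simulated monitoring process: finds dead ranks and triggers recovery."""

    def check_and_recover(self, world):
        recovered = []
        for rank, proc in list(world.items()):
            if not proc.alive:
                world[rank] = Process(rank)  # replacement starts with empty state
                for peer in world.values():
                    if peer.rank != rank:
                        peer.replay_to(world, rank)
                recovered.append(rank)
        return recovered


# Usage: rank 2 dies, one message is sent while it is down, then it is replaced.
demo = {r: Process(r) for r in range(3)}
demo[0].send(demo, 2, "a")
demo[1].send(demo, 2, "b")
demo[2].alive = False
demo[0].send(demo, 2, "c")  # not delivered, but kept in rank 0's log
Observer().check_and_recover(demo)
print(demo[2].inbox)
```

In this sketch the replay preserves each sender's own ordering but not the global interleaving between senders, and the dead process's own outgoing log is lost with it. The second case in the abstract would instead keep a single log inside the Observer.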