dMPI: Facilitating Debugging of MPI Programs via Deterministic Message Passing - Network and Parallel Computing
Conference Papers Year : 2012

dMPI: Facilitating Debugging of MPI Programs via Deterministic Message Passing

Abstract

This paper presents a novel deterministic MPI implementation (dMPI) to facilitate the debugging of MPI programs. Distinct from existing approaches, dMPI ensures inherent determinism without using any external support (e.g., logs), which achieves convenience and performance simultaneously. The basic idea of dMPI is to use deterministic logical time to solve message races and control asynchronous transmissions, thus we could eliminate the nondeterministic behaviors of the existing message passing mechanism. To avoid deadlocks introduced by dMPI, we also integrate dMPI with a lightweight deadlock checker to dynamically detect and solve these deadlocks. We have implemented dMPI and evaluated it using NPB benchmarks. The results show that dMPI could guarantee determinism with incurring modest overhead (8% on average).
Fichier principal
Vignette du fichier
978-3-642-35606-3_20_Chapter.pdf (176.1 Ko) Télécharger le fichier
Origin Files produced by the author(s)
Loading...

Dates and versions

hal-01551348 , version 1 (30-06-2017)

Licence

Identifiers

Cite

Xu Zhou, Kai Lu, Xicheng Lu, Xiaoping Wang, Baohua Fan. dMPI: Facilitating Debugging of MPI Programs via Deterministic Message Passing. 9th International Conference on Network and Parallel Computing (NPC), Sep 2012, Gwangju, South Korea. pp.172-179, ⟨10.1007/978-3-642-35606-3_20⟩. ⟨hal-01551348⟩
81 View
81 Download

Altmetric

Share

More