Detection of Silent Data Corruptions in Smoothed Particle Hydrodynamics Simulations

Cavelan, Aurélien and Ciorba, Florina M. and Cabezón, Ruben M.. (2018) Detection of Silent Data Corruptions in Smoothed Particle Hydrodynamics Simulations.

Full text not available from this repository.

Official URL: https://edoc.unibas.ch/68677/

Downloads: Statistics Overview


Soft errors, such as silent data corruptions (SDCs) hinder the correctness of large-scale scientific applications. Ghost replication (GR) is proposed herein as the first SDCs detector relying on the fast error propagation inherent to applications that employ the smooth particle hydrodynamics (SPH) method. GR follows a two-steps selective replication scheme. First, an algorithm selects which particles to replicate on a different process. Then, a different algorithm detects SDCs by comparing the data of the selected particles with the data of their ghost. The overhead and scalability of the proposed approach are assessed through a set of strong-scaling experiments conducted on a large HPC system under error-free conditions, using upwards of 3, 000 cores. The results show that GR achieves a recall and precision similar to that of full replication methods, at only a fraction of the cost, with detection rates of 91−99.9%, no false-positives, and an overhead of 1−10%.
Faculties and Departments:05 Faculty of Science > Departement Mathematik und Informatik > Informatik > High Performance Computing (Ciorba)
UniBasel Contributors:Cavelan, Aurélien and Ciorba, Florina M. and Cabezon, Ruben M.
Item Type:Other
Note:Publication type according to Uni Basel Research Database: Other publications
Related URLs:
Last Modified:28 Oct 2020 14:11
Deposited On:28 Oct 2020 14:11

Repository Staff Only: item control page