User-level failure detection and auto-recovery of parallel programs in HPC systems
Crossref DOI link: https://doi.org/10.1007/s11704-020-0190-y
Published Online: 2021-09-01
Published Print: 2021-12
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Zhang, Guozhen
Liu, Yi
Yang, Hailong
Xu, Jun
Qian, Depei
Text and Data Mining valid from 2021-09-01
Version of Record valid from 2021-09-01
Article History
Received: 10 May 2020
Accepted: 5 November 2020
First Online: 1 September 2021