Algorithm-based fault recovery of adaptively refined parallel multilevel grids
On future extreme scale computers, it is expected that faults will become an increasingly serious problem as the number of individual components grows and failures become more frequent. This is driving the interest in designing algorithms with built-in fault tolerance that can continue to operate and that can replace data even if part of the computation is lost in a failure. For fault-free computations, the use of adaptive refinement techniques in combination with finite element methods is well...[Show more]
|Collections||ANU Research Publications|
|Source:||International Journal of High Performance Computing Applications|
|02 Stals Algorithm-based fault recovery 2017. pdf||2.5 MB||Adobe PDF||Request a copy|
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.