Discrete Event Execution and Reversibility: Challenges in the Path to Asynchrony for Massively Parallel Computing

Abstract

To keep up with the increasing number of processing elements in parallel/distributed computing, traditional tightly-coupled time-stepped models must give way to asynchronous models, such that the coupling among model components across processors is relaxed. Two challenges in defining mathematical models amenable to efficient asynchronous execution are: (1) the ability to define/determine discrete events of changes to component state over time with guaranteed bounds on stability and accuracy, despite the staggering of updates, and (2) the ability to take the model backward in time with minimal memory cost, in order to make corrections to local computations that may occur due to relaxation of global synchrony. We illustrate these considerations in some applications of interest, such as molecular dynamics and fluid dynamics, and allude to some ways in which applied mathematics research could impact asynchronous computing.

[Pub 127]

http://www.siam.org/meetings/jmm12/

Kalyan Perumalla
Kalyan Perumalla

As a Federal Program Manager in Advanced Scientific Computing Research at the U.S. Dept. of Energy, Office of Science, Kalyan Perumalla manages a $100-million R&D portfolio covering AI, HPC, Quantum, SciDAC, and Basic Computer Science. In his 25-year R&D leadership experience, he previously led advanced R&D as Distinguished Research Staff Member at the Oak Ridge National Laboratory (ORNL) developing scalable software and applications on the world’s largest supercomputers for 17 years, including as a line manager and a founding group leader. He has held senior faculty and adjunct appointments at UTK, GT, and UNL, and was an IAS Fellow at Durham University.

Next
Previous

Related