Detailed modeling, design, and evaluation of a scalable multi-level checkpointing system AT Moody, G Bronevetsky, KM Mohror, BR de Supinski Lawrence Livermore National Laboratory (LLNL), Livermore, CA, 2010 | 841* | 2010 |
Design, modeling, and evaluation of a scalable multi-level checkpointing system A Moody, G Bronevetsky, K Mohror, BR De Supinski High Performance Computing, Networking, Storage and Analysis (SC), 2010 …, 2010 | 837 | 2010 |
The Spack package manager: bringing order to HPC software chaos T Gamblin, M LeGendre, MR Collette, GL Lee, A Moody, BR de Supinski, ... Proceedings of the International Conference for High Performance Computing …, 2015 | 348 | 2015 |
The design, deployment, and evaluation of the CORAL pre-exascale systems SS Vazhkudai, BR de Supinski, AS Bland, A Geist, J Sexton, J Kahle, ... Proceedings of the International Conference for High Performance Computing …, 2018 | 193 | 2018 |
Design and modeling of a non-blocking checkpointing system K Sato, N Maruyama, K Mohror, A Moody, T Gamblin, BR de Supinski, ... Proceedings of the International Conference on High Performance Computing …, 2012 | 150 | 2012 |
An ephemeral burst-buffer file system for scientific applications T Wang, K Mohror, A Moody, K Sato, W Yu Proceedings of the International Conference for High Performance Computing …, 2016 | 147 | 2016 |
McrEngine: a scalable checkpointing system using data-aware aggregation and compression TZ Islam, K Mohror, S Bagchi, A Moody, BR De Supinski, R Eigenmann Scientific Programming 21 (3-4), 149-163, 2013 | 138 | 2013 |
VeloC: Towards High Performance Adaptive Asynchronous Checkpointing at Large Scale B Nicolae, A Moody, E Gonsiorowski, K Mohror, F Cappello | 105 | 2019 |
Truenorth ecosystem for brain-inspired computing: scalable systems, software, and applications J Sawada, F Akopyan, AS Cassidy, B Taba, MV Debole, P Datta, ... High Performance Computing, Networking, Storage and Analysis, SC16 …, 2016 | 103 | 2016 |
Design of a scalable InfiniBand topology service to enable network-topology-aware placement of processes H Subramoni, S Potluri, K Kandalla, B Barth, J Vienne, J Keasler, ... Proceedings of the International Conference on High Performance Computing …, 2012 | 85 | 2012 |
A 1 PB/s file system to checkpoint three million MPI tasks R Rajachandrasekar, A Moody, K Mohror, DK Panda Proceedings of the 22nd international symposium on High-performance parallel …, 2013 | 84 | 2013 |
A 1 PB/s File System to Checkpoint Three Million MPI Tasks A Moody, K Mohror, K Dhabaleswar | 84* | 2013 |
I/O Characterization and Performance Evaluation of BeeGFS for Deep Learning F Chowdhury, Y Zhu, T Heer, S Paredes, A Moody, R Goldstone, ... Proceedings of the 48th International Conference on Parallel Processing, 80, 2019 | 83 | 2019 |
A user-level infiniband-based file system and checkpoint strategy for burst buffers K Sato, K Mohror, A Moody, T Gamblin, BR De Supinski, N Maruyama, ... Cluster, Cloud and Grid Computing (CCGrid), 2014 14th IEEE/ACM International …, 2014 | 83 | 2014 |
Entropy-Aware I/O Pipelining for Large-Scale Deep Learning on HPC Systems Y Zhu, F Chowdhury, H Fu, A Moody, K Mohror, K Sato, W Yu | 82* | |
Scalable NIC-based reduction on large-scale clusters A Moody, J Fernandez, F Petrini, DK Panda Proceedings of the 2003 ACM/IEEE conference on Supercomputing, 59, 2003 | 76 | 2003 |
Machine Learning Predictions of Runtime and IO Traffic on High-End Clusters R McKenna, S Herbein, A Moody, T Gamblin, M Taufer Cluster Computing (CLUSTER), 2016 IEEE International Conference on, 255-258, 2016 | 63 | 2016 |
Hot-spot avoidance with multi-pathing over infiniband: An mpi perspective A Vishnu, M Koop, A Moody, AR Mamidala, S Narravula, DK Panda Cluster Computing and the Grid, 2007. CCGRID 2007. Seventh IEEE …, 2007 | 58 | 2007 |
PRIONN: Predicting Runtime and IO using Neural Networks MR Wyatt II, S Herbein, T Gamblin, A Moody, DH Ahn, M Taufer Proceedings of the 47th International Conference on Parallel Processing, 46, 2018 | 57 | 2018 |
Managing I/O interference in a shared burst buffer system S Thapaliya, P Bangalore, J Lofstead, K Mohror, A Moody Parallel Processing (ICPP), 2016 45th International Conference on, 416-425, 2016 | 50 | 2016 |