Detailed modeling, design, and evaluation of a scalable multi-level checkpointing system AT Moody, G Bronevetsky, KM Mohror, BR de Supinski Lawrence Livermore National Laboratory (LLNL), Livermore, CA, 2010 | 696* | 2010 |
Design, modeling, and evaluation of a scalable multi-level checkpointing system A Moody, G Bronevetsky, K Mohror, BR De Supinski High Performance Computing, Networking, Storage and Analysis (SC), 2010 …, 2010 | 693 | 2010 |
Design and modeling of a non-blocking checkpointing system K Sato, N Maruyama, K Mohror, A Moody, T Gamblin, BR de Supinski, ... Proceedings of the International Conference on High Performance Computing …, 2012 | 127 | 2012 |
The Spack package manager: bringing order to HPC software chaos T Gamblin, M LeGendre, MR Collette, GL Lee, A Moody, BR de Supinski, ... Proceedings of the International Conference for High Performance Computing …, 2015 | 114 | 2015 |
McrEngine: a scalable checkpointing system using data-aware aggregation and compression TZ Islam, K Mohror, S Bagchi, A Moody, BR De Supinski, R Eigenmann Scientific Programming 21 (3-4), 149-163, 2013 | 114 | 2013 |
The design, deployment, and evaluation of the CORAL pre-exascale systems SS Vazhkudai, BR de Supinski, AS Bland, A Geist, J Sexton, J Kahle, ... Proceedings of the International Conference for High Performance Computing …, 2018 | 81 | 2018 |
Design of a scalable InfiniBand topology service to enable network-topology-aware placement of processes H Subramoni, S Potluri, K Kandalla, B Barth, J Vienne, J Keasler, ... Proceedings of the International Conference on High Performance Computing …, 2012 | 76 | 2012 |
An ephemeral burst-buffer file system for scientific applications T Wang, K Mohror, A Moody, K Sato, W Yu Proceedings of the International Conference for High Performance Computing …, 2016 | 75 | 2016 |
A 1 PB/s file system to checkpoint three million MPI tasks R Rajachandrasekar, A Moody, K Mohror, DK Panda Proceedings of the 22nd international symposium on High-performance parallel …, 2013 | 68 | 2013 |
Scalable NIC-based reduction on large-scale clusters A Moody, J Fernandez, F Petrini, DK Panda Proceedings of the 2003 ACM/IEEE conference on Supercomputing, 59, 2003 | 67 | 2003 |
A user-level InfiniBand-based file system and checkpoint strategy for burst buffers K Sato, K Mohror, A Moody, T Gamblin, BR De Supinski, N Maruyama, ... Cluster, Cloud and Grid Computing (CCGrid), 2014 14th IEEE/ACM International …, 2014 | 65 | 2014 |
TrueNorth ecosystem for brain-inspired computing: scalable systems, software, and applications J Sawada, F Akopyan, AS Cassidy, B Taba, MV Debole, P Datta, ... High Performance Computing, Networking, Storage and Analysis, SC16 …, 2016 | 59 | 2016 |
Hot-spot avoidance with multi-pathing over InfiniBand: An MPI perspective A Vishnu, M Koop, A Moody, AR Mamidala, S Narravula, DK Panda Cluster Computing and the Grid, 2007. CCGRID 2007. Seventh IEEE …, 2007 | 57 | 2007 |
FMI: Fault tolerant messaging interface for fast and transparent recovery K Sato, A Moody, K Mohror, T Gamblin, BR de Supinski, N Maruyama, ... Parallel and Distributed Processing Symposium, 2014 IEEE 28th International …, 2014 | 34 | 2014 |
Designing non-blocking allreduce with collective offload on InfiniBand clusters: A case study with conjugate gradient solvers K Kandalla, U Yang, J Keasler, T Kolev, A Moody, H Subramoni, K Tomko, ... Parallel & Distributed Processing Symposium (IPDPS), 2012 IEEE 26th …, 2012 | 33 | 2012 |
Managing I/O interference in a shared burst buffer system S Thapaliya, P Bangalore, J Lofstead, K Mohror, A Moody Parallel Processing (ICPP), 2016 45th International Conference on, 416-425, 2016 | 31 | 2016 |
Detailed modeling and evaluation of a scalable multilevel checkpointing system K Mohror, A Moody, G Bronevetsky, BR de Supinski IEEE Transactions on Parallel and Distributed Systems 25 (9), 2255-2263, 2014 | 30 | 2014 |
Machine Learning Predictions of Runtime and IO Traffic on High-End Clusters R McKenna, S Herbein, A Moody, T Gamblin, M Taufer Cluster Computing (CLUSTER), 2016 IEEE International Conference on, 255-258, 2016 | 26 | 2016 |
Integrated in-system storage architecture for high performance computing D Kimpe, K Mohror, A Moody, B Van Essen, M Gokhale, R Ross, ... Proceedings of the 2nd International Workshop on Runtime and Operating …, 2012 | 26 | 2012 |
Exascale algorithms for generalized MPI_Comm_split A Moody, DH Ahn, BR de Supinski European MPI Users' Group Meeting, 9-18, 2011 | 26 | 2011 |