Performance characterization of dnn training using tensorflow and pytorch on modern clusters A Jain, AA Awan, Q Anthony, H Subramoni, DKDK Panda 2019 IEEE International Conference on Cluster Computing (CLUSTER), 1-11, 2019 | 8 | 2019 |
Hypar-flow: Exploiting mpi and keras for scalable hybrid-parallel dnn training using tensorflow AA Awan, A Jain, Q Anthony, H Subramoni, DK Panda arXiv preprint arXiv:1911.05146, 2019 | 5 | 2019 |
GEMS: GPU-Enabled Memory-Aware Model-Parallelism System for Distributed DNN Training A Jain, A Awan, A Aljuhani, J Hashmi, Q Anthony, H Subramoni, D Panda, ... 2020 SC20: International Conference for High Performance Computing …, 2020 | | 2020 |
HyPar-Flow: Exploiting MPI and Keras for Scalable Hybrid-Parallel DNN Training with TensorFlow AA Awan, A Jain, Q Anthony, H Subramoni, DK Panda International Conference on High Performance Computing, 83-103, 2020 | | 2020 |
Efficient Training of Semantic Image Segmentation on Summit using Horovod and MVAPICH2-GDR Q Anthony, AA Awan, A Jain, H Subramoni, DKDK Panda 2020 IEEE International Parallel and Distributed Processing Symposium …, 2020 | | 2020 |
Accelerating GPU-based Machine Learning in Python using MPI Library: A Case Study with MVAPICH2-GDR SM Ghazimirsaeed, Q Anthony, A Shafi, H Subramoni, DKDK Panda | | |