Akshay Venkatesh
Akshay Venkatesh
NVIDIA; Ohio State University
Verified email at nvidia.com - Homepage
Cited by
Cited by
Efficient inter-node MPI communication using GPUDirect RDMA for InfiniBand clusters with NVIDIA GPUs
S Potluri, K Hamidouche, A Venkatesh, D Bureddy, DK Panda
2013 42nd International Conference on Parallel Processing, 80-89, 2013
Disc drive apparatus
T Hayashi
US Patent 6,963,521, 2005
MVAPICH-PRISM: A proxy-based communication framework using InfiniBand and SCIF for Intel MIC clusters
S Potluri, D Bureddy, K Hamidouche, A Venkatesh, K Kandalla, ...
Proceedings of the International Conference on High Performance Computing …, 2013
Efficient intra-node communication on intel-mic clusters
S Potluri, A Venkatesh, D Bureddy, K Kandalla, DK Panda
2013 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid …, 2013
Method and system for a virtual local area network to span multiple loop free network topology domains
ST Merchant, BW Bailey
US Patent 7,154,861, 2006
A case for application-oblivious energy-efficient MPI runtime
A Venkatesh, A Vishnu, K Hamidouche, N Tallent, D Panda, D Kerbyson, ...
SC'15: Proceedings of the International Conference for High Performance …, 2015
Efficient large message broadcast using NCCL and CUDA-aware MPI for deep learning
AA Awan, K Hamidouche, A Venkatesh, DK Panda
Proceedings of the 23rd European MPI Users' Group Meeting, 15-22, 2016
Omb-gpu: A micro-benchmark suite for evaluating mpi libraries on gpu clusters
D Bureddy, H Wang, A Venkatesh, S Potluri, DK Panda
European MPI Users' Group Meeting, 110-120, 2012
Designing optimized mpi broadcast and allreduce for many integrated core (mic) infiniband clusters
K Kandalla, A Venkatesh, K Hamidouche, S Potluri, D Bureddy, DK Panda
2013 IEEE 21st Annual Symposium on High-Performance Interconnects, 63-70, 2013
Designing MPI library with dynamic connected transport (DCT) of InfiniBand: early experiences
H Subramoni, K Hamidouche, A Venkatesh, S Chakraborty, DK Panda
International Supercomputing Conference, 278-295, 2014
MPI-based parallel synchronous vector evaluated particle swarm optimization for multi-objective design optimization of composite structures
SN Omkar, A Venkatesh, M Mudigere
Engineering Applications of Artificial Intelligence 25 (8), 1611-1627, 2012
Direct injection diesel engine
N Shimazaki
US Patent 6,840,209, 2005
Power-check: An energy-efficient checkpointing framework for HPC clusters
RR Chandrasekar, A Venkatesh, K Hamidouche, DK Panda
2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2015
UPC on MIC: Early experiences with native and symmetric modes
M Luo, M Li, A Venkatesh, X Lu, DK Panda
7th International Conference on PGAS Programming Models, 198, 2013
Evaluation of energy characteristics of mpi communication primitives with rapl
A Venkatesh, K Kandalla, DK Panda
2013 IEEE International Symposium on Parallel & Distributed Processing …, 2013
Dual-stage gas generator utilizing eco-friendly gas generant formulation
S Daoud
US Patent 6,877,435, 2005
Cuda kernel based collective reduction operations on large-scale gpu clusters
CH Chu, K Hamidouche, A Venkatesh, AA Awan, DK Panda
2016 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2016
MIC-Check: A distributed check pointing framework for the Intel many integrated cores architecture
R Rajachandrasekar, S Potluri, A Venkatesh, K Hamidouche, ...
Proceedings of the 23rd international symposium on High-performance parallel …, 2014
A comprehensive performance evaluation of OpenSHMEM libraries on InfiniBand clusters
J Jose, J Zhang, A Venkatesh, S Potluri, DKDK Panda
Workshop on OpenSHMEM and Related Technologies, 14-28, 2014
Exploiting GPUDirect RDMA in designing high performance OpenSHMEM for NVIDIA GPU clusters
K Hamidouche, A Venkatesh, AA Awan, H Subramoni, CH Chu, ...
2015 IEEE International Conference on Cluster Computing, 78-87, 2015
The system can't perform the operation now. Try again later.
Articles 1–20