John Pennycook
Title
Cited by
Year
Exploring SIMD for Molecular Dynamics, Using Intel Xeon Processors and Intel Xeon Phi Coprocessors
SJ Pennycook, CJ Hughes, M Smelyanskiy, SA Jarvis
IEEE International Parallel & Distributed Processing Symposium, 2013
Cited by: 185; Year: 2013
Performance analysis of a hybrid MPI/CUDA implementation of the NAS-LU benchmark
SJ Pennycook, SD Hammond, SA Jarvis, GR Mudalige
ACM SIGMETRICS Performance Evaluation Review 38 (4), 23-29, 2011
Cited by: 83; Year: 2011
An investigation of the performance portability of OpenCL
SJ Pennycook, SD Hammond, SA Wright, JA Herdman, I Miller, SA Jarvis
Journal of Parallel and Distributed Computing 73 (11), 1439-1450, 2013
Cited by: 74; Year: 2013
CosmoFlow: Using deep learning to learn the universe at scale
A Mathuriya, D Bard, P Mendygral, L Meadows, J Arnemann, L Shao, ...
SC18: International Conference for High Performance Computing, Networking …, 2018
Cited by: 51; Year: 2018
A metric for performance portability
SJ Pennycook, JD Sewall, VW Lee
arXiv preprint arXiv:1611.07409, 2016
Cited by: 40; Year: 2016
Implications of a metric for performance portability
SJ Pennycook, JD Sewall, VW Lee
Future Generation Computer Systems 92, 947-958, 2019
Cited by: 36; Year: 2019
Parallel file system analysis through application I/O tracing
SA Wright, SD Hammond, SJ Pennycook, RF Bird, JA Herdman, I Miller, ...
The Computer Journal 56 (2), 141-155, 2013
Cited by: 34; Year: 2013
On the acceleration of wavefront applications using distributed many-core architectures
SJ Pennycook, SD Hammond, GR Mudalige, SA Wright, SA Jarvis
The Computer Journal 55 (2), 138-153, 2012
Cited by: 29; Year: 2012
Developing performance-portable molecular dynamics kernels in OpenCL
SJ Pennycook, SA Jarvis
2012 SC Companion: High Performance Computing, Networking Storage and …, 2012
Cited by: 22; Year: 2012
Effective performance portability
SL Harrell, J Kitson, R Bird, SJ Pennycook, J Sewall, D Jacobsen, ...
2018 IEEE/ACM International Workshop on Performance, Portability and …, 2018
Cited by: 13; Year: 2018
LDPLFS: Improving I/O performance without application modification
SA Wright, SD Hammond, SJ Pennycook, I Miller, JA Herdman, SA Jarvis
2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012
Cited by: 9; Year: 2012
Light-weight parallel I/O analysis at scale
SA Wright, SD Hammond, SJ Pennycook, SA Jarvis
European Performance Engineering Workshop, 235-249, 2011
Cited by: 7; Year: 2011
Evaluating the impact of proposed OpenMP 5.0 features on performance, portability and productivity
SJ Pennycook, JD Sewall, JR Hammond
2018 IEEE/ACM International Workshop on Performance, Portability and …, 2018
Cited by: 6; Year: 2018
WMTrace - A Lightweight Memory Allocation Tracker and Analysis Framework
OFJ Perks, SD Hammond, SJ Pennycook, SA Jarvis
Cited by: 6; Year: 2011
Unveiling the Early Universe: Optimizing Cosmology Workloads for Intel Xeon Phi Coprocessors in an SGI UV2000 System
J Briggs, SJ Pennycook, EPS Shellard, C Martins, M Woodacre, K Feind
Tech. Rep. (SGI/Intel White Paper, 2014), 2014
Cited by: 5; Year: 2014
Towards a portable and future-proof particle-in-cell plasma physics code
RF Bird, SJ Pennycook, SA Wright, SA Jarvis
Cited by: 5; Year: 2013
A modern memory management system for OpenMP
JD Sewall, SJ Pennycook, A Duran, X Tian, R Narayanaswamy
2016 Third Workshop on Accelerator Programming Using Directives (WACCPD), 25-35, 2016
Cited by: 4; Year: 2016
Separable projection integrals for higher-order correlators of the cosmic microwave sky: Acceleration by factors exceeding 100
JP Briggs, SJ Pennycook, JR Fergusson, J Jäykkä, EPS Shellard
Journal of Computational Physics 310, 285-300, 2016
Cited by: 4; Year: 2016
Model-led optimisation of a geometric multigrid application
R Bunt, S Pennycook, S Jarvis, L Lapworth, Y Ho
2013 IEEE 10th International Conference on High Performance Computing and …, 2013
Cited by: 4; Year: 2013
WMTools - Assessing parallel application memory utilisation at scale
O Perks, SD Hammond, SJ Pennycook, SA Jarvis
European Performance Engineering Workshop, 148-162, 2011
Cited by: 4; Year: 2011