팔로우
Tze Meng Low
제목
인용
인용
연도
Analytical modeling is enough for high-performance BLIS
TM Low, FD Igual, TM Smith, ES Quintana-Orti
ACM Transactions on Mathematical Software (TOMS) 43 (2), 1-18, 2016
1362016
The BLIS framework: Experiments in portability
FG Van Zee, TM Smith, B Marker, TM Low, RAVD Geijn, FD Igual, ...
ACM Transactions on Mathematical Software (TOMS) 42 (2), 1-19, 2016
1062016
3D-stacked memory-side acceleration: Accelerator and system design
Q Guo, N Alachiotis, B Akin, F Sadi, G Xu, TM Low, L Pileggi, JC Hoe, ...
WoNDP, 2014
1042014
A unified coded deep neural network training strategy based on generalized polydot codes
S Dutta, Z Bai, H Jeong, TM Low, P Grover
2018 IEEE International Symposium on Information Theory (ISIT), 1585-1589, 2018
972018
SPIRAL: Extreme performance portability
F Franchetti, TM Low, DT Popovici, RM Veras, DG Spampinato, ...
Proceedings of the IEEE 106 (11), 1935-1968, 2018
922018
High performance zero-memory overhead direct convolutions
J Zhang, F Franchetti, TM Low
Proceedings of the 35th International Conference on Machine Learning 80 …, 2018
642018
An API for manipulating matrices stored by blocks
TM Low, RA Van de Geijn, FW Note
Computer Science Department, University of Texas at Austin, 2004
612004
Exploiting symmetry in tensors for high performance: Multiplication with symmetric tensors
MD Schatz, TM Low, RA van de Geijn, TG Kolda
SIAM Journal on Scientific Computing 36 (5), C453-C479, 2014
582014
Efficient spmv operation for large and highly sparse matrices using scalable multi-way merge parallelization
F Sadi, J Sweeney, TM Low, JC Hoe, L Pileggi, F Franchetti
Proceedings of the 52nd Annual IEEE/ACM International Symposium on …, 2019
552019
Accumulating Householder transformations, revisited
T Joffrain, TM Low, ES Quintana-Ortí, R Geijn, FGV Zee
ACM Transactions on Mathematical Software (TOMS) 32 (2), 169-179, 2006
542006
Scalable parallelization of FLAME code via the workqueuing model
FG Van Zee, P Bientinesi, TM Low, RA Van De Geijn
ACM Transactions on Mathematical Software (TOMS) 34 (2), 2008
292008
Analytical cache modeling and tilesize optimization for tensor contractions
R Li, A Sukumaran-Rajam, R Veras, TM Low, F Rastello, A Rountev, ...
Proceedings of the International Conference for High Performance Computing …, 2019
242019
High-assurance SPIRAL: End-to-end guarantees for robot and car control
F Franchetti, TM Low, S Mitsch, JP Mendoza, L Gui, A Phaosawasdi, ...
IEEE Control Systems Magazine 37 (2), 82-103, 2017
242017
First look: Linear algebra-based triangle counting without matrix multiplication
TM Low, VN Rao, M Lee, D Popovici, F Franchetti, S McMillan
2017 IEEE High Performance Extreme Computing Conference (HPEC), 1-6, 2017
232017
Evaluation of graph analytics frameworks using the gap benchmark suite
A Azad, MM Aznaveh, S Beamer, M Blanco, J Chen, L D'Alessandro, ...
2020 IEEE International Symposium on Workload Characterization (IISWC), 216-227, 2020
212020
CodeNet: Training large scale neural networks in presence of soft-errors
S Dutta, Z Bai, TM Low, P Grover
arXiv preprint arXiv:1903.01042, 2019
212019
FFTX and SpectralPack: A first look
F Franchetti, DG Spampinato, A Kulkarni, DT Popovici, TM Low, ...
2018 IEEE 25th International Conference on High Performance Computing …, 2018
212018
Masterless coded computing: A fully-distributed coded FFT algorithm
H Jeong, TM Low, P Grover
2018 56th Annual Allerton Conference on Communication, Control, and …, 2018
212018
Linear algebraic formulation of edge-centric k-truss algorithms with adjacency matrices
TM Low, DG Spampinato, A Kutuluru, U Sridhar, DT Popovici, F Franchetti, ...
2018 IEEE High Performance extreme Computing Conference (HPEC), 1-7, 2018
202018
Large bandwidth-efficient FFTs on multicore and multi-socket systems
DT Popovici, TM Low, F Franchetti
2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2018
202018
현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.
학술자료 1–20