Geonhwa Jeong
Research Scientist, Meta
Verified email at meta.com
Title
Cited by
Year
ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning
SC Kao, G Jeong, T Krishna
International Symposium on Microarchitecture (MICRO), 622-636, 2020
Cited by 125, 2020
TurboFlux: A fast continuous subgraph matching system for streaming graph data
K Kim, I Seo, WS Han, JH Lee, S Hong, H Chafi, H Shin, G Jeong
International Conference on Management of Data (SIGMOD), 411-426, 2018
Cited by 71, 2018
GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM
H Kang, Q Zhang, S Kundu, G Jeong, Z Liu, T Krishna, T Zhao
arXiv preprint arXiv:2403.05527, 2024
Cited by 48, 2024
Evaluating Spatial Accelerator Architectures with Tiled Matrix-Matrix Multiplication
GE Moon, H Kwon, G Jeong, P Chatarasi, S Rajamanickam, T Krishna
IEEE Transactions on Parallel and Distributed Systems (TPDS), 2021
Cited by 27, 2021
Extending Sparse Tensor Accelerators to Support Multiple Compression Formats
E Qin, G Jeong, W Won, SC Kao, H Kwon, S Srinivasan, D Das, GE Moon, ...
International Parallel & Distributed Processing Symposium (IPDPS), 2021
Cited by 24, 2021
VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs
G Jeong, S Damani, AR Bambhaniya, E Qin, CJ Hughes, S Subramoney, ...
International Symposium on High-Performance Computer Architecture (HPCA), 2023
Cited by 20, 2023
Union: A Unified HW-SW Co-Design Ecosystem in MLIR for Evaluating Tensor Operations on Spatial Accelerators
G Jeong, G Kestor, P Chatarasi, A Parashar, PA Tsai, S Rajamanickam, ...
International Conference on Parallel Architectures and Compilation …, 2021
Cited by 19, 2021
RASA: Efficient Register-Aware Systolic Array Matrix Engine for CPU
G Jeong, E Qin, A Samajdar, CJ Hughes, S Subramoney, H Kim, ...
Design Automation Conference (DAC), 2021
Cited by 19, 2021
Algorithm-Hardware Co-Design of Distribution-Aware Logarithmic-Posit Encodings for Efficient DNN Inference
A Ramachandran, Z Wan, G Jeong, J Gustafson, T Krishna
Design Automation Conference (DAC), 2024
Cited by 10, 2024
Demystifying Platform Requirements for Diverse LLM Inference Use Cases
A Bambhaniya, R Raj, G Jeong, S Kundu, S Srinivasan, M Elavazhagan, ...
arXiv preprint arXiv:2406.01698, 2024
Cited by 7, 2024
Characterization of Data Compression in Datacenters
G Jeong, B Sharma, N Terrell, A Dhanotia, Z Zhao, N Agarwal, ...
International Symposium on Performance Analysis of Systems and Software …, 2023
Cited by 3, 2023
SDQ: Sparse Decomposed Quantization for LLM Inference
G Jeong, PA Tsai, SW Keckler, T Krishna
arXiv preprint arXiv:2406.13868, 2024
Cited by 1, 2024
Understanding Data Compression in Warehouse-Scale Datacenter Services
G Jeong, B Sharma, N Terrell, A Dhanotia, Z Zhao, N Agarwal, ...
International Symposium on Performance Analysis of Systems and Software …, 2022
Cited by 1, 2022
Bridging the Frequency Gap in Heterogeneous 3D SoCs through Technology-Specific NoC Router Architectures
JM Joseph, L Bamberg, G Jeong, RT Chien, R Leupers, A García-Ortiz, ...
Asia and South Pacific Design Automation Conference (ASP-DAC), 197–203, 2021
Cited by 1, 2021
Generating sparse neural networks
G Jeong, PA Tsai, JM Pool
US Patent US20240152407A1, 2024
2024
Abstracting Sparse DNN Acceleration via Structured Sparse Tensor Decomposition
G Jeong, PA Tsai, AR Bambhaniya, SW Keckler, T Krishna
arXiv preprint arXiv:2403.07953, 2024
2024
Understanding Performance Implications of LLM Inference on CPUs
S Na, G Jeong, BH Ahn, J Young, T Krishna, H Kim
International Symposium on Workload Characterization (IISWC), 2024
2024