Jaewoong Sim

Cited by

	All	Since 2019
Citations	3165	2251
h-index	20	18
i10-index	24	24

Citations per year

Year	Citations
2012	14
2013	44
2014	71
2015	95
2016	121
2017	189
2018	336
2019	377
2020	391
2021	482
2022	414
2023	339
2024	248

Public access

4 articles available
2 articles not available

Based on funding mandates

Co-authors

Hyesoon Kim	Georgia Tech	Verified email at cc.gatech.edu
Gabriel H. Loh	AMD Research and Advanced Development (RAD)	Verified email at amd.com
Asit Mishra	Nvidia	Verified email at nvidia.com
Srivatsan Krishnan	Harvard University	Verified email at seas.harvard.edu
Mike O'Connor	NVIDIA Research	Verified email at nvidia.com
Lifeng Nai	Google	Verified email at google.com
Chris Wilkerson	Intel	Verified email at intel.com
Alaa R. Alameldeen	Simon Fraser University	Verified email at cs.sfu.ca
Zeshan Chishti	Staff Research Scientist, Intel Corporation	Verified email at intel.com
Mithuna Thottethodi	Purdue University	Verified email at purdue.edu
Philip H.W. Leong	Professor of Computer Systems, The University of Sydney	Verified email at sydney.edu.au
Vilas Sridharan	AMD, Inc.	Verified email at amd.com
Richard Vuduc	Georgia Institute of Technology	Verified email at cc.gatech.edu
Moinuddin Qureshi	Professor, Georgia Institute of Technology	Verified email at gatech.edu
Jaekyu Lee	Arm Research	Verified email at arm.com

Jaewoong Sim

Seoul National University

Verified email at snu.ac.kr

Computer Architecture, Machine Learning


Title	Authors	Venue	Cited by	Year
Can FPGAs beat GPUs in accelerating next-generation deep neural networks?	E Nurvitadhi, G Venkatesh, J Sim, D Marr, R Huang, J Ong Gee Hock, ...	Proceedings of the 2017 ACM/SIGDA international symposium on field …, 2017	602	2017
Accelerating binarized neural networks: Comparison of FPGA, CPU, GPU, and ASIC	E Nurvitadhi, D Sheffield, J Sim, A Mishra, G Venkatesh, D Marr	2016 International Conference on Field-Programmable Technology (FPT), 77-84, 2016	422	2016
GraphPIM: Enabling instruction-level PIM offloading in graph computing frameworks	L Nai, R Hadidi, J Sim, H Kim, P Kumar, H Kim	2017 IEEE International symposium on high performance computer architecture …, 2017	345	2017
A performance analysis framework for identifying potential benefits in GPGPU applications	J Sim, A Dasgupta, H Kim, R Vuduc	Proceedings of the 17th ACM SIGPLAN Annual Symposium on Principles and …, 2012	270	2012
Accelerating recurrent neural networks in analytics servers: Comparison of FPGA, CPU, GPU, and ASIC	E Nurvitadhi, J Sim, D Sheffield, A Mishra, S Krishnan, D Marr	2016 26th International Conference on Field Programmable Logic and …, 2016	248	2016
Transparent hardware management of stacked DRAM as part of memory	J Sim, AR Alameldeen, Z Chishti, C Wilkerson, H Kim	2014 47th Annual IEEE/ACM International Symposium on Microarchitecture, 13-24, 2014	153	2014
A mostly-clean DRAM cache for effective hit speculation and self-balancing dispatch	J Sim, GH Loh, H Kim, M O'Connor, M Thottethodi	2012 45th Annual IEEE/ACM International Symposium on Microarchitecture, 247-257, 2012	130	2012
Dynamically configuring regions of a main memory in a write-back mode or a write-through mode	J Sim, MS Thottethodi, GH Loh	US Patent 9,552,294, 2017	109	2017
A customizable matrix multiplication framework for the Intel HARPv2 Xeon+FPGA platform: A deep learning case study	DJM Moss, S Krishnan, E Nurvitadhi, P Ratuszniak, C Johnson, J Sim, ...	Proceedings of the 2018 ACM/SIGDA International Symposium on Field …, 2018	104	2018
High performance binary neural networks on the Xeon+FPGA™ platform	DJM Moss, E Nurvitadhi, J Sim, A Mishra, D Marr, S Subhaschandra, ...	2017 27th International Conference on Field Programmable Logic and …, 2017	99	2017
MacSim: A CPU-GPU heterogeneous simulation framework user guide	H Kim, J Lee, NB Lakshminarayana, J Sim, J Lim, T Pho	Georgia Institute of Technology, 1-57, 2012	95	2012
BSSync: Processing near memory for machine learning workloads with bounded staleness consistency models	JH Lee, J Sim, H Kim	2015 International Conference on Parallel Architecture and Compilation (PACT …, 2015	85	2015
Batch-aware unified memory management in GPUs for irregular workloads	H Kim, J Sim, P Gera, R Hadidi, H Kim	Proceedings of the Twenty-Fifth International Conference on Architectural …, 2020	77	2020
Why compete when you can work together: FPGA-ASIC integration for persistent RNNs	E Nurvitadhi, D Kwon, A Jafari, A Boutros, J Sim, P Tomson, H Sumbul, ...	2019 IEEE 27th Annual International Symposium on Field-Programmable Custom …, 2019	70	2019
Resilient die-stacked DRAM caches	J Sim, GH Loh, V Sridharan, M O'Connor	ACM SIGARCH Computer Architecture News 41 (3), 416-427, 2013	70	2013
FLEXclusion: Balancing cache capacity and on-chip bandwidth via flexible exclusion	J Sim, J Lee, MK Qureshi, H Kim	ACM SIGARCH Computer Architecture News 40 (3), 321-332, 2012	56	2012
Partitioning caches for sub-entities in computing devices	GH Loh, J Sim	US Patent 9,098,417, 2015	36	2015
Method and apparatus for implementing a heterogeneous memory subsystem	CB Wilkerson, AR Alameldeen, ZA Chishti, J Sim	US Patent 9,472,248, 2016	27	2016
CoolPIM: Thermal-aware source throttling for efficient PIM instruction offloading	L Nai, R Hadidi, H Xiao, H Kim, J Sim, H Kim	2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2018	24	2018
Specializing FGPU for persistent deep learning	R Ma, JC Hsu, T Tan, E Nurvitadhi, D Sheffield, R Pelt, M Langhammer, ...	ACM Transactions on Reconfigurable Technology and Systems (TRETS) 14 (2), 1-23, 2021	22	2021
