Throughput-optimized OpenCL-based FPGA accelerator for large-scale convolutional neural networks N Suda, V Chandra, G Dasika, A Mohanty, Y Ma, S Vrudhula, J Seo, ... Proceedings of the 2016 ACM/SIGDA international symposium on field …, 2016 | 683 | 2016 |
Optimizing loop operation and dataflow in FPGA acceleration of deep convolutional neural networks Y Ma, Y Cao, S Vrudhula, J Seo Proceedings of the 2017 ACM/SIGDA International Symposium on Field …, 2017 | 438 | 2017 |
Optimizing the convolution operation to accelerate deep neural networks on FPGA Y Ma, Y Cao, S Vrudhula, J Seo IEEE Transactions on Very Large Scale Integration (VLSI) Systems 26 (7 …, 2018 | 336 | 2018 |
Scalable and modularized RTL compilation of convolutional neural networks onto FPGA Y Ma, N Suda, Y Cao, J Seo, S Vrudhula 2016 26th international conference on field programmable logic and …, 2016 | 209 | 2016 |
An automatic RTL compiler for high-throughput FPGA implementation of diverse deep convolutional neural networks Y Ma, Y Cao, S Vrudhula, J Seo 2017 27th International Conference on Field Programmable Logic and …, 2017 | 169 | 2017 |
ALAMO: FPGA acceleration of deep learning algorithms with a modularized RTL compiler Y Ma, N Suda, Y Cao, S Vrudhula, J Seo Integration 62, 14-23, 2018 | 113 | 2018 |
End-to-end scalable FPGA accelerator for deep residual networks Y Ma, M Kim, Y Cao, S Vrudhula, J Seo 2017 IEEE International Symposium on Circuits and Systems (ISCAS), 1-4, 2017 | 68 | 2017 |
Performance modeling for CNN inference accelerators on FPGA Y Ma, Y Cao, S Vrudhula, JS Seo IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2019 | 63 | 2019 |
Automatic compilation of diverse CNNs onto high-performance FPGA accelerators Y Ma, Y Cao, S Vrudhula, J Seo IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2018 | 55 | 2018 |
Automatic compiler based FPGA accelerator for CNN training SK Venkataramanaiah, Y Ma, S Yin, E Nurvithadhi, A Dasu, Y Cao, J Seo 2019 29th International Conference on Field Programmable Logic and …, 2019 | 54 | 2019 |
Algorithm-hardware co-design of single shot detector for fast object detection on FPGAs Y Ma, T Zheng, Y Cao, S Vrudhula, J Seo 2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 1-8, 2018 | 34 | 2018 |
A flexible and efficient FPGA accelerator for various large-scale and lightweight CNNs X Wu, Y Ma, M Wang, Z Wang IEEE Transactions on Circuits and Systems I: Regular Papers 69 (3), 1185-1198, 2021 | 33 | 2021 |
7.8 A 22nm delta-sigma computing-in-memory (Δ∑ CIM) SRAM macro with near-zero-mean outputs and LSB-first ADCs achieving 21.38 TOPS/W for 8b-MAC edge AI processing P Chen, M Wu, W Zhao, J Cui, Z Wang, Y Zhang, Q Wang, J Ru, L Shen, ... 2023 IEEE International Solid-State Circuits Conference (ISSCC), 140-142, 2023 | 18 | 2023 |
An efficient FPGA accelerator optimized for high throughput sparse CNN inference J Wen, Y Ma, Z Wang 2020 IEEE Asia Pacific Conference on Circuits and Systems (APCCAS), 165-168, 2020 | 15 | 2020 |
Efficient hardware post processing of anchor-based object detection on FPGA H Zhang, W Wu, Y Ma, Z Wang 2020 IEEE Computer Society Annual Symposium on VLSI (ISVLSI), 580-585, 2020 | 15 | 2020 |
Efficient network construction through structural plasticity X Du, Z Li, Y Ma, Y Cao IEEE Journal on Emerging and Selected Topics in Circuits and Systems 9 (3 …, 2019 | 15 | 2019 |
In-memory computing: The next-generation ai computing paradigm Y Ma, Y Du, L Du, J Lin, Z Wang Proceedings of the 2020 on Great Lakes Symposium on VLSI, 265-270, 2020 | 13 | 2020 |
Hybrid stochastic-binary computing for low-latency and high-precision inference of CNNs Z Chen, Y Ma, Z Wang IEEE Transactions on Circuits and Systems I: Regular Papers 69 (7), 2707-2720, 2022 | 10 | 2022 |
Small-world-based structural pruning for efficient FPGA inference of deep neural networks G Krishnan, Y Ma, Y Cao 2020 IEEE 15th International Conference on Solid-State & Integrated Circuit …, 2020 | 10 | 2020 |
Research progress on low-power artificial intelligence of things (AIoT) chip design L Ye, Z Wang, T Jia, Y Ma, L Shen, Y Zhang, H Li, P Chen, M Wu, Y Liu, ... Science China Information Sciences 66 (10), 200407, 2023 | 5 | 2023 |