Kai Zhen
Alexa Speech
Verified email at amazon.com
Title
Cited by
Year
Cascaded cross-module residual learning towards lightweight end-to-end speech coding
K Zhen, J Sung, MS Lee, S Beack, M Kim
arXiv preprint arXiv:1906.07769, 2019
Cited by 35, 2019
Psychoacoustic calibration of loss functions for efficient end-to-end neural audio coding
K Zhen, MS Lee, J Sung, S Beack, M Kim
IEEE Signal Processing Letters 27, 2159-2163, 2020
Cited by 22, 2020
Efficient and scalable neural residual waveform coding with collaborative quantization
K Zhen, MS Lee, J Sung, S Beack, M Kim
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by 15, 2020
Scalable and efficient neural speech coding: A hybrid design
K Zhen, J Sung, MS Lee, S Beack, M Kim
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 12-25, 2021
Cited by 12*, 2021
Source-aware neural speech coding for noisy speech compression
H Yang, K Zhen, S Beack, M Kim
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by 11, 2021
Sparsification via compressed sensing for automatic speech recognition
K Zhen, HD Nguyen, FJ Chang, A Mouchtaris, A Rastrow
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by 11, 2021
On psychoacoustically weighted cost functions towards resource-efficient deep neural networks for speech denoising
K Zhen, A Sivaraman, J Sung, M Kim
arXiv preprint arXiv:1801.09774, 2018
Cited by 9, 2018
Audio signal encoding method and apparatus and audio signal decoding method and apparatus using psychoacoustic-based weighted error function
J Sung, M Kim, A Sivaraman, K Zhen
US Patent 11,416,742, 2022
Cited by 7, 2022
A dual-staged context aggregation method towards efficient end-to-end speech enhancement
K Zhen, MS Lee, M Kim
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by 7*, 2020
Sub-8-bit quantization aware training for 8-bit neural network accelerator with on-device speech recognition
K Zhen, HD Nguyen, R Chinta, N Susanj, A Mouchtaris, T Afzal, ...
arXiv preprint arXiv:2206.15408, 2022
Cited by 5, 2022
Sub-8-bit quantization for on-device speech recognition: a regularization-free approach
K Zhen, M Radfar, H Nguyen, GP Strimel, N Susanj, A Mouchtaris
2022 IEEE Spoken Language Technology Workshop (SLT), 15-22, 2023
Cited by 4, 2023
A functional flavor of service composition
L Bao, Q Li, K Zhen, W Xiang, P Chen
2011 Eighth International Conference on Fuzzy Systems and Knowledge …, 2011
Cited by 2, 2011
Audio signal encoding method and audio signal decoding method, and encoder and decoder performing the same
MS Lee, J Sung, M Kim, K Zhen
US Patent 11,276,413, 2022
Cited by 1, 2022
Apparatus and method for speech processing using a densely connected hybrid neural network
M Kim, MS Lee, SK Beack, J Sung, TJ Lee, JS Choi, K Zhen
US Patent App. 17/308,800, 2021
Cited by 1, 2021
Hybrid supervised-unsupervised image topic visualization with convolutional neural network and LDA. arXiv
K Zhen, M Birla, D Crandall, B Zhang, J Qiu
Cited by 1, 2017
Conmer: Streaming Conformer without self-attention for interactive voice assistants
M Radfar, P Lyskawa, B Trujillo, Y Xie, K Zhen, J Heymann, D Filimonov, ...
2023
Residual coding method of linear prediction coding coefficient based on collaborative quantization, and computing device for performing the method
M Kim, K Zhen, MS Lee, SK Beack, J Sung, TJ Lee, JS Choi
US Patent 11,488,613, 2022
2022
Method and apparatus for processing audio signal
MS Lee, SK Beack, J Sung, TJ Lee, JS Choi, M Kim, K Zhen
US Patent App. 17/156,006, 2021
2021
Neural Waveform Coding: Scalability, Efficiency and Psychoacoustic Calibration
K Zhen
Indiana University, 2021
2021
A Hybrid Supervised-unsupervised Method on Image Topic Visualization with Convolutional Neural Network and LDA
K Zhen, M Birla, D Crandall, B Zhang, J Qiu
arXiv preprint arXiv:1703.05243, 2017
2017