팔로우
Sri Garimella
제목
인용
인용
연도
Analysis of MLP-Based Hierarchical Phoneme Posterior Probability Estimator
J Pinto, S Garimella, M Magimai-Doss, H Hermansky, H Bourlard
Audio, Speech, and Language Processing, IEEE Transactions on 19 (2), 225-241, 2011
1042011
Sparse coding for speech recognition
GSVS Sivaram, SK Nemala, M Elhilali, TD Tran, H Hermansky
2010 IEEE International Conference on Acoustics, Speech and Signal …, 2010
942010
Sparse multilayer perceptron for phoneme recognition
GSVS Sivaram, H Hermansky
IEEE Transactions on Audio, Speech, and Language Processing 20 (1), 23-29, 2011
912011
A design methodology for selection and placement of sensors in multimedia surveillance systems
G Sivaram, KR Ramakrishnan, PK Atrey, VK Singh, MS Kankanhalli
Proceedings of the 4th ACM international workshop on Video surveillance and …, 2006
652006
Speech recognitionwith segmental conditional random fields: A summary of the JHU CLSP 2010 summer workshop
G Zweig, P Nguyen, D Van Compernolle, K Demuynck, L Atlas, P Clark, ...
2011 IEEE International Conference on Acoustics, Speech and Signal …, 2011
622011
Robust i-vector based adaptation of DNN acoustic model for speech recognition
S Garimella, A Mandal, N Ström, B Hoffmeister, S Matsoukas, ...
592015
fMLLR based feature-space speaker adaptation of DNN acoustic models
SHK Parthasarathi, B Hoffmeister, S Matsoukas, A Mandal, N Ström, ...
422015
Improving ASR confidence scores for Alexa using acoustic and hypothesis embeddings
P Swarup, R Maas, S Garimella, SH Mallidi, B Hoffmeister
342019
Multilayer perceptron with sparse hidden outputs for phoneme recognition
GSVS Sivaram, H Hermansky
2011 IEEE international conference on acoustics, speech and signal …, 2011
222011
Data-driven and feedback based spectro-temporal features for speech recognition
GSVS Sivaram, SK Nemala, N Mesgarani, H Hermansky
IEEE Signal Processing Letters 17 (11), 957-960, 2010
212010
Streaming end-to-end bilingual asr systems with joint language identification
S Punjabi, H Arsikere, Z Raeesy, C Chandak, N Bhave, A Bansal, ...
arXiv preprint arXiv:2007.03900, 2020
192020
Design of multimedia surveillance systems
GSVS Sivaram, MS Kankanhalli, KR Ramakrishnan
ACM Transactions on Multimedia Computing, Communications, and Applications …, 2009
192009
Mixture of auto-associative neural networks for speaker verification
GSVS Sivaram, S Thomas, H Hermansky
Twelfth Annual Conference of the International Speech Communication Association, 2011
182011
Multi-dialect acoustic modeling using phone mapping and online i-vectors
H Arsikere, A Sapru, S Garimella
172019
Factor analysis of auto-associative neural networks with application in speaker verification
S Garimella, H Hermansky
IEEE transactions on neural networks and learning systems 24 (4), 522-528, 2013
162013
Generative modeling of speech using neural networks
S Matsoukas, N Ström, A Rastrow, SVSSR Krishna
US Patent 9,653,093, 2017
152017
Joint ASR and language identification using RNN-T: An efficient approach to dynamic language switching
S Punjabi, H Arsikere, Z Raeesy, C Chandak, N Bhave, A Bansal, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
132021
Regularized auto-associative neural networks for speaker verification
S Garimella, SH Mallidi, H Hermansky
IEEE Signal Processing Letters 19 (12), 841-844, 2012
112012
The UMD-JHU 2011 speaker recognition system
D Garcia-Romero, X Zhou, D Zotkin, B Srinivasan, Y Luo, S Ganapathy, ...
2012 IEEE International Conference on Acoustics, Speech and Signal …, 2012
112012
Discriminant spectrotemporal features for phoneme recognition
N Mesgarani, GSVS Sivaram, SK Nemala, M Elhilali, H Hermansky
Tenth Annual Conference of the International Speech Communication Association, 2009
112009
현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.
학술자료 1–20