Improved Vocal Tract Length Perturbation for a State-of-the-Art End-to-End Speech Recognition System. C Kim, M Shin, A Garg, D Gowda Interspeech, 739-743, 2019 | 48 | 2019 |
A review of on-device fully neural end-to-end automatic speech recognition algorithms C Kim, D Gowda, D Lee, J Kim, A Kumar, S Kim, A Garg, C Han ACSSC 2020: Asilomar Conference on Signals, Systems, and Computers, 2020 | 39 | 2020 |
end-to-end training of a large vocabulary end-to-end speech recognition system C Kim, S Kim, K Kim, M Kumar, J Kim, K Lee, C Han, A Garg, E Kim, ... ASRU 2019 : IEEE Workshop on Automatic Speech Recognition & Understanding, 2019 | 30 | 2019 |
Utterance Confidence Measure for End-to-End Speech Recognition with Applications to Distributed Speech Recognition Scenarios. A Kumar, S Singh, D Gowda, A Garg, S Singh, C Kim Interspeech 2020, 4357-4361, 2020 | 26 | 2020 |
Improved multi-stage training of online attention-based encoder-decoder models A Garg, D Gowda, A Kumar, K Kim, M Kumar, C Kim 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 70-77, 2019 | 23 | 2019 |
Multi-Task Multi-Resolution Char-to-BPE Cross-Attention Decoder for End-to-End Speech Recognition. D Gowda, A Garg, K Kim, M Kumar, C Kim Interspeech, 2783-2787, 2019 | 22 | 2019 |
Hierarchical Multi-Stage Word-to-Grapheme Named Entity Corrector for Automatic Speech Recognition. A Garg, A Gupta, D Gowda, S Singh, C Kim Interspeech, 1793-1797, 2020 | 20 | 2020 |
Streaming On-Device End-to-End ASR System for Privacy-Sensitive Voice-Typing. A Garg, GP Vadisetti, D Gowda, S Jin, A Jayasimha, Y Han, J Kim, J Park, ... Interspeech, 3371-3375, 2020 | 17 | 2020 |
Streaming end-to-end speech recognition with jointly trained neural feature enhancement C Kim, A Garg, D Gowda, S Mun, C Han ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 10 | 2021 |
Utterance Invariant Training for Hybrid Two-Pass End-to-End Speech Recognition. D Gowda, A Kumar, K Kim, H Yang, A Garg, S Singh, J Kim, M Kumar, ... Interspeech, 2827-2831, 2020 | 7 | 2020 |
A comparison of streaming models and data augmentation methods for robust speech recognition J Kim, M Kumar, D Gowda, A Garg, C Kim 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 5 | 2021 |
Method and device for speech recognition DN Gowda, KIM Kwangyoun, A Garg, C Kim US Patent 11,302,331, 2022 | 3 | 2022 |
Data Driven Grapheme-to-Phoneme Representations for a Lexicon-Free Text-to-Speech A Garg, J Kim, S Khyalia, C Kim, D Gowda ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 2 | 2024 |
Self-supervised accent learning for under-resourced accents using native language data M Kumar, J Kim, D Gowda, A Garg, C Kim ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 2 | 2023 |
Semi-supervised transfer learning for language expansion of end-to-end speech recognition models to low-resource languages J Kim, M Kumar, D Gowda, A Garg, C Kim 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 2 | 2021 |
Voice recognition device and method C Kim, DN Gowda, S Kim, M Shin, LP Heck, A Garg, KIM Kwangyoun, ... US Patent 11,961,522, 2024 | 1 | 2024 |
System and method for modifying speech recognition result C Kim, DN Gowda, A Garg, K Lee US Patent 11,521,619, 2022 | 1 | 2022 |
HiTNet: Byte-to-BPE Hierarchical Transcription Network for End-to-End Speech Recognition D Gowda, A Garg, J Kim, M Kumar, S Singh, A Gupta, A Kumar, ... 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | | 2021 |