XLS-R: Self-supervised cross-lingual speech representation learning at scale | A Babu, C Wang, A Tjandra, K Lakhotia, Q Xu, N Goyal, K Singh, ... | arXiv preprint arXiv:2111.09296, 2021 | 445 | 2021 |
A multi-view approach to audio-visual speaker verification | L Sarı, K Singh, J Zhou, L Torresani, N Singhal, Y Saraf | ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 42 | 2021 |
Improved language identification through cross-lingual self-supervised learning | A Tjandra, DG Choudhury, F Zhang, K Singh, A Conneau, A Baevski, ... | ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 41 | 2022 |
Multilingual graphemic hybrid ASR with massive data augmentation | C Liu, Q Zhang, X Zhang, K Singh, Y Saraf, G Zweig | arXiv preprint arXiv:1909.06522, 2019 | 29 | 2019 |
Conformer-based self-supervised learning for non-speech audio tasks | S Srivastava, Y Wang, A Tjandra, A Kumar, C Liu, K Singh, Y Saraf | ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 21 | 2022 |
Large scale weakly and semi-supervised learning for low-resource video ASR | K Singh, V Manohar, A Xiao, S Edunov, R Girshick, V Liptchinsky, ... | arXiv preprint arXiv:2005.07850, 2020 | 9 | 2020 |
Training ASR models by generation of contextual information | K Singh, D Okhonko, J Liu, Y Wang, F Zhang, R Girshick, S Edunov, ... | ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 6 | 2020 |