Zero-shot multi-speaker text-to-speech with state-of-the-art neural speaker embeddings E Cooper, CI Lai, Y Yasuda, F Fang, X Wang, N Chen, J Yamagishi ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 154 | 2020 |
Generalization ability of MOS prediction networks E Cooper, WC Huang, T Toda, J Yamagishi ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 73 | 2022 |
Effect of pronounciations on OOV queries in spoken term detection D Can, E Cooper, A Sethy, C White, B Ramabhadran, M Saraclar 2009 IEEE International Conference on Acoustics, Speech and Signal …, 2009 | 72 | 2009 |
The voicemos challenge 2022 WC Huang, E Cooper, Y Tsao, HM Wang, T Toda, J Yamagishi arXiv preprint arXiv:2203.11389, 2022 | 57 | 2022 |
Improving speech recognition and keyword search for low resource languages using web data G Mendels, E Cooper, V Soto, J Hirschberg, MJF Gales, KM Knill, A Ragni, ... INTERSPEECH 2015: 16th Annual Conference of the International Speech …, 2015 | 47 | 2015 |
Ldnet: Unified listener dependent modeling in mos prediction for synthetic speech WC Huang, E Cooper, J Yamagishi, T Toda ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 36 | 2022 |
Cross-language prominence detection A Rosenberg, EL Cooper, R Levitan, JB Hirschberg Speech Prosody 2012, 2012 | 36 | 2012 |
How do voices from past speech synthesis challenges compare today? E Cooper, J Yamagishi arXiv preprint arXiv:2105.02373, 2021 | 34 | 2021 |
Text-to-speech synthesis using found data for low-resource languages E Cooper Columbia University, 2019 | 23 | 2019 |
Utterance selection for optimizing intelligibility of tts voices trained on asr data E Cooper, X Wang Interspeech 2017 1, 2017 | 23 | 2017 |
Cross-language phrase boundary detection V Soto, E Cooper, A Rosenberg, J Hirschberg 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 22 | 2013 |
An initial investigation for detecting partially spoofed audio L Zhang, X Wang, E Cooper, J Yamagishi, J Patino, N Evans arXiv preprint arXiv:2104.02518, 2021 | 21 | 2021 |
Can speaker augmentation improve multi-speaker end-to-end TTS? E Cooper, CI Lai, Y Yasuda, J Yamagishi arXiv preprint arXiv:2005.01245, 2020 | 20 | 2020 |
Learning disentangled phone and speaker representations in a semi-supervised VQ-VAE paradigm J Williams, Y Zhao, E Cooper, J Yamagishi ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 18 | 2021 |
Improved prosody from learned f0 codebook representations for vq-vae speech waveform reconstruction Y Zhao, H Li, CI Lai, J Williams, E Cooper, J Yamagishi arXiv preprint arXiv:2005.07884, 2020 | 18 | 2020 |
Web derived pronunciations for spoken term detection D Can, E Cooper, A Ghoshal, M Jansche, S Khudanpur, B Ramabhadran, ... Proceedings of the 32nd international ACM SIGIR conference on Research and …, 2009 | 17 | 2009 |
Data Selection and Adaptation for Naturalness in HMM-Based Speech Synthesis. E Cooper, A Chang, Y Levitan, J Hirschberg INTERSPEECH, 357-361, 2016 | 15 | 2016 |
Rescoring confusion networks for keyword search V Soto, E Cooper, L Mangu, A Rosenberg, J Hirschberg 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 15 | 2014 |
Attention back-end for automatic speaker verification with multiple enrollment utterances C Zeng, X Wang, E Cooper, X Miao, J Yamagishi ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 14 | 2022 |
Language-independent speaker anonymization approach using self-supervised pre-trained models X Miao, X Wang, E Cooper, J Yamagishi, N Tomashenko arXiv preprint arXiv:2202.13097, 2022 | 14 | 2022 |