Front-end architecture for a multi-lingual text-to-speech system M Chu, H Peng, Y Zhao US Patent 7,496,498, 2009 | 366 | 2009 |
Refining of segmental boundaries in speech waveforms using contextual-dependent models Y Zhao, M Chu, J Zhou, L Wang US Patent 7,496,512, 2009 | 315 | 2009 |
Providing personalized voice font for text-to-speech applications M Chu, Y Zhao, S Zhao US Patent 7,693,719, 2010 | 307 | 2010 |
Unnatural prosody detection in speech synthesis Y Zhao, FKP Soong, M Chu, L Wang US Patent 8,583,438, 2013 | 240 | 2013 |
Unnatural prosody detection in speech synthesis Y Zhao, FKP Soong, M Chu, L Wang US Patent 8,583,438, 2013 | 240 | 2013 |
Speech unit selection using HMM acoustic models M Chu, P Liu, Y Zhao, Y Li US Patent App. 11/508,093, 2008 | 239 | 2008 |
Defining atom units between phone and syllable for TTS systems M Chu, Y Zhao US Patent 7,418,389, 2008 | 215 | 2008 |
End-to-end attention based text-dependent speaker verification SX Zhang, Z Chen, Y Zhao, J Li, Y Gong 2016 IEEE Spoken Language Technology Workshop (SLT), 171-178, 2016 | 179 | 2016 |
Optimization of an objective measure for estimating mean opinion score of synthesized speech M Chu, H Peng, Y Zhao US Patent 7,386,451, 2008 | 173 | 2008 |
Speaker-invariant training via adversarial learning Z Meng, J Li, Z Chen, Y Zhao, V Mazalov, Y Gong, BH Juang 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 99 | 2018 |
Microsoft Mulan-a bilingual TTS system M Chu, H Peng, Y Zhao, Z Niu, E Chang 2003 IEEE International Conference on Acoustics, Speech, and Signal …, 2003 | 82 | 2003 |
Adversarial speaker verification Z Meng, Y Zhao, J Li, Y Gong ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 59 | 2019 |
Advances in online audio-visual meeting transcription T Yoshioka, I Abramovski, C Aksoylar, Z Chen, M David, D Dimitriadis, ... 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 57 | 2019 |
Conditional teacher-student learning Z Meng, J Li, Y Zhao, Y Gong ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 53 | 2019 |
Refining segmental boundaries for TTS database using fine contextual-dependent boundary models L Wang, Y Zhao, M Chu, J Zhou, Z Cao 2004 IEEE International Conference on Acoustics, Speech, and Signal …, 2004 | 49 | 2004 |
Low-rank plus diagonal adaptation for deep neural networks Y Zhao, J Li, Y Gong 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 46 | 2016 |
Cnn with phonetic attention for text-independent speaker verification T Zhou, Y Zhao, J Li, Y Gong, J Wu 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 42 | 2019 |
Perpetually optimizing the cost function for unit selection in a TTS system with one single run of MOS evaluation H Peng, Y Zhao, M Chu Seventh International Conference on Spoken Language Processing, 2002 | 42 | 2002 |
INVESTIGATING ONLINE LOW-FOOTPRINT SPEAKER ADAPTATION USING GENERALIZED LINEAR REGRESSION AND CLICK-THROUGH DATA Y Zhao, J Li, J Xue, Y Gong | 37 | 2015 |
Microsoft speaker diarization system for the voxceleb speaker recognition challenge 2020 X Xiao, N Kanda, Z Chen, T Zhou, T Yoshioka, S Chen, Y Zhao, G Liu, ... arXiv preprint arXiv:2010.11458, 2020 | 36 | 2020 |