Seuraa
Yao Qian
Nimike
Viittaukset
Viittaukset
Vuosi
Wavlm: Large-scale self-supervised pre-training for full stack speech processing
S Chen, C Wang, Z Chen, Y Wu, S Liu, Z Chen, J Li, N Kanda, T Yoshioka, ...
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1505-1518, 2022
16732022
TTS synthesis with bidirectional LSTM based recurrent neural networks
Y Fan, Y Qian, FL Xie, FK Soong
Fifteenth annual conference of the international speech communication …, 2014
6332014
On the training aspects of deep neural network (DNN) for parametric TTS synthesis
Y Qian, Y Fan, W Hu, FK Soong
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
2812014
Improved mispronunciation detection with deep neural network trained acoustic models and transfer learning based logistic regression classifiers
W Hu, Y Qian, FK Soong, Y Wang
Speech Communication 67, 154-166, 2015
2702015
Speecht5: Unified-modal encoder-decoder pre-training for spoken language processing
J Ao, R Wang, L Zhou, C Wang, S Ren, Y Wu, S Liu, T Ko, Q Li, Y Zhang, ...
arXiv preprint arXiv:2110.07205, 2021
2372021
Part-of-speech tagging with bidirectional long short-term memory recurrent neural network
P Wang, Y Qian, FK Soong, L He, H Zhao
arXiv preprint arXiv:1510.06168, 2015
1652015
Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis
Y Fan, Y Qian, FK Soong, L He
2015 IEEE international conference on acoustics, speech and signal …, 2015
1612015
Using bidirectional LSTM recurrent neural networks to learn high-level abstractions of sequential features for automated scoring of non-native spontaneous speech
Z Yu, V Ramanarayanan, D Suendermann-Oeft, X Wang, K Zechner, ...
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
1362015
Large-scale self-supervised speech representation learning for automatic speaker verification
Z Chen, S Chen, Y Wu, Y Qian, C Wang, S Liu, Y Qian, M Zeng
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
1342022
Unispeech: Unified speech representation learning with labeled and unlabeled data
C Wang, Y Wu, Y Qian, K Kumatani, S Liu, F Wei, M Zeng, X Huang
International Conference on Machine Learning, 10937-10947, 2021
1292021
A unified tagging solution: Bidirectional lstm recurrent neural network with word embedding
P Wang, Y Qian, FK Soong, L He, H Zhao
arXiv preprint arXiv:1511.00215, 2015
1232015
A report on the 2017 native language identification shared task
S Malmasi, K Evanini, A Cahill, J Tetreault, R Pugh, C Hamill, ...
Proceedings of the 12th Workshop on Innovative Use of NLP for Building …, 2017
1212017
A new DNN-based high quality pronunciation evaluation for computer-aided language learning (CALL).
W Hu, Y Qian, FK Soong
Interspeech, 1886-1890, 2013
1112013
Locating boundaries for prosodic constituents in unrestricted Mandarin texts
M Chu, Y Qian
International Journal of Computational Linguistics & Chinese Language …, 2001
1082001
End-to-end neural network based automated speech scoring
L Chen, J Tao, S Ghaffarzadegan, Y Qian
2018 IEEE international conference on acoustics, speech and signal …, 2018
952018
Unispeech-sat: Universal speech representation learning with speaker aware pre-training
S Chen, Y Wu, C Wang, Z Chen, Z Chen, S Liu, J Wu, Y Qian, F Wei, J Li, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
942022
A cross-language state sharing and mapping approach to bilingual (Mandarin–English) TTS
Y Qian, H Liang, FK Soong
IEEE Transactions on Audio, Speech, and Language Processing 17 (6), 1231-1239, 2009
882009
Exploring ASR-free end-to-end modeling to improve spoken language understanding in a cloud-based dialog system
Y Qian, R Ubale, V Ramanaryanan, P Lange, D Suendermann-Oeft, ...
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
812017
Word embedding for recurrent neural network based TTS synthesis
P Wang, Y Qian, FK Soong, L He, H Zhao
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
752015
An HMM-based Mandarin Chinese text-to-speech system
Y Qian, F Soong, Y Chen, M Chu
Chinese Spoken Language Processing: 5th International Symposium, ISCSLP 2006 …, 2006
742006
Järjestelmä ei voi suorittaa toimenpidettä nyt. Yritä myöhemmin uudelleen.
Artikkelit 1–20