Runnan Li
Verified email at bupt.edu.cn
Title
Cited by
Year
Dilated residual network with multi-head self-attention for speech emotion recognition
R Li, Z Wu, J Jia, S Zhao, H Meng
ICASSP 2019-2019 IEEE international conference on acoustics, speech and …, 2019
Cited by: 79, Year: 2019
Learning discriminative features from spectrograms using center loss for speech emotion recognition
D Dai, Z Wu, R Li, X Wu, J Jia, H Meng
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 60, Year: 2019
Towards Discriminative Representation Learning for Speech Emotion Recognition.
R Li, Z Wu, J Jia, Y Bu, S Zhao, H Meng
IJCAI, 5060-5066, 2019
Cited by: 50, Year: 2019
One-shot voice conversion with global speaker embeddings.
H Lu, Z Wu, D Dai, R Li, S Kang, J Jia, H Meng
Interspeech, 669-673, 2019
Cited by: 47, Year: 2019
Multi-task deep learning for user intention understanding in speech interaction systems
Y Ning, J Jia, Z Wu, R Li, Y An, Y Wang, H Meng
Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017
Cited by: 24, Year: 2017
Inferring user emotive state changes in realistic human-computer conversational dialogs
R Li, Z Wu, J Jia, J Li, W Chen, H Meng
Proceedings of the 26th ACM international conference on Multimedia, 136-144, 2018
Cited by: 20, Year: 2018
Learning cross-lingual knowledge with multilingual BLSTM for emphasis detection with limited training data
Y Ning, Z Wu, R Li, J Jia, M Xu, H Meng, L Cai
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
Cited by: 18, Year: 2017
Multi-task learning of structured output layer bidirectional LSTMs for speech synthesis
R Li, Z Wu, X Liu, H Meng, L Cai
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
Cited by: 17, Year: 2017
Siamese Recurrent Auto-Encoder Representation for Query-by-Example Spoken Term Detection.
Z Zhu, Z Wu, R Li, H Meng, L Cai
Interspeech, 102-106, 2018
Cited by: 16, Year: 2018
Applying multitask learning to acoustic-phonemic model for mispronunciation detection and diagnosis in L2 English speech
S Mao, Z Wu, R Li, X Li, H Meng, L Cai
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 16, Year: 2018
Transformer-S2A: Robust and efficient speech-to-animation
L Chen, Z Wu, J Ling, R Li, X Tan, S Zhao
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 15, Year: 2022
Knowledge-Based Linguistic Encoding for End-to-End Mandarin Text-to-Speech Synthesis.
J Li, Z Wu, R Li, P Zhi, S Yang, H Meng
INTERSPEECH, 4494-4498, 2019
Cited by: 15, Year: 2019
Memories are one-to-many mapping alleviators in talking face generation
A Tang, T He, X Tan, J Ling, R Li, S Zhao, L Song, J Bian
arXiv preprint arXiv:2212.05005, 2022
Cited by: 14, Year: 2022
A compact framework for voice conversion using WaveNet conditioned on phonetic posteriorgrams
H Lu, Z Wu, R Li, S Kang, J Jia, H Meng
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 14, Year: 2019
Integrating articulatory features into acoustic-phonemic model for mispronunciation detection and diagnosis in L2 English speech
S Mao, Z Wu, X Li, R Li, X Wu, H Meng
2018 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2018
Cited by: 11, Year: 2018
Emphatic speech generation with conditioned input layer and bidirectional LSTMs for expressive speech synthesis
R Li, Z Wu, Y Huang, J Jia, H Meng, L Cai
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 11, Year: 2018
StableFace: Analyzing and improving motion stability for talking face generation
J Ling, X Tan, L Chen, R Li, Y Zhang, S Zhao, L Song
IEEE Journal of Selected Topics in Signal Processing, 2023
Cited by: 10, Year: 2023
ERA-Solver: Error-robust Adams solver for fast sampling of diffusion probabilistic models
S Li, L Liu, Z Chai, R Li, X Tan
arXiv preprint arXiv:2301.12935, 2023
Cited by: 9, Year: 2023
Multi-Task Learning for Prosodic Structure Generation Using BLSTM RNN with Structured Output Layer.
Y Huang, Z Wu, R Li, H Meng, L Cai
INTERSPEECH, 779-783, 2017
Cited by: 8, Year: 2017
Emphasis detection for voice dialogue applications using multi-channel convolutional bidirectional long short-term memory network
L Zhang, J Jia, F Meng, S Zhou, W Chen, C Zhang, R Li
2018 11th International Symposium on Chinese Spoken Language Processing …, 2018
Cited by: 7, Year: 2018
Articles 1–20