Seuraa
Shun Lei
Shun Lei
PhD student, Tsinghua University
Vahvistettu sähköpostiosoite verkkotunnuksessa mails.tsinghua.edu.cn - Kotisivu
Nimike
Viittaukset
Viittaukset
Vuosi
Mrc-lstm: A hybrid approach of multi-scale residual cnn and lstm to predict bitcoin price
Q Guo, S Lei, Q Ye, Z Fang
2021 International Joint Conference on Neural Networks (IJCNN), 1-8, 2021
452021
Towards Expressive Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis
S Lei, Y Zhou, L Chen, Z Wu, S Kang, H Meng
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
212022
Foundation models for music: A survey
Y Ma, A Øland, A Ragni, BMS Del Sette, C Saitis, C Donahue, C Lin, ...
arXiv preprint arXiv:2408.14340, 2024
122024
MSStyleTTS: Multi-Scale Style Modeling with Hierarchical Context Information for Expressive Speech Synthesis
S Lei, Y Zhou, L Chen, Z Wu, X Wu, S Kang, H Meng
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
102023
GTN-Bailando: Genre Consistent long-Term 3D Dance Generation Based on Pre-Trained Genre Token Network
H Zhuang, S Lei, L Xiao, W Li, L Chen, S Yang, Z Wu, S Kang, H Meng
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
92023
Unsupervised Multi-scale Expressive Speaking Style Modeling with Hierarchical Context Information for Audiobook Speech Synthesis
X Chen, S Lei, Z Wu, D Xu, W Zhao, H Meng
Proceedings of the 29th International Conference on Computational …, 2022
92022
Context-Aware Coherent Speaking Style Prediction with Hierarchical Transformers for Audiobook Speech Synthesis
S Lei, Y Zhou, L Chen, Z Wu, S Kang, H Meng
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
62023
Towards spontaneous style modeling with semi-supervised pre-training for conversational text-to-speech synthesis
W Li, S Lei, Q Huang, Y Zhou, Z Wu, S Kang, H Meng
Proc. Interspeech 2023, 2023
52023
SongCreator: Lyrics-based Universal Song Generation
S Lei, Y Zhou, B Tang, MWY Lam, F Liu, H Liu, J Wu, S Kang, Z Wu, ...
arXiv preprint arXiv:2409.06029, 2024
32024
VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling
Y Zhou, X Qin, Z Jin, S Zhou, S Lei, S Zhou, Z Wu, J Jia
ACM Multimedia 2024, 2024
32024
SimCalib: Graph Neural Network Calibration based on Similarity between Nodes
B Tang, Z Wu, X Wu, Q Huang, J Chen, S Lei, H Meng
Proceedings of the AAAI Conference on Artificial Intelligence 38 (14), 15267 …, 2024
32024
TouchTTS: An Embarrassingly Simple TTS Framework that Everyone Can Touch
X Song, M Xing, C Ma, S Li, D Wu, B Zhang, F Pan, D Zhou, Y Zhang, ...
arXiv preprint arXiv:2412.08237, 2024
22024
Towards Improving the Expressiveness of Singing Voice Synthesis with BERT Derived Semantic Information}}
S Zhou, S Lei, W You, D Tuo, Y You, Z Wu, S Kang, H Meng
Proc. Interspeech 2022, 4292-4296, 2022
22022
MuCodec: Ultra Low-Bitrate Music Codec
Y Xu, H Chen, J Yu, W Tan, R Gu, S Lei, Z Lin, Z Wu
arXiv preprint arXiv:2409.13216, 2024
12024
An End-to-End Approach for Chord-Conditioned Song Generation
S Gao, S Lei, F Zhuo, H Liu, F Liu, B Tang, Q Huang, S Kang, Z Wu
Proc. Interspeech 2024, 2024
12024
AdaMesh: Personalized Facial Expressions and Head Poses for Speech-Driven 3D Facial Animation
L Chen, W Bao, S Lei, B Tang, Z Wu, S Kang, H Huang
arXiv preprint arXiv:2310.07236, 2023
12023
Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts
S Lei, Y Zhou, L Chen, D Luo, Z Wu, X Wu, S Kang, T Jiang, Y Zhou, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2023
12023
The Codec Language Model-based Zero-Shot Spontaneous Style TTS System for CoVoC Challenge 2024
S Zhou, Y Zhou, W Li, J Chen, R Ye, W Wu, Z Lin, S Lei, Z Wu
2024 IEEE 14th International Symposium on Chinese Spoken Language Processing …, 2024
2024
NRAdapt: Noise-Robust Adaptive Text to Speech Using Untranscribed Data
M Cheng, S Lei, D Dai, Z Wu, D Chong
2024 International Joint Conference on Neural Networks (IJCNN), 1-8, 2024
2024
The THU-HCSI Multi-Speaker Multi-Lingual Few-Shot Voice Cloning System for LIMMITS'24 Challenge
Y Zhou, S Zhou, S Lei, Z Wu, M Wu
arXiv preprint arXiv:2404.16619, 2024
2024
Järjestelmä ei voi suorittaa toimenpidettä nyt. Yritä myöhemmin uudelleen.
Artikkelit 1–20