Secap: Speech emotion captioning with large language model Y Xu, H Chen, J Yu, Q Huang, Z Wu, SX Zhang, G Li, Y Luo, R Gu Proceedings of the AAAI Conference on Artificial Intelligence 38 (17), 19323 …, 2024 | 11 | 2024 |
CB-Conformer: Contextual biasing Conformer for biased word recognition Y Xu, B Liu, Q Huang, X Song, Z Wu, S Kang, H Meng ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 10 | 2023 |
Unifiedgesture: A unified gesture synthesis model for multiple skeletons S Yang, Z Wang, Z Wu, M Li, Z Zhang, Q Huang, L Hao, S Xu, X Wu, ... Proceedings of the 31st ACM International Conference on Multimedia, 1033-1044, 2023 | 9 | 2023 |
Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis W Li, S Lei, Q Huang, Y Zhou, Z Wu, S Kang, H Meng arXiv preprint arXiv:2308.16593, 2023 | 5 | 2023 |
HILvoice: Human-in-the-Loop Style Selection for Elder-Facing Speech Synthesis X Chen, Q Huang, X Wu, Z Wu, H Meng 2022 13th International Symposium on Chinese Spoken Language Processing …, 2022 | 4 | 2022 |
Enhancing Expressiveness in Dance Generation Via Integrating Frequency and Music Style Information Q Huang, X He, B Tang, H Zhuang, L Chen, S Gao, Z Wu, H Huang, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
SimCalib: Graph Neural Network Calibration Based on Similarity between Nodes B Tang, Z Wu, X Wu, Q Huang, J Chen, S Lei, H Meng Proceedings of the AAAI Conference on Artificial Intelligence 38 (14), 15267 …, 2024 | 1 | 2024 |
Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model X He, Q Huang, Z Zhang, Z Lin, Z Wu, S Yang, M Li, Z Chen, S Xu, X Wu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 1 | 2024 |