Seuraa
Mengxiao Bi
Mengxiao Bi
Fuxi AI Lab, NetEase Inc.
Vahvistettu sähköpostiosoite verkkotunnuksessa corp.netease.com
Nimike
Viittaukset
Viittaukset
Vuosi
Very deep convolutional neural networks for noise robust speech recognition
Y Qian, M Bi, T Tan, K Yu
IEEE/ACM Transactions on Audio, Speech, and Language Processing 24 (12 …, 2016
3992016
Opencpop: A high-quality open source chinese popular song corpus for singing voice synthesis
Y Wang, X Wang, P Zhu, J Wu, H Li, H Xue, Y Zhang, L Xie, M Bi
arXiv preprint arXiv:2201.07429, 2022
932022
Visinger: Variational inference with adversarial learning for end-to-end singing voice synthesis
Y Zhang, J Cong, H Xue, L Xie, P Zhu, M Bi
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
812022
Very deep convolutional neural networks for LVCSR.
M Bi, Y Qian, K Yu
Interspeech, 3259-3263, 2015
522015
Deep feed-forward sequential memory networks for speech synthesis
M Bi, H Lu, S Zhang, M Lei, Z Yan
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
162018
One-shot voice conversion for style transfer based on speaker adaptation
Z Wang, Q Xie, T Li, H Du, L Xie, P Zhu, M Bi
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
152022
Expressive-vc: Highly expressive voice conversion with attention fusion of bottleneck and perturbation features
Z Ning, Q Xie, P Zhu, Z Wang, L Xue, J Yao, L Xie, M Bi
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
132023
Edtalk: Efficient disentanglement for emotional talking head synthesis
S Tan, B Ji, M Bi, Y Pan
European Conference on Computer Vision, 398-416, 2025
102025
Dualvc: Dual-mode voice conversion using intra-model knowledge distillation and hybrid predictive coding
Z Ning, Y Jiang, P Zhu, J Yao, S Wang, L Xie, M Bi
arXiv preprint arXiv:2305.12425, 2023
102023
Learn2sing 2.0: Diffusion and mutual information-based target speaker svs by learning from singing teacher
H Xue, X Wang, Y Zhang, L Xie, P Zhu, M Bi
arXiv preprint arXiv:2203.16408, 2022
102022
Dualvc 2: Dynamic masked convolution for unified streaming and non-streaming voice conversion
Z Ning, Y Jiang, P Zhu, S Wang, J Yao, L Xie, M Bi
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
42024
DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion
Z Ning, S Wang, P Zhu, Z Wang, J Yao, L Xie, M Bi
arXiv preprint arXiv:2406.07846, 2024
12024
Revealing Directions for Text-guided 3D Face Editing
Z Chen, Y Yan, S Liu, Y Cheng, W Zhao, L Li, M Bi, X Yang
arXiv preprint arXiv:2410.04965, 2024
2024
E1 TTS: Simple and Fast Non-Autoregressive TTS
Z Liu, S Wang, P Zhu, M Bi, H Li
arXiv preprint arXiv:2409.09351, 2024
2024
MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion
S Inoue, S Wang, W Wang, P Zhu, M Bi, H Li
arXiv preprint arXiv:2409.09352, 2024
2024
Preconditioned Nonlinear Conjugate Gradient Method for Real-time Interior-point Hyperelasticity
X Shen, R Cai, M Bi, T Lv
ACM SIGGRAPH 2024 Conference Papers, 1-11, 2024
2024
Multi-GradSpeech: Towards Diffusion-based Multi-Speaker Text-to-speech Using Consistent Diffusion Models
H Xue, S Guo, P Zhu, M Bi
arXiv preprint arXiv:2308.10428, 2023
2023
EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis (Supplementary Material)
S Tan, B Ji, M Bi, Y Pan
Järjestelmä ei voi suorittaa toimenpidettä nyt. Yritä myöhemmin uudelleen.
Artikkelit 1–18