Jian Cong
ByteDance
Verified email at mail.nwpu.edu.cn
Title
Cited by
Year
NaturalSpeech: End-to-End Text-to-Speech Synthesis with Human-Level Quality
X Tan, J Chen, H Liu, J Cong, C Zhang, Y Liu, X Wang, Y Leng, Y Yi, L He, ...
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024
Cited by: 88; Year: 2024
VISinger: Variational inference with adversarial learning for end-to-end singing voice synthesis
Y Zhang, J Cong, H Xue, L Xie, P Zhu, M Bi
ICASSP 2022, 2022
Cited by: 45; Year: 2022
Data efficient voice cloning from noisy samples with domain adversarial training
J Cong, S Yang, L Xie, G Yu, G Wan
INTERSPEECH 2020, 2020
Cited by: 31; Year: 2020
Controllable Context-aware Conversational Speech Synthesis
J Cong, S Yang, N Hu, G Li, L Xie, D Su
INTERSPEECH 2021, 2021
Cited by: 25; Year: 2021
Glow-WaveGAN: Learning speech representations from GAN-based variational auto-encoder for high fidelity flow-based speech synthesis
J Cong, S Yang, L Xie, D Su
INTERSPEECH 2021, 2021
Cited by: 24; Year: 2021
Glow-WaveGAN 2: high-quality zero-shot text-to-speech synthesis and any-to-any voice conversion
Y Lei, S Yang, J Cong, L Xie, D Su
INTERSPEECH 2022, 2022
Cited by: 10; Year: 2022
DSPGAN: a GAN-based universal vocoder for high-fidelity TTS by time-frequency domain supervision from DSP
K Song, Y Zhang, Y Lei, J Cong, H Li, L Xie, G He, J Bai
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 8; Year: 2023
DiCLET-TTS: Diffusion model based cross-lingual emotion transfer for text-to-speech—A study between English and Mandarin
T Li, C Hu, J Cong, X Zhu, J Li, Q Tian, Y Wang, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
Cited by: 2; Year: 2023
AdaVITS: Tiny VITS for Low Computing Resource Speaker Adaptation
K Song, H Xue, X Wang, J Cong, Y Zhang, L Xie, B Yang, X Zhang, D Su
2022 13th International Symposium on Chinese Spoken Language Processing …, 2022
Cited by: 2; Year: 2022
U-Style: Cascading U-nets with Multi-level Speaker and Style Modeling for Zero-Shot Voice Cloning
T Li, Z Wang, X Zhu, J Cong, Q Tian, Y Wang, L Xie
arXiv preprint arXiv:2310.04004, 2023
Year: 2023
Robust MelGAN: A robust universal neural vocoder for high-fidelity TTS
K Song, J Cong, X Wang, Y Zhang, L Xie, N Jiang, H Wu
2022 13th International Symposium on Chinese Spoken Language Processing …, 2022
Year: 2022
Articles 1–11