Meng YU
Tencent AI Lab
Verified email at tencent.com - Homepage
Title
Cited by
Year
An overview of deep-learning-based audio-visual speech enhancement and separation
D Michelsanti, ZH Tan, SX Zhang, Y Xu, M Yu, D Yu, J Jensen
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1368-1396, 2021
Cited by 282, 2021
ADL-MVDR: All deep learning MVDR beamformer for target speech separation
Z Zhang, Y Xu, M Yu, SX Zhang, L Chen, D Yu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by 139, 2021
Time domain audio visual speech separation
J Wu, Y Xu, SX Zhang, LW Chen, M Yu, L Xie, D Yu
2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019
Cited by 136, 2019
DurIAN: Duration Informed Attention Network for Speech Synthesis.
C Yu, H Lu, N Hu, M Yu, C Weng, K Xu, P Liu, D Tuo, S Kang, G Lei, D Su, ...
Interspeech, 2027-2031, 2020
Cited by 109, 2020
Deep extractor network for target speaker recovery from single channel speech mixtures
J Wang, J Chen, D Su, L Chen, M Yu, Y Qian, D Yu
arXiv preprint arXiv:1807.08974, 2018
Cited by 104, 2018
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information.
R Gu, L Chen, SX Zhang, J Zheng, Y Xu, M Yu, D Su, Y Zou, D Yu
Interspeech, 4290-4294, 2019
Cited by 103, 2019
DurIAN: Duration informed attention network for multimodal synthesis
C Yu, H Lu, N Hu, M Yu, C Weng, K Xu, P Liu, D Tuo, S Kang, G Lei, D Su, ...
arXiv preprint arXiv:1909.01700, 2019
Cited by 103, 2019
A comprehensive study of speech separation: spectrogram vs waveform separation
F Bahmaninezhad, J Wu, R Gu, SX Zhang, Y Xu, M Yu, D Yu
arXiv preprint arXiv:1905.07497, 2019
Cited by 95, 2019
End-to-end multi-channel speech separation
R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
arXiv preprint arXiv:1905.06286, 2019
Cited by 87, 2019
Self-supervised text-independent speaker verification using prototypical momentum contrastive learning
W Xia, C Zhang, C Weng, M Yu, D Yu
ICASSP 2021-2021 IEEE international conference on acoustics, speech and …, 2021
Cited by 85, 2021
Deep learning based multi-source localization with source splitting and its effectiveness in multi-talker speech recognition
AS Subramanian, C Weng, S Watanabe, M Yu, D Yu
Computer Speech & Language 75, 101360, 2022
Cited by 78, 2022
Enhancing end-to-end multi-channel speech separation via spatial feature learning
R Gu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by 66, 2020
FAST-RIR: Fast neural diffuse room impulse response generator
A Ratnarajah, SX Zhang, M Yu, Z Tang, D Manocha, D Yu
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by 63, 2022
Audio-visual speech separation and dereverberation with a two-stage multimodal network
K Tan, Y Xu, SX Zhang, M Yu, D Yu
IEEE Journal of Selected Topics in Signal Processing 14 (3), 542-553, 2020
Cited by 63, 2020
Seq2seq attentional siamese neural networks for text-dependent speaker verification
Y Zhang, M Yu, N Li, C Yu, J Cui, D Yu
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by 53, 2019
Generalized spatio-temporal RNN beamformer for target speech separation
Y Xu, Z Zhang, M Yu, SX Zhang, D Yu
arXiv preprint arXiv:2101.01280, 2021
Cited by 45, 2021
Joint training of complex ratio mask based beamformer and acoustic model for noise robust ASR
Y Xu, C Weng, L Hui, J Liu, M Yu, D Su, D Yu
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by 44, 2019
Neural spatio-temporal beamformer for target speech separation
Y Xu, M Yu, SX Zhang, L Chen, C Weng, J Liu, D Yu
arXiv preprint arXiv:2005.03889, 2020
Cited by 43, 2020
Far-field location guided target speech extraction using end-to-end speech recognition objectives
AS Subramanian, C Weng, M Yu, SX Zhang, Y Xu, S Watanabe, D Yu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by 43, 2020
Speaker-aware target speaker enhancement by jointly learning with speaker embedding extraction
X Ji, M Yu, C Zhang, D Su, T Yu, X Liu, D Yu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by 42, 2020
Articles 1–20