Seuraa
Shi-Xiong (Austin) Zhang
Shi-Xiong (Austin) Zhang
Muut nimetShi-Xiong Zhang, Shixiong Zhang
Sr. Director | AI Foundations@Capital One | ex-Microsoft, ex-Tencent, Cambridge PhD
Vahvistettu sähköpostiosoite verkkotunnuksessa capitalone.com
Nimike
Viittaukset
Viittaukset
Vuosi
An overview of deep-learning-based audio-visual speech enhancement and separation
D Michelsanti, ZH Tan, SX Zhang, Y Xu, M Yu, D Yu, J Jensen
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1368-1396, 2021
2812021
End-to-end attention based text-dependent speaker verification
SX Zhang, Z Chen, Y Zhao, J Li, Y Gong
2016 IEEE Spoken Language Technology Workshop (SLT), 171-178, 2016
2082016
ADL-MVDR: All deep learning MVDR beamformer for target speech separation
Z Zhang, Y Xu, M Yu, SX Zhang, L Chen, D Yu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
1392021
Time Domain Audio Visual Speech Separation
J Wu, Y Xu, SX Zhang, LW Chen, M Yu, L Xie, D Yu
Automatic Speech Recognition and Understanding Workshop, ASRU 2019,, 2019
1362019
Multi-modal multi-channel target speech separation
R Gu, SX Zhang, Y Xu, L Chen, Y Zou, D Yu
IEEE Journal of Selected Topics in Signal Processing 14 (3), 530-541, 2020
1132020
Computerized intelligent assistant for conferences
A Diamant, KM Ben-Dor, E Krupka, R Halaly, Y Smolin, I Gurvich, ...
US Patent 10,867,610, 2020
1062020
Audio-visual Recognition of Overlapped speech for the LRS2 dataset
J Yu, SX Zhang, J Wu, S Ghorbani, B Wu, S Kang, S Liu, X Liu, H Meng, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1042020
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information
R Gu, L Chen, SX Zhang, J Zheng, Y Xu, M Yu, D Su, Y Zou, D Yu
1032019
Investigation of Multilingual Deep Neural Networks for Spoken Term Detection
K Knill, MJF Gales, S Rath, P Woodland, SX Zhang
ASRU, 2013
1022013
SIMPLIFYING LONG SHORT-TERM MEMORY ACOUSTIC MODELS FOR FAST TRAINING AND DECODING
Y Miao, J Li, Y Wang, S Zhang, Y Gong
ICASSP, 2016
1002016
A comprehensive study of speech separation: spectrogram vs waveform separation
F Bahmaninezhad, J Wu, R Gu, SX Zhang, Y Xu, M Yu, D Yu
arXiv preprint arXiv:1905.07497, 2019
952019
End-to-end multi-channel speech separation
R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
arXiv preprint arXiv:1905.06286, 2019
942019
Enhancing End-to-End Multi-Channel Speech Separation Via Spatial Feature Learning
R Gu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
662020
New era for robust speech recognition: exploiting deep learning
S Watanabe, M Delcroix, F Metze, JR Hershey, et al.
Springer, 2017
65*2017
FAST-RIR: Fast neural diffuse room impulse response generator
A Ratnarajah, SX Zhang, M Yu, Z Tang, D Manocha, D Yu
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
632022
Audio-visual speech separation and dereverberation with a two-stage multimodal network
K Tan, Y Xu, SX Zhang, M Yu, D Yu
IEEE Journal of Selected Topics in Signal Processing 14 (3), 542-553, 2020
632020
Structured SVMs for automatic speech recognition
SX Zhang, MJF Gales
IEEE Transactions on Audio, Speech, and Language Processing 21 (3), 544-555, 2012
502012
DEEP NEURAL SUPPORT VECTOR MACHINES FOR SPEECH RECOGNITION
SX Zhang, C Liu, K Yao, Y Gong
ICASSP 2015, 2015
462015
Generalized Spatio-Temporal RNN Beamformer for Target Speech Separation
Y Xu, Z Zhang, M Yu, SX Zhang, D Yu
arXiv preprint arXiv:2101.01280, 2021
452021
Complex neural spatial filter: Enhancing multi-channel target speech separation in complex domain
R Gu, SX Zhang, Y Zou, D Yu
IEEE Signal Processing Letters 28, 1370-1374, 2021
432021
Järjestelmä ei voi suorittaa toimenpidettä nyt. Yritä myöhemmin uudelleen.
Artikkelit 1–20