Xuankai Chang
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings
S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ...
arXiv preprint arXiv:2004.09249, 2020
SUPERB: Speech processing universal performance benchmark
S Yang, PH Chi, YS Chuang, CIJ Lai, K Lakhotia, YY Lin, AT Liu, J Shi, ...
arXiv preprint arXiv:2105.01051, 2021
Recent developments on ESPnet toolkit boosted by conformer
P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Recognizing multi-talker speech with permutation invariant training
D Yu, X Chang, Y Qian
arXiv preprint arXiv:1704.01985, 2017
MIMO-Speech: End-to-end multi-channel multi-speaker speech recognition
X Chang, W Zhang, Y Qian, J Le Roux, S Watanabe
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
Single-channel multi-talker speech recognition with permutation invariant training
Y Qian, X Chang, D Yu
Speech Communication 104, 1-11, 2018
End-to-end monaural multi-speaker ASR system without pretraining
X Chang, Y Qian, K Yu, S Watanabe
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
End-to-end multi-speaker speech recognition with transformer
X Chang, W Zhang, Y Qian, J Le Roux, S Watanabe
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Past review, current progress, and challenges ahead on the cocktail party problem
Y Qian, C Weng, X Chang, S Wang, D Yu
Frontiers of Information Technology & Electronic Engineering 19 (1), 40-63, 2018
Unrestricted Vocabulary Keyword Spotting Using LSTM-CTC.
Y Zhuang, X Chang, Y Qian, K Yu
Interspeech, 938-942, 2016
ESPnet-SE: End-to-end speech enhancement and separation toolkit designed for ASR integration
C Li, J Shi, W Zhang, AS Subramanian, X Chang, N Kamo, M Hira, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 785-792, 2021
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans
S Watanabe, F Boyer, X Chang, P Guo, T Hayashi, Y Higuchi, T Hori, ...
2021 IEEE Data Science and Learning Workshop (DSLW), 1-6, 2021
Investigation of end-to-end speaker-attributed ASR for continuous multi-talker recordings
N Kanda, X Chang, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka
2021 IEEE Spoken Language Technology Workshop (SLT), 809-816, 2021
Insertion-based modeling for end-to-end automatic speech recognition
Y Fujita, S Watanabe, M Omachi, X Chang
arXiv preprint arXiv:2005.13211, 2020
Monaural Multi-Talker Speech Recognition with Attention Mechanism and Gated Convolutional Networks.
X Chang, Y Qian, D Yu
INTERSPEECH, 1586-1590, 2018
End-to-end far-field speech recognition with unified dereverberation and beamforming
W Zhang, AS Subramanian, X Chang, S Watanabe, Y Qian
arXiv preprint arXiv:2005.10479, 2020
Sequence to multi-sequence learning via conditional chain mapping for mixture signals
J Shi, X Chang, P Guo, S Watanabe, Y Fujita, J Xu, B Xu, L Xie
Advances in Neural Information Processing Systems 33, 3735-3747, 2020
Improving end-to-end single-channel multi-talker speech recognition
W Zhang, X Chang, Y Qian, S Watanabe
IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1385-1394, 2020
Adaptive permutation invariant training with auxiliary information for monaural multi-talker speech recognition
X Chang, Y Qian, D Yu
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
An exploration of self-supervised pretrained representations for end-to-end speech recognition
X Chang, T Maekaku, P Guo, J Shi, YJ Lu, AS Subramanian, T Wang, ...
arXiv preprint arXiv:2110.04590, 2021