Wangyou Zhang
Ph.D. candidate, Department of Computer Science and Engineering, Shanghai Jiao Tong University
Verified email at sjtu.edu.cn
Title
Cited by
Year
A comparative study on Transformer vs RNN in speech applications
S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ...
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
Cited by: 209; Year: 2019
MIMO-SPEECH: End-to-End Multi-Channel Multi-Speaker Speech Recognition
X Chang, W Zhang, Y Qian, J Le Roux, S Watanabe
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
Cited by: 33; Year: 2019
End-To-End Multi-Speaker Speech Recognition With Transformer
X Chang, W Zhang, Y Qian, J Le Roux, S Watanabe
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 23; Year: 2020
Recent Developments on ESPnet Toolkit Boosted by Conformer
P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ...
arXiv preprint arXiv:2010.13956, 2020
Cited by: 10; Year: 2020
Improving End-to-End Single-Channel Multi-Talker Speech Recognition
W Zhang, X Chang, Y Qian, S Watanabe
IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1385-1394, 2020
Cited by: 6; Year: 2020
End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming
W Zhang, AS Subramanian, X Chang, S Watanabe, Y Qian
Proc. Interspeech 2020, 324-328, 2020
Cited by: 5; Year: 2020
Robust DOA Estimation Based on Convolutional Neural Network and Time-Frequency Masking
W Zhang, Y Zhou, Y Qian
Proc. Interspeech 2019, 2703-2707, 2019
Cited by: 5; Year: 2019
Knowledge Distillation for End-to-End Monaural Multi-Talker ASR System
W Zhang, X Chang, Y Qian
Proc. Interspeech 2019, 2633-2637, 2019
Cited by: 3; Year: 2019
ESPnet-SE: end-to-end speech enhancement and separation toolkit designed for ASR integration
C Li, J Shi, W Zhang, AS Subramanian, X Chang, N Kamo, M Hira, ...
IEEE Spoken Language Technology Workshop (SLT), 785–792, 2021
Cited by: 2; Year: 2021
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend
W Zhang, C Boeddeker, S Watanabe, T Nakatani, M Delcroix, K Kinoshita, ...
arXiv preprint arXiv:2102.11525, 2021
Cited by: 1; Year: 2021
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans
S Watanabe, F Boyer, X Chang, P Guo, T Hayashi, Y Higuchi, T Hori, ...
arXiv preprint arXiv:2012.13006, 2020
Year: 2020
Convolutive Transfer Function Invariant SDR training criteria for Multi-Channel Reverberant Speech Separation
C Boeddeker, W Zhang, T Nakatani, K Kinoshita, T Ochiai, M Delcroix, ...
arXiv preprint arXiv:2011.15003, 2020
Year: 2020
End-to-End Overlapped Speech Detection and Speaker Counting with Raw Waveform
W Zhang, M Sun, L Wang, Y Qian
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
Year: 2019
Learning Contextual Language Embeddings for Monaural Multi-talker Speech Recognition
W Zhang, Y Qian
Proc. Interspeech 2020, 304-308, 2020