Qiushi Zhu
Title
Cited by
Year
A noise-robust self-supervised pre-training model based speech representation learning for automatic speech recognition
QS Zhu, J Zhang, ZQ Zhang, MH Wu, X Fang, LR Dai
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by 32, 2022
A Joint Speech Enhancement and Self-Supervised Representation Learning Framework for Noise-Robust Speech Recognition
QS Zhu, J Zhang, ZQ Zhang, LR Dai
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
Cited by 19*, 2023
Robust data2vec: Noise-robust speech representation learning for asr by combining regression and improved contrastive learning
QS Zhu, L Zhou, J Zhang, SJ Liu, YC Hu, LR Dai
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by 15, 2023
Gradient remedy for multi-task learning in end-to-end noise-robust speech recognition
Y Hu, C Chen, R Li, Q Zhu, ES Chng
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by 12, 2023
Supervised and self-supervised pretraining based COVID-19 detection using acoustic breathing/cough/speech signals
XY Chen*, QS Zhu*, J Zhang, LR Dai
*:Equal Contribution; ICASSP 2022-2022 IEEE International Conference on …, 2022
Cited by 12, 2022
VatLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning
Q Zhu, L Zhou, Z Zhang, S Liu, B Jiao, J Zhang, L Dai, D Jiang, J Li, F Wei
IEEE Transactions on Multimedia, 2023
Cited by 10, 2023
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR
Y Hu, C Chen, Q Zhu, ES Chng
arXiv preprint arXiv:2304.04974, 2023
Cited by 5, 2023
BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions
J Zhang, QT Xu, QS Zhu, ZH Ling
Interspeech 2023, 2023
Cited by 3, 2023
Speech Enhancement Using Self-Supervised Pre-Trained Model and Vector Quantization
XY Zhao, QS Zhu, J Zhang
2022 Asia-Pacific Signal and Information Processing Association Annual …, 2022
Cited by 3, 2022
An Improved Wav2Vec 2.0 Pre-Training Approach Using Enhanced Local Dependency Modeling for Speech Recognition.
Q Zhu, J Zhang, M Wu, X Fang, LR Dai
Interspeech, 4334-4338, 2021
Cited by 3, 2021
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition
Y Hu, R Li, C Chen, H Zou, Q Zhu, ES Chng
IJCAI 2023, 2023
Cited by 2, 2023
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
Y Hu, C Chen, R Li, Q Zhu, ES Chng
arXiv preprint arXiv:2307.08029, 2023
Cited by 1, 2023
A Complementary Joint Training Approach Using Unpaired Speech and Text
Y Du, J Zhang, Q Zhu, L Dai, MH Wu, X Fang, ZW Yang
Proc. Interspeech 2022, 2613-2617, 2022
Cited by 1, 2022
Rep2wav: Noise Robust text-to-speech Using self-supervised representations
Q Zhu, Y Gu, C Weng, Y Hu, L Dai, J Zhang
arXiv preprint arXiv:2308.14553, 2023
2023
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition
Y Hu, R Li, C Chen, C Qin, Q Zhu, ES Chng
ACL 2023, 2023
2023
Eeg2vec: Self-Supervised Electroencephalographic Representation Learning
Q Zhu, X Zhao, J Zhang, Y Gu, C Weng, Y Hu
arXiv preprint arXiv:2305.13957, 2023
2023
Speech Enhancement with Multi-granularity Vector Quantization
XY Zhao, QS Zhu, J Zhang
arXiv preprint arXiv:2302.08342, 2023
2023