Title | Authors | Venue | Cited by | Year |
Online direction of arrival estimation based on deep learning | Q Li, X Zhang, H Li | 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 71 | 2018 |
A robust text-independent speaker verification method based on speech separation and deep speaker | F Zhao, H Li, X Zhang | ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 32 | 2019 |
Speakerfilter: Deep learning-based target speaker extraction using anchor speech | S He, H Li, X Zhang | ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 29 | 2020 |
Using optimal ratio mask as training target for supervised speech separation | S Xia, H Li, X Zhang | 2017 Asia-Pacific Signal and Information Processing Association Annual …, 2017 | 28 | 2017 |
DBNet: A dual-branch network architecture processing on spectrum and waveform for single-channel speech enhancement | K Zhang, S He, H Li, X Zhang | arXiv preprint arXiv:2105.02436, 2021 | 18 | 2021 |
Exploiting spectro-temporal structures using NMF for DNN-based supervised speech separation | S Nie, S Liang, H Li, XL Zhang, ZL Yang, WJ Liu, LK Dong | 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 16 | 2016 |
Jointly Optimizing Activation Coefficients of Convolutive NMF Using DNN for Speech Separation. | H Li, S Nie, X Zhang, H Zhang | Interspeech, 550-554, 2016 | 14 | 2016 |
Frame-Level Signal-to-Noise Ratio Estimation Using Deep Learning. | H Li, DL Wang, X Zhang, G Gao | Interspeech, 4626-4630, 2020 | 11 | 2020 |
Neural Multi-Channel and Multi-Microphone Acoustic Echo Cancellation | C Zhang, J Liu, H Li, X Zhang | IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 2181-2192, 2023 | 6 | 2023 |
Speakerfilter-pro: an improved target speaker extractor combines the time domain and frequency domain | S He, H Li, X Zhang | 2022 13th International Symposium on Chinese Spoken Language Processing …, 2022 | 6 | 2022 |
Recurrent neural networks and acoustic features for frame-level signal-to-noise ratio estimation | H Li, DL Wang, X Zhang, G Gao | IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2878-2887, 2021 | 6 | 2021 |
Integrated speech enhancement method based on weighted prediction error and DNN for dereverberation and denoising | H Li, X Zhang, H Zhang, G Gao | arXiv preprint arXiv:1708.08251, 2017 | 6 | 2017 |
Beamformed feature for learning-based dual-channel speech separation | H Li, X Zhang, G Gao | ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 4 | 2020 |
Robust speech dereverberation based on wpe and deep learning | H Li, X Zhang, G Gao | 2020 Asia-Pacific Signal and Information Processing Association Annual …, 2020 | 3 | 2020 |
Dynamic-attention based encoder-decoder model for speaker extraction with anchor speech | H Li, X Zhang, G Gao | 2019 Asia-Pacific Signal and Information Processing Association Annual …, 2019 | 3 | 2019 |
Improve Data Utilization with Two-stage Learning in CNN-LSTM-based Voice Activity Detection | T Xu, H Li, H Zhang, X Zhang | 2019 Asia-Pacific Signal and Information Processing Association Annual …, 2019 | 3 | 2019 |
3S-TSE: Efficient Three-Stage Target Speaker Extraction for Real-Time and Low-Resource Applications | S He, J Liu, H Li, Y Yang, F Chen, X Zhang | ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
A Dual-branch Convolutional Network Architecture Processing on both Frequency and Time Domain for Single-channel Speech Enhancement | K Zhang, S He, H Li, X Zhang | APSIPA Transactions on Signal and Information Processing 12 (3), 2023 | | 2023 |
RAT: RNN-Attention Transformer for Speech Enhancement | T Zhang, S He, H Li, X Zhang | 2022 13th International Symposium on Chinese Spoken Language Processing …, 2022 | | 2022 |
Guided Training: A Simple Method for Single-channel Speaker Separation | H Li, X Zhang, G Gao | arXiv preprint arXiv:2103.14330, 2021 | | 2021 |