Fullsubnet: A full-band and sub-band fusion model for real-time single-channel speech enhancement X Hao, X Su, R Horaud, X Li ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 220 | 2021 |
Audio-visual speaker diarization based on spatiotemporal bayesian fusion ID Gebru, S Ba, X Li, R Horaud IEEE transactions on pattern analysis and machine intelligence 40 (5), 1086-1099, 2017 | 125 | 2017 |
Improved bare PCB defect detection approach based on deep feature learning C Zhang, W Shi, X Li, H Zhang, H Liu The Journal of Engineering 2018 (16), 1415-1420, 2018 | 83 | 2018 |
Estimation of the direct-path relative transfer function for supervised sound-source localization X Li, L Girin, R Horaud, S Gannot IEEE/ACM Transactions on Audio, Speech, and Language Processing 24 (11 …, 2016 | 73 | 2016 |
A novel lip descriptor for audio-visual keyword spotting based on adaptive decision fusion P Wu, H Liu, X Li, T Fan, X Zhang IEEE Transactions on Multimedia 18 (3), 326-338, 2016 | 66 | 2016 |
Multiple-speaker localization based on direct-path features and likelihood maximization with spatial sparsity regularization X Li, L Girin, R Horaud, S Gannot IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (10 …, 2017 | 62 | 2017 |
Multichannel speech enhancement based on time-frequency masking using subband long short-term memory X Li, R Horaud 2019 IEEE Workshop on Applications of Signal Processing to Audio and …, 2019 | 53 | 2019 |
Multitask learning of time-frequency CNN for sound source localization C Pang, H Liu, X Li IEEE Access 7, 40725-40737, 2019 | 49 | 2019 |
Online localization and tracking of multiple moving speakers in reverberant environments X Li, Y Ban, L Girin, X Alameda-Pineda, R Horaud IEEE Journal of Selected Topics in Signal Processing 13 (1), 88-103, 2019 | 44 | 2019 |
Binaural sound localization based on reverberation weighting and generalized parametric mapping C Pang, H Liu, J Zhang, X Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (8), 1618 …, 2017 | 38 | 2017 |
Reverberant sound localization with a robot head based on direct-path relative transfer function X Li, L Girin, F Badeig, R Horaud 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2016 | 38 | 2016 |
Estimation of relative transfer function in the presence of stationary noise based on segmental power spectral density matrix subtraction X Li, L Girin, R Horaud, S Gannot 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 37 | 2015 |
Sound source localization for HRI using FOC-based time difference feature and spatial grid matching X Li, H Liu IEEE transactions on cybernetics 43 (4), 1199-1212, 2012 | 36 | 2012 |
SpatialNet: Extensively learning spatial information for multichannel joint speech separation, denoising and dereverberation C Quan, X Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 32, 1310-1323, 2024 | 29 | 2024 |
Roadmap toward the metaverse: An AI perspective S Cheng, Y Zhang, X Li, L Yang, X Yuan, SZ Li The Innovation 3 (5), 2022 | 29 | 2022 |
Online monaural speech enhancement using delayed subband LSTM X Li, R Horaud arXiv preprint arXiv:2005.05037, 2020 | 29 | 2020 |
Multichannel speech separation and enhancement using the convolutive transfer function X Li, L Girin, S Gannot, R Horaud IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (3), 645-659, 2019 | 29 | 2019 |
Sub-band knowledge distillation framework for speech enhancement X Hao, S Wen, X Su, Y Liu, G Gao, X Li arXiv preprint arXiv:2005.14435, 2020 | 26 | 2020 |
A survey of sound source localization for robot audition X Li, H Liu Zhineng Xitong Xuebao(CAAI Transactions on Intelligent Systems) 7 (1), 9-20, 2013 | 26* | 2013 |
Enhancing direct‐path relative transfer function using deep neural network for robust sound source localization B Yang, R Ding, Y Ban, X Li, H Liu CAAI Transactions on Intelligence Technology 7 (3), 446-454, 2022 | 25 | 2022 |