Audio Captioning Based on Transformer and Pre-Trained CNN. K Chen, Y Wu, Z Wang, X Zhang, F Nian, S Li, X Shao DCASE, 21-25, 2020 | 54 | 2020 |
An encoder-decoder based audio captioning system with transfer and reinforcement learning X Mei, Q Huang, X Liu, G Chen, J Wu, Y Wu, J Zhao, S Li, T Ko, HL Tang, ... arXiv preprint arXiv:2108.02752, 2021 | 52 | 2021 |
Self-attention mechanism based system for DCASE2018 challenge task1 and task4 J Wang, S Li Proc. DCASE Challenge, 1-5, 2018 | 52 | 2018 |
Computer audition for healthcare: Opportunities and challenges K Qian, X Li, H Li, S Li, W Li, Z Ning, S Yu, L Hou, G Tang, J Lu, F Li, ... Frontiers in Digital Health 2, 5, 2020 | 35 | 2020 |
Competitive Business Model in Audio-book Industry: A Case of China. D Liu, S Li, T Yang J. Softw. 7 (1), 33-40, 2012 | 25 | 2012 |
Multi-level attention model with deep scattering spectrum for acoustic scene classification Z Li, Y Hou, X Xie, S Li, L Zhang, S Du, W Liu 2019 IEEE International Conference on Multimedia & Expo Workshops (ICMEW …, 2019 | 21 | 2019 |
Polyphonic audio tagging with sequentially labelled data using CRNN with learnable gated linear units Y Hou, Q Kong, J Wang, S Li arXiv preprint arXiv:1811.07072, 2018 | 19 | 2018 |
An encoder-decoder based audio captioning system with transfer and reinforcement learning for DCASE challenge 2021 task 6 X Mei, Q Huang, X Liu, G Chen, J Wu, Y Wu, J Zhao, S Li, T Ko, HL Tang, ... DCASE2021 Challenge, Tech. Rep., 2021 | 17 | 2021 |
DCASE 2019 challenge task1 technical report H Zhu, C Ren, J Wang, S Li, L Wang, L Yang Samsung Research China-Beijing, Beijing University of Posts and …, 2019 | 17 | 2019 |
Visually-aware audio captioning with adaptive audio-visual attention X Liu, Q Huang, X Mei, H Liu, Q Kong, J Sun, S Li, T Ko, Y Zhang, ... arXiv preprint arXiv:2210.16428, 2022 | 15 | 2022 |
Audio captioning based on transformer and pre-training for 2020 DCASE audio captioning challenge Y Wu, K Chen, Z Wang, X Zhang, F Nian, S Li, X Shao DCASE2020 Challenge, Tech. Rep., 2020 | 15 | 2020 |
Sound event detection with sequentially labelled data based on connectionist temporal classification and unsupervised clustering Y Hou, Q Kong, S Li, MD Plumbley ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 14 | 2019 |
Transfer learning for improving singing-voice detection in polyphonic instrumental music Y Hou, FK Soong, J Luan, S Li arXiv preprint arXiv:2008.04658, 2020 | 12 | 2020 |
Bird sound detection based on binarized convolutional neural networks J Song, S Li Proceedings of the 6th Conference on Sound and Music Technology (CSMT …, 2019 | 11 | 2019 |
Acoustic scene classification across cities and devices via feature disentanglement Y Tan, H Ai, S Li, MD Plumbley IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 9 | 2024 |
Sound event detection in real life audio using multimodel system Y Hou, S Li Proceedings of the Detection and Classification of Acoustic Scenes and Events, 2017 | 9 | 2017 |
DCASE2023 task1 submission: Device simulation and time-frequency separable convolution for acoustic scene classification Y Cai, M Lin, C Zhu, S Li, X Shao Tech. Rep., Detection and Classification of Acoustic Scenes and Events …, 2023 | 8 | 2023 |
Peking opera synthesis via duration informed attention network Y Wu, S Li, C Yu, H Lu, C Weng, L Zhang, D Yu arXiv preprint arXiv:2008.03029, 2020 | 8 | 2020 |
Singing voice detection using multi-feature deep fusion with CNN X Zhang, S Li, Z Li, S Chen, Y Gao, W Li Proceedings of the 7th Conference on Sound and Music Technology (CSMT …, 2020 | 8 | 2020 |
Audio tagging with connectionist temporal classification model using sequentially labelled data Y Hou, Q Kong, S Li Communications, Signal Processing, and Systems: Proceedings of the 2018 CSPS …, 2020 | 8 | 2020 |