Multi-objective learning and mask-based post-processing for deep neural network based speech enhancement Y Xu, J Du, Z Huang, LR Dai, CH Lee arXiv preprint arXiv:1703.07172, 2017 | 135 | 2017 |
Data-driven power outage detection by social sensors H Sun, Z Wang, J Wang, Z Huang, NL Carrington, J Liao IEEE Transactions on Smart Grid 7 (5), 2516-2524, 2016 | 99 | 2016 |
Rapid adaptation for deep neural networks through multi-task learning. Z Huang, J Li, SM Siniscalchi, IF Chen, J Wu, CH Lee Interspeech, 3625-3629, 2015 | 90 | 2015 |
An end-to-end deep learning approach to simultaneous speech dereverberation and acoustic modeling for robust speech recognition B Wu, K Li, F Ge, Z Huang, M Yang, SM Siniscalchi, CH Lee IEEE Journal of Selected Topics in Signal Processing 11 (8), 1289-1300, 2017 | 89 | 2017 |
DNN-based speech bandwidth expansion and its application to adding high-frequency missing features for automatic speech recognition of narrowband speech. K Li, Z Huang, Y Xu, CH Lee INTERSPEECH, 2578-2582, 2015 | 69 | 2015 |
A unified approach to transfer learning of deep neural networks with applications to speaker adaptation in automatic speech recognition Z Huang, SM Siniscalchi, CH Lee Neurocomputing 218, 448-459, 2016 | 68 | 2016 |
Maximum a Posteriori Adaptation of Network Parameters in Deep Models Z Huang, SM Siniscalchi, IF Chen, J Li, J Wu, CH Lee Proc. Interspeech, 2015 | 66 | 2015 |
A maximal figure-of-merit learning approach to maximizing mean average precision with deep neural network based classifiers K Li, Z Huang, YC Cheng, CH Lee 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 66 | 2014 |
SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognition Z Huang, T Ng, L Liu, H Mason, X Zhuang, D Liu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 47 | 2020 |
A blind segmentation approach to acoustic event detection based on i-vector. Z Huang, YC Cheng, K Li, V Hautamäki, CH Lee INTERSPEECH, 2282-2286, 2013 | 45 | 2013 |
Beyond cross-entropy: towards better frame-level objective functions for deep neural network training in automatic speech recognition. Z Huang, J Li, C Weng, CH Lee INTERSPEECH, 1214-1218, 2014 | 28 | 2014 |
Feature Space Maximum A Posteriori Linear Regression for Adaptation of Deep Neural Networks Z Huang, J Li, SM Siniscalchi, IF Chen, C Weng, CH Lee INTERSPEECH, 2992-2996, 2014 | 28 | 2014 |
Bayesian unsupervised batch and online speaker adaptation of activation function parameters in deep models for automatic speech recognition Z Huang, SM Siniscalchi, CH Lee IEEE/ACM Transactions on audio, speech, and language processing 25 (1), 64-75, 2016 | 22 | 2016 |
Deep learning vector quantization for acoustic information retrieval Z Huang, C Weng, K Li, YC Cheng, CH Lee 2014 IEEE international conference on acoustics, speech and signal …, 2014 | 22 | 2014 |
A transfer learning and progressive stacking approach to reducing deep model sizes with an application to speech enhancement S Wang, K Li, Z Huang, SM Siniscalchi, CH Lee 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017 | 15 | 2017 |
Hierarchical Bayesian combination of plug-in maximum a posteriori decoders in deep neural networks-based speech recognition and speaker adaptation Z Huang, SM Siniscalchi, CH Lee Pattern Recognition Letters 98, 1-7, 2017 | 12 | 2017 |
A reverberation-time-aware DNN approach leveraging spatial information for microphone array dereverberation B Wu, M Yang, K Li, Z Huang, SM Siniscalchi, T Wang, CH Lee EURASIP Journal on Advances in Signal Processing 2017, 1-13, 2017 | 11 | 2017 |
Acoustic model fusion for end-to-end speech recognition Z Lei, M Xu, S Han, L Liu, Z Huang, T Ng, Y Zhang, E Pusateri, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023 | 9 | 2023 |
Personalization of ctc-based end-to-end speech recognition using pronunciation-driven subword tokenization Z Lei, E Pusateri, S Han, L Liu, M Xu, T Ng, R Travadi, Y Zhang, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 8 | 2024 |
A unified deep modeling approach to simultaneous speech dereverberation and recognition for the REVERB challenge B Wu, K Li, Z Huang, SM Siniscalchi, M Yang, CH Lee 2017 Hands-free Speech Communications and Microphone Arrays (HSCMA), 36-40, 2017 | 8 | 2017 |