Kevin Wilson
Verified email at google.com
Title
Cited by
Year
CNN architectures for large-scale audio classification
S Hershey, S Chaudhuri, DPW Ellis, JF Gemmeke, A Jansen, RC Moore, ...
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
Cited by 1190 (2017)
Learning the speech front-end with raw waveform CLDNNs
TN Sainath, RJ Weiss, A Senior, KW Wilson, O Vinyals
Sixteenth Annual Conference of the International Speech Communication …, 2015
Cited by 470 (2015)
Looking to listen at the cocktail party: A speaker-independent audio-visual model for speech separation
A Ephrat, I Mosseri, O Lang, T Dekel, K Wilson, A Hassidim, WT Freeman, ...
arXiv preprint arXiv:1804.03619, 2018
Cited by 352 (2018)
Speech denoising using nonnegative matrix factorization with priors
KW Wilson, B Raj, P Smaragdis, A Divakaran
2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008
Cited by 309 (2008)
Speech acoustic modeling from raw multichannel waveforms
Y Hoshen, RJ Weiss, KW Wilson
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
Cited by 216 (2015)
Multichannel signal processing with deep neural networks for automatic speech recognition
TN Sainath, RJ Weiss, KW Wilson, B Li, A Narayanan, E Variani, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (5), 965-979, 2017
Cited by 157 (2017)
Voicefilter: Targeted voice separation by speaker-conditioned spectrogram masking
Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ...
arXiv preprint arXiv:1810.04826, 2018
Cited by 155 (2018)
Acoustic Modeling for Google Home.
B Li, TN Sainath, A Narayanan, J Caroselli, M Bacchiani, A Misra, ...
Interspeech, 399-403, 2017
Cited by 133 (2017)
Regularized non-negative matrix factorization with temporal dependencies for speech denoising
KW Wilson, B Raj, P Smaragdis
Ninth Annual Conference of the International Speech Communication Association, 2008
Cited by 115 (2008)
Low latency video storyboard delivery with selectable resolution levels
NO Krahnstoever, KW Wilson
US Patent App. 13/785,913, 2014
Cited by 105 (2014)
Multiple person and speaker activity tracking with a particle filter
N Checka, KW Wilson, MR Siracusa, T Darrell
2004 IEEE International Conference on Acoustics, Speech, and Signal …, 2004
Cited by 98 (2004)
Visual speech recognition with loosely synchronized feature streams
K Saenko, K Livescu, M Siracusa, K Wilson, J Glass, T Darrell
Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 2 …, 2005
Cited by 97 (2005)
Processing multi-channel audio waveforms
TN Sainath, RJ Weiss, KW Wilson, AW Senior, A Narayanan, Y Hoshen, ...
US Patent 9,697,826, 2017
Cited by 95 (2017)
Neural network adaptive beamforming for robust multichannel speech recognition
B Li, TN Sainath, RJ Weiss, KW Wilson, M Bacchiani
Cited by 95 (2016)
Universal sound separation
I Kavalerov, S Wisdom, H Erdogan, B Patton, K Wilson, J Le Roux, ...
2019 IEEE Workshop on Applications of Signal Processing to Audio and …, 2019
Cited by 75 (2019)
Speaker location and microphone spacing invariant acoustic modeling from raw multichannel waveforms
TN Sainath, RJ Weiss, KW Wilson, A Narayanan, M Bacchiani, A Senior
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
Cited by 75 (2015)
Indexing a recording of audiovisual content to enable rich navigation
SW Fu, B Keating, S Vedula, K Wilson, S Ahmad
US Patent App. 11/282,318, 2007
Cited by 73 (2007)
Factored spatial and spectral multichannel raw waveform CLDNNs
TN Sainath, RJ Weiss, KW Wilson, A Narayanan, M Bacchiani
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
Cited by 69 (2016)
Learning a precedence effect-like weighting function for the generalized cross-correlation framework
KW Wilson, T Darrell
IEEE Transactions on Audio, Speech, and Language Processing 14 (6), 2156-2164, 2006
Cited by 48 (2006)
A probabilistic framework for multi-modal multi-person tracking
N Checka, K Wilson, V Rangarajan, T Darrell
2003 Conference on Computer Vision and Pattern Recognition Workshop 9, 100-100, 2003
Cited by 45 (2003)