Yusuke Fujita
Yusuke Fujita
LINE Corporation
Verified email at ieee.org
Title
Cited by
Cited by
Year
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings
S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ...
arXiv preprint arXiv:2004.09249, 2020
642020
End-to-end neural speaker diarization with permutation-free objectives
Y Fujita, N Kanda, S Horiguchi, K Nagamatsu, S Watanabe
Interspeech, 4300-4304, 2019
552019
End-to-end neural speaker diarization with self-attention
Y Fujita, N Kanda, S Horiguchi, Y Xue, K Nagamatsu, S Watanabe
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
522019
The Hitachi/JHU CHiME-5 system: Advances in speech recognition for everyday home environments using multiple microphone arrays
N Kanda, R Ikeshita, S Horiguchi, Y Fujita, K Nagamatsu, X Wang, ...
Proc. CHiME-5, 6-10, 2018
402018
Guided source separation meets a strong ASR backend: Hitachi/Paderborn University joint investigation for dinner party ASR
N Kanda, C Boeddeker, J Heitkaemper, Y Fujita, S Horiguchi, ...
arXiv preprint arXiv:1905.12230, 2019
272019
Investigation of lattice-free maximum mutual information-based acoustic models with sequence-level Kullback-Leibler divergence
N Kanda, Y Fujita, K Nagamatsu
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 69-76, 2017
272017
Acoustic modeling for distant multi-talker speech recognition with single-and multi-channel branches
N Kanda, Y Fujita, S Horiguchi, R Ikeshita, K Nagamatsu, S Watanabe
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
242019
Lattice-free State-level Minimum Bayes Risk Training of Acoustic Models.
N Kanda, Y Fujita, K Nagamatsu
Interspeech, 2923-2927, 2018
242018
End-to-end speaker diarization for an unknown number of speakers with encoder-decoder based attractors
S Horiguchi, Y Fujita, S Watanabe, Y Xue, K Nagamatsu
arXiv preprint arXiv:2005.09921, 2020
222020
Speaker diarization with region proposal network
Z Huang, S Watanabe, Y Fujita, P García, Y Shao, D Povey, S Khudanpur
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
222020
Unified ASR system using LGM-based source separation, noise-robust feature extraction, and word hypothesis selection
Y Fujita, R Takashima, T Homma, R Ikeshita, Y Kawaguchi, T Sumiyoshi, ...
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
162015
Neural speaker diarization with speaker-wise chain rule
Y Fujita, S Watanabe, S Horiguchi, Y Xue, J Shi, K Nagamatsu
arXiv preprint arXiv:2006.01796, 2020
142020
End-to-end neural diarization: Reformulating speaker diarization as simple multi-label classification
Y Fujita, S Watanabe, S Horiguchi, Y Xue, K Nagamatsu
arXiv preprint arXiv:2003.02966, 2020
142020
Simultaneous speech recognition and speaker diarization for monaural dialogue recordings with target-speaker acoustic models
N Kanda, S Horiguchi, Y Fujita, Y Xue, K Nagamatsu, S Watanabe
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 31-38, 2019
142019
Acoustic modeling for overlapping speech recognition: JHU CHiME-5 challenge system
V Manohar, SJ Chen, Z Wang, Y Fujita, S Watanabe, S Khudanpur
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
142019
Auxiliary interference speaker loss for target-speaker speech recognition
N Kanda, S Horiguchi, R Takashima, Y Fujita, K Nagamatsu, S Watanabe
arXiv preprint arXiv:1906.10876, 2019
122019
Speech synthesizer
Y Fujita, R Kamoshida, K Nagamatsu
US Patent 7,991,616, 2011
122011
Online end-to-end neural diarization with speaker-tracing buffer
Y Xue, S Horiguchi, Y Fujita, S Watanabe, P García, K Nagamatsu
2021 IEEE Spoken Language Technology Workshop (SLT), 841-848, 2021
92021
Sequence distillation for purely sequence trained acoustic models
N Kanda, Y Fujita, K Nagamatsu
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
82018
Sequence to multi-sequence learning via conditional chain mapping for mixture signals
J Shi, X Chang, P Guo, S Watanabe, Y Fujita, J Xu, B Xu, L Xie
arXiv preprint arXiv:2006.14150, 2020
72020
The system can't perform the operation now. Try again later.
Articles 1–20