Shigeki Karita

Cited by

	All	Since 2019
Citations	3226	3169
h-index	19	18
i10-index	23	23

Citations per year

Year	Citations
2017	10
2018	38
2019	186
2020	441
2021	774
2022	739
2023	850
2024	176

Co-authors

Shinji Watanabe	Carnegie Mellon University	Verified email at cmu.edu
Tomohiro Nakatani	NTT Communication Science Laboratories	Verified email at ieee.org
Marc Delcroix	NTT Communication Science Laboratories	Verified email at ieee.org
Tomoki Hayashi	Human Dataware Lab. Co., Ltd., Nagoya University	Verified email at g.sp.m.is.nagoya-u.ac.jp
Atsunori Ogawa	NTT Communication Science Laboratories	Verified email at ieee.org
Takaaki Hori	Apple	Verified email at apple.com
Hirofumi Inaguma	Fundamental AI Research (FAIR) at Meta	Verified email at meta.com
Nanxin Chen	Senior Research Scientist, Google DeepMind	Verified email at google.com
Jiro Nishitoba	Retrieva, Inc.	Verified email at retrieva.jp
Michiel Bacchiani	Google Inc.	Verified email at google.com
Wangyou Zhang	Ph.D. candidate, Department of Computer Science and Engineering, Shanghai Jiao Tong University	Verified email at sjtu.edu.cn
Jahn Heymann	Applied Scientist @ Amazon	Verified email at amazon.com
Ryuichi Yamamoto	LY Corporation	Verified email at lycorp.co.jp
Xiaofei Wang	Microsoft	Verified email at jhu.edu
Ziyan Jiang	Amazon AGI	Verified email at amazon.com
Keisuke Kinoshita	Research Scientist at Google	Verified email at ieee.org
Tomoharu Iwata	NTT	Verified email at hco.ntt.co.jp
Nobutaka Ito	University of Tokyo, Japan (formerly NTT)	Verified email at k.u-tokyo.ac.jp
Takuya Higuchi	Apple	Verified email at apple.com
Kevin Duh	Johns Hopkins University	Verified email at cs.jhu.edu

Shigeki Karita

Google

Verified email at google.com

Machine Learning, Speech Recognition


Title	Cited by	Year
ESPnet: End-to-end speech processing toolkit. S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ... arXiv preprint arXiv:1804.00015, 2018	1455	2018
A comparative study on Transformer vs RNN in speech applications. S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ... 2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019	755	2019
Improving Transformer-based End-to-End Speech Recognition with Connectionist Temporal Classification and Language Model Integration. S Karita, NEY Soplin, S Watanabe, M Delcroix, A Ogawa, T Nakatani. Proc. Interspeech 2019, 1408-1412, 2019	233	2019
ESPnet-ST: All-in-one speech translation toolkit. H Inaguma, S Kiyono, K Duh, S Karita, NEY Soplin, T Hayashi, ... arXiv preprint arXiv:2004.10234, 2020	156	2020
Semi-Supervised End-to-End Speech Recognition. S Karita, S Watanabe, T Iwata, A Ogawa, M Delcroix. INTERSPEECH, 2-6, 2018	76	2018
Frame-by-frame closed-form update for mask-based adaptive MVDR beamforming. T Higuchi, K Kinoshita, N Ito, S Karita, T Nakatani. IEEE International Conference on Acoustics, Speech and Signal Processing, 2018	63	2018
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans. S Watanabe, F Boyer, X Chang, P Guo, T Hayashi, Y Higuchi, T Hori, ... 2021 IEEE Data Science and Learning Workshop (DSLW), 1-6, 2021	50	2021
Semi-Supervised End-to-End Speech Recognition Using Text-to-Speech and Autoencoders. S Karita, S Watanabe, T Iwata, M Delcroix, A Ogawa, T Nakatani. IEEE International Conference on Acoustics, Speech, and Signal Processing, 2019	50	2019
Auxiliary feature based adaptation of end-to-end ASR systems. M Delcroix, S Watanabe, A Ogawa, S Karita, T Nakatani. INTERSPEECH, 2018	45	2018
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement. Y Koizumi, S Karita, S Wisdom, H Erdogan, JR Hershey, L Jones, ... 2021 IEEE Workshop on Applications of Signal Processing to Audio and …, 2021	44	2021
Far-field speech recognition using CNN-DNN-HMM with convolution in time. T Yoshioka, S Karita, T Nakatani. 2015 IEEE international conference on acoustics, speech and signal …, 2015	39	2015
Rescoring n-best speech recognition list based on one-on-one hypothesis comparison using encoder-classifier model. A Ogawa, M Delcroix, S Karita, T Nakatani. IEEE International Conference on Acoustics, Speech and Signal Processing, 2018	27	2018
Sequence training of encoder-decoder model using policy gradient for end-to-end speech recognition. S Karita, A Ogawa, M Delcroix, T Nakatani. IEEE International Conference on Acoustics, Speech and Signal Processing, 2018	26	2018
Knowledge transfer from large-scale pretrained language models to end-to-end speech recognizers. Y Kubo, S Karita, M Bacchiani. ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	23	2022
Self-Distillation for Improving CTC-Transformer-Based ASR Systems. T Moriya, T Ochiai, S Karita, H Sato, T Tanaka, T Ashihara, R Masumura, ... INTERSPEECH, 546-550, 2020	23	2020
End-to-End SpeakerBeam for Single Channel Target Speech Recognition. M Delcroix, S Watanabe, T Ochiai, K Kinoshita, S Karita, A Ogawa, ... Interspeech, 451-455, 2019	21	2019
Online meeting recognition in noisy environments with time-frequency mask based MVDR beamforming. S Araki, N Ito, M Delcroix, A Ogawa, K Kinoshita, T Higuchi, T Yoshioka, ... 2017 Hands-free Speech Communications and Microphone Arrays (HSCMA), 16-20, 2017	20	2017
Learning device, learning method, and learning program. A Ogawa, M Delcroix, S Karita, T Nakatani. US Patent App. 16/966,056, 2020	19	2020
ESPnet: End-to-end speech processing toolkit. arXiv 2018. S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ... arXiv preprint arXiv:1804.00015, 2018	19	2018
Unsupervised learning of disentangled speech content and style representation. A Tjandra, R Pang, Y Zhang, S Karita. arXiv preprint arXiv:2010.12973, 2020	14	2020
