Ryuichi Yamamoto

Cited by

	All	Since 2019
Citations	2473	2381
h-index	14	14
i10-index	20	17

Citations per year
Year	Citations
2015	7
2016	16
2017	33
2018	25
2019	62
2020	288
2021	616
2022	602
2023	650
2024	140

Co-authors

Eunwoo Song	Voice, Naver Cloud	Verified email at navercorp.com
Tomoki Hayashi	Human Dataware Lab. Co., Ltd., Nagoya University	Verified email at g.sp.m.is.nagoya-u.ac.jp
Shinji Watanabe	Carnegie Mellon University	Verified email at cmu.edu
Takenori Yoshimura	Nagoya Institute of Technology	Verified email at nitech.ac.jp
Min-Jae Hwang	Meta AI	Verified email at meta.com
Tomoki Toda	Nagoya University	Verified email at icts.nagoya-u.ac.jp
Shigeki Karita	Google	Verified email at google.com
Hirofumi Inaguma	Fundamental AI Research (FAIR) at Meta	Verified email at meta.com
Brian McFee	Music and Performing Arts Professions / Center for Data Science, New York University	Verified email at nyu.edu
Takaaki Saeki	Google	Verified email at google.com

Ryuichi Yamamoto

LY Corporation

Verified email at lycorp.co.jp

Speech Synthesis, Voice Conversion, Speech Recognition, Machine Learning, Singing Voice Synthesis


Title	Authors	Venue	Cited by	Year
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram	R Yamamoto, E Song, JM Kim	ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	797	2020
A comparative study on transformer vs RNN in speech applications	S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ...	2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019	754	2019
librosa/librosa: 0.6.3	B McFee, M McVicar, S Balke, C Thomé, C Raffel, D Lee, O Nieto, ...	URL: https://doi.org/10.5281/zenodo.2564164, 2019	344*	2019
ESPnet-TTS: Unified, reproducible, and integratable open source end-to-end text-to-speech toolkit	T Hayashi, R Yamamoto, K Inoue, T Yoshimura, S Watanabe, T Toda, ...	ICASSP 2020-2020 IEEE international conference on acoustics, speech and …, 2020	216	2020
Probability density distillation with generative adversarial networks for high-quality parallel waveform generation	R Yamamoto, E Song, JM Kim	arXiv preprint arXiv:1904.04472, 2019	54	2019
ESPnet2-TTS: Extending the edge of TTS research	T Hayashi, R Yamamoto, T Yoshimura, P Wu, J Shi, T Saeki, Y Ju, ...	arXiv preprint arXiv:2110.07840, 2021	42	2021
TTS-by-TTS: TTS-driven data augmentation for fast and high-quality speech synthesis	MJ Hwang, R Yamamoto, E Song, JM Kim	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	36	2021
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators	R Yamamoto, E Song, MJ Hwang, JM Kim	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	20	2021
Improved Parallel WaveGAN vocoder with perceptually weighted spectrogram loss	E Song, R Yamamoto, MJ Hwang, JS Kim, O Kwon, JM Kim	2021 IEEE Spoken Language Technology Workshop (SLT), 470-476, 2021	20	2021
Semi-supervised speaker adaptation for end-to-end speech synthesis with pretrained models	K Inoue, S Hara, M Abe, T Hayashi, R Yamamoto, S Watanabe	ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	18	2020
Ryry: A real-time score-following automatic accompaniment playback system capable of real performances with errors, repeats and jumps	S Sako, R Yamamoto, T Kitamura	Active Media Technology: 10th International Conference, AMT 2014, Warsaw …, 2014	16	2014
Cross-speaker emotion transfer for low-resource text-to-speech using non-parallel voice conversion with pitch-shift data augmentation	R Terashima, R Yamamoto, E Song, Y Shirahata, HW Yoon, JM Kim, ...	arXiv preprint arXiv:2204.10020, 2022	15	2022
Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis	K Futamata, B Park, R Yamamoto, K Tachibana	arXiv preprint arXiv:2104.12395, 2021	15	2021
High-Fidelity Parallel WaveGAN with Multi-Band Harmonic-Plus-Noise Model	MJ Hwang, R Yamamoto, E Song, JM Kim	Interspeech, 2227-2231, 2021	14	2021
Improving LPCNet-based text-to-speech with linear prediction-structured mixture density network	MJ Hwang, E Song, R Yamamoto, F Soong, HG Kang	ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	14	2020
Score following handling performances with arbitrary repeats and skips and automatic accompaniment	E Nakamura, H Takeda, R Yamamoto, Y Saito, S Sako, S Sagayama	IPSJ Journal 54 (4), 1338-1349, 2013	14	2013
Language model-based emotion prediction methods for emotional speech synthesis systems	HW Yoon, O Kwon, H Lee, R Yamamoto, E Song, JM Kim, MJ Hwang	arXiv preprint arXiv:2206.15067, 2022	12	2022
Neural text-to-speech with a modeling-by-generation excitation vocoder	E Song, MJ Hwang, R Yamamoto, JS Kim, O Kwon, JM Kim	arXiv preprint arXiv:2008.00132, 2020	11	2020
WaveNet vocoder	R Yamamoto	WaveNet vocoder, 2018	10	2018
Robust on-line algorithm for real-time audio-to-score alignment based on a delayed decision and anticipation framework	R Yamamoto, S Sako, T Kitamura	2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013	10	2013
