Ryuichi Yamamoto
Ryuichi Yamamoto
LINE Corp.
Verified email at linecorp.com - Homepage
Title
Cited by
Cited by
Year
A comparative study on transformer vs rnn in speech applications
S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ...
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
2092019
librosa/librosa: 0.6. 3
B McFee, M McVicar, S Balke, C Thomé, C Raffel, D Lee, O Nieto, ...
URL: https://doi. org/10.5281/zenodo 2564164, 2019
131*2019
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram
R Yamamoto, E Song, JM Kim
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1252020
ESPnet-TTS: Unified, reproducible, and integratable open source end-to-end text-to-speech toolkit
T Hayashi, R Yamamoto, K Inoue, T Yoshimura, S Watanabe, T Toda, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
532020
Probability density distillation with generative adversarial networks for high-quality parallel waveform generation
R Yamamoto, E Song, JM Kim
arXiv preprint arXiv:1904.04472, 2019
212019
Ryry: A real-time score-following automatic accompaniment playback system capable of real performances with errors, repeats and jumps
S Sako, R Yamamoto, T Kitamura
International Conference on Active Media Technology, 134-145, 2014
112014
Score following handling performances with arbitrary repeats and skips and automatic accompaniment
E Nakamura, H Takeda, R Yamamoto, Y Saito, S Sako, S Sagayama
IPSJ Journal 54 (4), 1338-1349, 2013
112013
Wavenet vocoder
R Yamamoto
82018
Robust on-line algorithm for real-time audio-to-score alignment based on a delayed decision and anticipation framework
R Yamamoto, S Sako, T Kitamura
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
82013
Improving lpcnet-based text-to-speech with linear prediction-structured mixture density network
MJ Hwang, E Song, R Yamamoto, F Soong, HG Kang
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
52020
Semi-supervised speaker adaptation for end-to-end speech synthesis with pretrained models
K Inoue, S Hara, M Abe, T Hayashi, R Yamamoto, S Watanabe
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
32020
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators
R Yamamoto, E Song, MJ Hwang, JM Kim
arXiv preprint arXiv:2010.14151, 2020
22020
Real-time audio to score alignment using segmental conditional random fields and linear dynamical system
R Yamamoto, S Sako, T Kitarmura
Proceedings of the International Society for Music Information Retrieval …, 2012
22012
IEICE Technical Report
K Endo, K Nishikawa, H Kiya, K Slavakis, I Yamada, N Ogura, MT Akhtar, ...
Radio Communication 115 (113), 2008
22008
Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss
E Song, R Yamamoto, MJ Hwang, JS Kim, O Kwon, JM Kim
2021 IEEE Spoken Language Technology Workshop (SLT), 470-476, 2021
12021
Neural text-to-speech with a modeling-by-generation excitation vocoder
E Song, MJ Hwang, R Yamamoto, JS Kim, O Kwon, JM Kim
arXiv preprint arXiv:2008.00132, 2020
12020
TTS-by-TTS: TTS-driven Data Augmentation for Fast and High-Quality Speech Synthesis
MJ Hwang, R Yamamoto, E Song, JM Kim
arXiv preprint arXiv:2010.13421, 2020
2020
Robust On-line Algorithm For Real-time Audio-to-score
R Yamamoto, S Sako, T Kitamura
Journal, vol 65 (2-3), 389-409, 2006
2006
The system can't perform the operation now. Try again later.
Articles 1–18