Follow
Takuma Okamoto
Title
Cited by
Cited by
Year
Sound-space recording and binaural presentation system based on a 252-channel microphone array
S Sakamoto, S Hongo, T Okamoto, Y Iwaya, Y Suzuki
Acoustical Science and technology 36 (6), 516-526, 2015
402015
Real-Time Neural Text-to-Speech with Sequence-to-Sequence Acoustic Model and WaveGlow or Single Gaussian WaveRNN Vocoders.
T Okamoto, T Toda, Y Shiga, H Kawai
INTERSPEECH, 1308-1312, 2019
322019
An investigation of subband WaveNet vocoder covering entire audible frequency range with limited acoustic features
T Okamoto, K Tachibana, T Toda, Y Shiga, H Kawai
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
322018
Experimental validation of spatial Fourier transform-based multiple sound zone generation with a linear loudspeaker array
T Okamoto, A Sakaguchi
The Journal of the Acoustical Society of America 141 (3), 1769-1780, 2017
312017
High order Ambisonic decoding method for irregular loudspeaker arrays
J Trevino, T Okamoto, Y Iwaya, Y Suzuki
Proceedings of 20th International Congress on Acoustics, 23-27, 2010
312010
Tacotron-based acoustic model using phoneme alignment for practical neural text-to-speech systems
T Okamoto, T Toda, Y Shiga, H Kawai
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
272019
Estimation of sound source positions using a surrounding microphone array
T Okamoto, R Nishimura, Y Iwaya
Acoustical science and technology 28 (3), 181-189, 2007
272007
Quasi-periodic parallel WaveGAN: A non-autoregressive raw waveform generative model with pitch-dependent dilated convolution neural network
YC Wu, T Hayashi, T Okamoto, H Kawai, T Toda
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 792-806, 2021
262021
3D spatial sound systems compatible with human's active listening to realize rich high-level kansei information
Y Suzuki, T Okamoto, J Trevino, ZL Cui, Y Iwaya, S Sakamoto, M Otani
Interdisciplinary information sciences 18 (2), 71-82, 2012
242012
Generation of multiple sound zones by spatial filtering in wavenumber domain using a linear array of loudspeakers
T Okamoto
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
222014
Text-to-speech synthesis
Y Shiga, J Ni, K Tachibana, T Okamoto
Speech-to-Speech Translation, 39-52, 2020
212020
Improving FFTNet vocoder with noise shaping and subband approaches
T Okamoto, T Toda, Y Shiga, H Kawai
2018 IEEE Spoken Language Technology Workshop (SLT), 304-311, 2018
212018
Subband WaveNet with overlapped single-sideband filterbanks
T Okamoto, K Tachibana, T Toda, Y Shiga, H Kawai
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
212017
Analytical methods of generating multiple sound zones for open and baffled circular loudspeaker arrays
T Okamoto
2015 IEEE Workshop on Applications of Signal Processing to Audio and …, 2015
202015
2.5 D higher order ambisonics for a sound field described by angular spectrum coefficients
T Okamoto
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
192016
Transformer-based text-to-speech with weighted forced attention
T Okamoto, T Toda, Y Shiga, H Kawai
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
182020
Analytical approach to 2.5 D sound field control using a circular double-layer array of fixed-directivity loudspeakers
T Okamoto
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
162017
Least squares approach in wavenumber domain for sound field recording and reproduction using multiple parallel linear arrays
T Okamoto, S Enomoto, R Nishimura
Applied acoustics 86, 95-103, 2014
152014
Implementation of a high-definition 3D audio-visual display based on higher-order Ambisonics using a 157-loudspeaker array combined with a 3D projection display
T Okamoto, ZL Cui, Y Iwaya, Y Suzuki
2010 2nd IEEE InternationalConference on Network Infrastructure and Digital …, 2010
152010
Multi-stream HiFi-GAN with data-driven waveform decomposition
T Okamoto, T Toda, H Kawai
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
142021
The system can't perform the operation now. Try again later.
Articles 1–20