Follow
Rafael Valle
Rafael Valle
NVIDIA, UC Berkeley, CNMAT
Verified email at nvidia.com - Homepage
Title
Cited by
Cited by
Year
Waveglow: A flow-based generative network for speech synthesis
R Prenger, R Valle, B Catanzaro
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
12032019
Flowtron: an autoregressive flow-based generative network for text-to-speech synthesis
R Valle, K Shih, R Prenger, B Catanzaro
International Conference on Learning Representations 2021, 2020
1632020
Mellotron: Multispeaker expressive voice synthesis by conditioning on rhythm, pitch and global style tokens
R Valle, J Li, R Prenger, B Catanzaro
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1592020
Missing data imputation for supervised learning
J Poulos, R Valle
Applied Artificial Intelligence 32 (2), 186-196, 2018
832018
One TTS alignment to rule them all
R Badlani, A Łańcucki, KJ Shih, R Valle, W Ping, B Catanzaro
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
742022
Character-based handwritten text transcription with attention networks
J Poulos, R Valle
Neural Computing and Applications 33 (16), 10563-10573, 2021
502021
RAD-TTS: Parallel flow-based TTS with robust alignment learning and diverse synthesis
KJ Shih, R Valle, R Badlani, A Lancucki, W Ping, B Catanzaro
ICML Workshop on Invertible Neural Networks, Normalizing Flows, and Explicit …, 2021
502021
Machine improvisation with formal specifications
A Donzé, R Valle, I Akkaya, S Libkind, SA Seshia, D Wessel
Ann Arbor, MI: Michigan Publishing, University of Michigan Library, 2014
402014
Space: Speech-driven portrait animation with controllable expression
S Gururani, A Mallya, TC Wang, R Valle, MY Liu
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
272023
Attacking speaker recognition with deep generative models
W Cai, A Doshi, R Valle
arXiv preprint arXiv:1801.02384, 2018
252018
Control improvisation with probabilistic temporal specifications
I Akkaya, DJ Fremont, R Valle, A Donzé, EA Lee, SA Seshia
2016 IEEE First International Conference on Internet-of-Things Design and …, 2016
252016
Audio flamingo: A novel audio language model with few-shot learning and dialogue abilities
Z Kong, A Goel, R Badlani, W Ping, R Valle, B Catanzaro
arXiv preprint arXiv:2402.01831, 2024
142024
TequilaGAN: How to easily identify GAN samples
R Valle, W Cai, A Doshi
arXiv preprint arXiv:1807.04919, 2018
142018
Hands-On Generative Adversarial Networks with Keras: Your guide to implementing next-generation generative adversarial networks
R Valle
Packt Publishing Ltd, 2019
132019
ABROA: Audio Based Room Occupancy Analysis using Gaussian Mixtures and Hidden Markov Models
R Valle
Future Technologies Conference (FTC), 2016, 2016
132016
Any-to-Any Voice Conversion with F0 and Timbre Disentanglement and Novel Timbre Conditioning
S Kovela, R Valle, A Dantrey, B Catanzaro
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
82023
Neural odes for image segmentation with level sets
R Valle, F Reda, M Shoeybi, P Legresley, A Tao, B Catanzaro
arXiv preprint arXiv:1912.11683, 2019
82019
Specification mining for machine improvisation with formal specifications
R Valle, A Donzé, DJ Fremont, I Akkaya, SA Seshia, A Freed, D Wessel
Computers in Entertainment (CIE) 14 (3), 1-20, 2016
82016
RAD-MMM: Multilingual multiaccented multispeaker text to speech
R Badlani, R Valle, KJ Shih, JF Santos, S Gururani, B Catanzaro
Proc. Interspeech 2023, 626-630, 2023
62023
P-flow: a fast and data-efficient zero-shot TTS through speech prompting
S Kim, K Shih, JF Santos, E Bakhturina, M Desta, R Valle, S Yoon, ...
Advances in Neural Information Processing Systems 36, 2024
52024
The system can't perform the operation now. Try again later.
Articles 1–20