Rafael Valle

Cited by

	All	Since 2019
Citations	2011	1954
h-index	13	13
i10-index	15	15

480

240

120

360

20162017201820192020202120222023202411 4 31 129 274 395 440 468 234

Public access

View all

4 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Sanjit A. SeshiaProfessor of EECS, University of California, BerkeleyVerified email at eecs.berkeley.edu
Ilge AkkayaOpenAI, UC Berkeley EECSVerified email at openai.com
Jason PoulosHarvard Medical SchoolVerified email at hcp.med.harvard.edu
Daniel J. FremontAssistant Professor, University of California, Santa CruzVerified email at ucsc.edu
Adrian FreedUC BerkeleyVerified email at adrianfreed.com
Edward A. LEEProfessor of Electrical Engineering and Computer Sciences, University of California at BerkeleyVerified email at berkeley.edu

Rafael Valle

NVIDIA, UC Berkeley, CNMAT

Verified email at nvidia.com - Homepage

Machine Listening and Improvisation


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Waveglow: A flow-based generative network for speech synthesis R Prenger, R Valle, B Catanzaro ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	1203	2019
Flowtron: an autoregressive flow-based generative network for text-to-speech synthesis R Valle, K Shih, R Prenger, B Catanzaro International Conference on Learning Representations 2021, 2020	163	2020
Mellotron: Multispeaker expressive voice synthesis by conditioning on rhythm, pitch and global style tokens R Valle, J Li, R Prenger, B Catanzaro ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	159	2020
Missing data imputation for supervised learning J Poulos, R Valle Applied Artificial Intelligence 32 (2), 186-196, 2018	83	2018
One TTS alignment to rule them all R Badlani, A Łańcucki, KJ Shih, R Valle, W Ping, B Catanzaro ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	74	2022
Character-based handwritten text transcription with attention networks J Poulos, R Valle Neural Computing and Applications 33 (16), 10563-10573, 2021	50	2021
RAD-TTS: Parallel flow-based TTS with robust alignment learning and diverse synthesis KJ Shih, R Valle, R Badlani, A Lancucki, W Ping, B Catanzaro ICML Workshop on Invertible Neural Networks, Normalizing Flows, and Explicit …, 2021	50	2021
Machine improvisation with formal specifications A Donzé, R Valle, I Akkaya, S Libkind, SA Seshia, D Wessel Ann Arbor, MI: Michigan Publishing, University of Michigan Library, 2014	40	2014
Space: Speech-driven portrait animation with controllable expression S Gururani, A Mallya, TC Wang, R Valle, MY Liu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	27	2023
Attacking speaker recognition with deep generative models W Cai, A Doshi, R Valle arXiv preprint arXiv:1801.02384, 2018	25	2018
Control improvisation with probabilistic temporal specifications I Akkaya, DJ Fremont, R Valle, A Donzé, EA Lee, SA Seshia 2016 IEEE First International Conference on Internet-of-Things Design and …, 2016	25	2016
Audio flamingo: A novel audio language model with few-shot learning and dialogue abilities Z Kong, A Goel, R Badlani, W Ping, R Valle, B Catanzaro arXiv preprint arXiv:2402.01831, 2024	14	2024
TequilaGAN: How to easily identify GAN samples R Valle, W Cai, A Doshi arXiv preprint arXiv:1807.04919, 2018	14	2018
Hands-On Generative Adversarial Networks with Keras: Your guide to implementing next-generation generative adversarial networks R Valle Packt Publishing Ltd, 2019	13	2019
ABROA: Audio Based Room Occupancy Analysis using Gaussian Mixtures and Hidden Markov Models R Valle Future Technologies Conference (FTC), 2016, 2016	13	2016
Any-to-Any Voice Conversion with F₀ and Timbre Disentanglement and Novel Timbre Conditioning S Kovela, R Valle, A Dantrey, B Catanzaro ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	8	2023
Neural odes for image segmentation with level sets R Valle, F Reda, M Shoeybi, P Legresley, A Tao, B Catanzaro arXiv preprint arXiv:1912.11683, 2019	8	2019
Specification mining for machine improvisation with formal specifications R Valle, A Donzé, DJ Fremont, I Akkaya, SA Seshia, A Freed, D Wessel Computers in Entertainment (CIE) 14 (3), 1-20, 2016	8	2016
RAD-MMM: Multilingual multiaccented multispeaker text to speech R Badlani, R Valle, KJ Shih, JF Santos, S Gururani, B Catanzaro Proc. Interspeech 2023, 626-630, 2023	6	2023
P-flow: a fast and data-efficient zero-shot TTS through speech prompting S Kim, K Shih, JF Santos, E Bakhturina, M Desta, R Valle, S Yoon, ... Advances in Neural Information Processing Systems 36, 2024	5	2024

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors