Berrak Sisman

Cited by

	All	Since 2019
Citations	1930	1893
h-index	23	23
i10-index	37	35

520

260

130

390

201720182019202020212022202320245 29 86 193 313 435 508 341

Public access

View all

31 articles

1 article

available

not available

Based on funding mandates

Co-authors

Haizhou LiThe Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), China; NUS, SingaporeVerified email at u.nus.edu
Kun ZhouAlibaba Group; National University of SingaporeVerified email at u.nus.edu
Rui Liu (刘瑞)Professor, Inner Mongolia UniversityVerified email at mail.imu.edu.cn
Björn SchullerProfessor, Technische Universität München (TUM) / Imperial College London & CSO, audEERINGVerified email at tum.de
Simon KingProfessor of Speech Processing, University of EdinburghVerified email at ed.ac.uk
Carlos BussoProfessor of Electrical Engineering, The University of Texas at DallasVerified email at utdallas.edu
Junichi YamagishiNational Institute of Informatics, Tokyo, JapanVerified email at nii.ac.jp
Sakriani SaktiProfessor, Nara Institute of Science and TechnologyVerified email at is.naist.jp
Satoshi NakamuraNara Institute of Science and TechnologyVerified email at is.naist.jp
Andros TjandraFacebook AI (research scientist)Verified email at fb.com
Dorien HerremansSingapore University of Technology and DesignVerified email at sutd.edu.sg
Najim DehakAssociate Professor at ECE department, Johns Hopkins University.Verified email at jhu.edu
Nancy F. ChenFellow, Multimodal Generative AI Group Leader, AI for Education Programme Head at A*STARVerified email at csail.mit.edu
Onur KayaProfessor of Electrical and Electronics Eng, Işık UniversityVerified email at isikun.edu.tr
Sennur UlukusProfessor of Electrical and Computer Engineering, University of MarylandVerified email at umd.edu

Berrak Sisman

Electrical & Computer Engineering Department, The University of Texas at Dallas

Verified email at utdallas.edu - Homepage

machine learning speech processing speech synthesis voice conversion emotion


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
An overview of voice conversion and its challenges: From statistical modeling to deep learning B Sisman, J Yamagishi, S King, H Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 132-157, 2021	332	2021
Seen and unseen emotional style transfer for voice conversion with a new emotional speech dataset K Zhou, B Sisman, R Liu, H Li IEEE ICASSP 2021 International Conference on Acoustics, Speech, and Signal …, 2021	177	2021
Emotional Voice Conversion: Theory, Databases and ESD K Zhou, B Sisman, R Liu, H Li Speech Communication, 2022	130	2022
Expressive TTS Training with Frame and Style Reconstruction Loss R Liu, B Sisman, G Gao, H Li IEEE/ACM Transactions on Audio, Speech and Language Processing, 2021	86	2021
VQVAE Unsupervised Unit Discovery and Multi-scale Code2Spec Inverter for Zerospeech Challenge 2019 A Tjandra, B Sisman, M Zhang, S Sakti, H Li, S Nakamura Proc. Interspeech 2019, 2019	86	2019
Transforming Spectrum and Prosody for Emotional Voice Conversion with Non-Parallel Training Data K Zhou, B Sisman, H Li Proc. Odyssey 2020, Tokyo, Japan, 2020	81	2020
Teacher-Student Training for Robust Tacotron-based TTS R Liu, B Sisman, J Li, F Bao, G Gao, H Li IEEE ICASSP 2020 International Conference on Acoustics, Speech, and Signal …, 2020	64	2020
Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice Conversion K Zhou, B Sisman, M Zhang, H Li Proc. Interspeech 2020, 2020	60	2020
A voice conversion framework with tandem feature sparse representation and speaker-adapted wavenet vocoder B Sisman, M Zhang, H Li Proc. Interspeech, 1978 -1982, 2018	60	2018
Group sparse representation with wavenet vocoder adaptation for spectrum and prosody conversion B Sisman, M Zhang, H Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (6), 1085 …, 2019	51	2019
Sparse representation of phonetic features for voice conversion with and without parallel data B Sisman, H Li, KC Tan Automatic Speech Recognition and Understanding Workshop (ASRU), 2017 IEEE …, 2017	51	2017
SINGAN: Singing voice conversion with generative adversarial networks B Sisman, K Vijayan, M Dong, H Li Asia-Pacific Signal and Information Processing Association Annual Summit and …, 2019	45	2019
Adaptive Wavenet Vocoder for Residual Compensation in GAN-based Voice Conversion B Sisman, M Zhang, S Sakti, H Li, S Nakamura 2018 IEEE Spoken Language Technology Workshop (SLT), 282-289, 2018	45	2018
Emotion Intensity and its Control for Emotional Voice Conversion K Zhou, B Sisman, R Rana, BW Schuller, H Li IEEE Transactions on Affective Computing, 2023	44	2023
Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability R Liu, B Sisman, H Li INTERSPEECH 2021, 2021	39	2021
VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech K Zhou, B Sisman, H Li 2021 IEEE Spoken Language Technology Workshop (SLT 2021), 2021	39	2021
On the study of Generative Adversarial Networks for Cross-lingual Voice Conversion B Sisman, M Zhang, M Dong, H Li IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) 2019, 2019	39	2019
Transformation of prosody in voice conversion B Sisman, H Li, KC Tan Asia-Pacific Signal and Information Processing Association Annual Summit and …, 2017	38	2017
Speech Synthesis with Mixed Emotions K Zhou, B Sisman, R Rana, BW Schuller, H Li IEEE Transactions on Affective Computing, 2023	35	2023
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-stage Sequence-to-Sequence Training K Zhou, B Sisman, H Li INTERSPEECH 2021, 2021	34	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors