Aku Rouhe
PhD Student, Aalto University
Verified email at aalto.fi
Title
Cited by
Year
SpeechBrain: A general-purpose speech toolkit
M Ravanelli, T Parcollet, P Plantinga, A Rouhe, S Cornell, L Lugosch, ...
arXiv preprint arXiv:2106.04624, 2021
Cited by: 284, Year: 2021
Samuele Cornell
M Ravanelli, T Parcollet, P Plantinga, A Rouhe
Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan …, 2021
Cited by: 50, Year: 2021
Multimodal machine translation through visuals and speech
U Sulubacak, O Caglayan, SA Grönroos, A Rouhe, D Elliott, L Specia, ...
Machine Translation 34, 97-147, 2020
Cited by: 50, Year: 2020
Speechbrain
M Ravanelli, T Parcollet, A Rouhe, P Plantinga, E Rastorgueva, ...
GitHub repository, 2021
Cited by: 17, Year: 2021
Samuele Cornell, Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan Zhong, et al.,“Speechbrain: A general-purpose speech toolkit,”
M Ravanelli, T Parcollet, P Plantinga, A Rouhe
arXiv preprint arXiv:2106.04624, 2021
Cited by: 16, Year: 2021
Digitala: An Augmented Test and Review Process Prototype for High-Stakes Spoken Foreign Language Examination.
R Karhila, A Rouhe, P Smit, A Mansikkaniemi, H Kallio, E Lindroos, ...
Interspeech, 784-785, 2016
Cited by: 15, Year: 2016
Samuele Cornell, Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan Zhong, et al. 2021. SpeechBrain: A general-purpose speech toolkit
M Ravanelli, T Parcollet, P Plantinga, A Rouhe
arXiv preprint arXiv:2106.04624, 2021
Cited by: 13, Year: 2021
Finnish ASR with Deep Transformer Models.
A Jain, A Rouhe, SA Grönroos, M Kurimo
Interspeech, 3630-3634, 2020
Cited by: 11, Year: 2020
SpeechBrain: A general-purpose speech toolkit. arXiv 2021
M Ravanelli, T Parcollet, P Plantinga, A Rouhe, S Cornell, L Lugosch, ...
arXiv preprint arXiv:2106.04624, 2021
Cited by: 11
Self-supervised end-to-end ASR for low resource L2 Swedish
R Al-Ghezi, Y Getman, A Rouhe, R Hildén, M Kurimo
22nd Annual Conference of the International Speech Communication Association …, 2021
Cited by: 10, Year: 2021
Lahjoita puhetta: a large-scale corpus of spoken Finnish with some benchmarks
A Moisio, D Porjazovski, A Rouhe, Y Getman, A Virkkunen, R AlGhezi, ...
Language Resources and Evaluation, 1-33, 2022
Cited by: 7, Year: 2022
Speaker-aware training of attention-based end-to-end speech recognition using neural speaker embeddings
A Rouhe, T Kaseva, M Kurimo
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 6, Year: 2020
SpeechBrain: A General-Purpose Speech Toolkit. arXiv
M Ravanelli, T Parcollet, P Plantinga, A Rouhe, S Cornell, L Lugosch, ...
arXiv preprint arXiv:2106.04624, 2021
Cited by: 5, Year: 2021
Samuele Cornell, Sung-Lin Yeh, Hwidong Na, Yan Gao, Szu-Wei Fu, Cem Subakan, Renato De Mori, and Yoshua Bengio. Speechbrain
M Ravanelli, T Parcollet, A Rouhe, P Plantinga, E Rastorgueva, ...
Cited by: 5, Year: 2021
Spherediar: An effective speaker diarization system for meeting data
T Kaseva, A Rouhe, M Kurimo
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
Cited by: 5, Year: 2019
Speechbrain: A general-purpose speech toolkit
T Parcollet, M Ravanelli, P Plantinga, A Rouhe, S Cornell, L Lugosch, ...
Cited by: 3, Year: 2022
An equal data setting for attention-based encoder-decoder and HMM/DNN models: A case study in Finnish ASR
A Rouhe, A Van Camp, M Singh, H Van Hamme, M Kurimo
Speech and Computer: 23rd International Conference, SPECOM 2021, St …, 2021
Cited by: 3, Year: 2021
Reading Validation for Pronunciation Evaluation in the Digitala Project.
A Rouhe, R Karhila, P Smit, M Kurimo
Interspeech, 2050-2051, 2017
Cited by: 3, Year: 2017
Finnish parliament ASR corpus: Analysis, benchmarks and statistics
A Virkkunen, A Rouhe, N Phan, M Kurimo
Language Resources and Evaluation, 1-26, 2023
Cited by: 2, Year: 2023
Low Resource Comparison of Attention-based and Hybrid ASR Exploiting wav2vec 2.0
A Rouhe, A Virkkunen, J Leinonen, M Kurimo
Interspeech, 3543-3547, 2022
Cited by: 2, Year: 2022