The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens N Zhou, Y Jiang, TR Bergquist, AJ Lee, BZ Kacsoh, AW Crocker, ... Genome biology 20, 1-23, 2019 | 399 | 2019 |
Capturing evolution in word usage: Just add more clusters? M Martinc, S Montariol, E Zosa, L Pivovarova Companion Proceedings of the Web Conference 2020, 343-349, 2020 | 62 | 2020 |
Topic modelling discourse dynamics in historical newspapers J Marjanen, E Zosa, S Hengchen, L Pivovarova, M Tolonen arXiv preprint arXiv:2011.10428, 2020 | 35 | 2020 |
Discovery team at SemEval-2020 task 1: Context-sensitive embeddings not always better than static for semantic change detection M Martinc, S Montariol, E Zosa, L Pivovarova Proceedings of the Fourteenth Workshop on Semantic Evaluation, 67-73, 2020 | 25 | 2020 |
Multilingual dynamic topic model E Zosa, M Granroth-Wilding Proceedings of the International Conference on Recent Advances in Natural …, 2019 | 22 | 2019 |
Clustering ideological terms in historical newspaper data with diachronic word embeddings J Marjanen, L Pivovarova, E Zosa, J Kurunmäki 5th International Workshop on Computational History, HistoInformatics 2019, 2019 | 17 | 2019 |
SemEval-2024 Shared Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes T Mickus, E Zosa, R Vázquez, T Vahtola, J Tiedemann, V Segonne, ... arXiv preprint arXiv:2403.07726, 2024 | 15 | 2024 |
SemEval-2024 Task 6: SHROOM, a shared-task on hallucinations and related observable overgeneration mistakes T Mickus, E Zosa, R Vázquez, T Vahtola, J Tiedemann, V Segonne, ... Proceedings of the 18th International Workshop on Semantic Evaluation …, 2024 | 11 | 2024 |
The expansion of isms, 1820-1917: Data-driven analysis of political language in digitized newspaper collections J Marjanen, J Kurunmäki, L Pivovarova, E Zosa Journal of Data Mining & Digital Humanities, 2020 | 10 | 2020 |
Multilingual topic labelling of news topics using ontological mapping E Zosa, L Pivovarova, M Boggia, S Ivanova European Conference on Information Retrieval, 248-256, 2022 | 9 | 2022 |
EMBEDDIA tools, datasets and challenges: Resources and hackathon contributions S Pollak, M Robnik-Šikonja, M Purver, M Boggia, R Shekhar, M Pranjić, ... Proceedings of the EACL Hackashop on News Media Content Analysis and …, 2021 | 9 | 2021 |
Poro 34B and the Blessing of Multilinguality R Luukkonen, J Burdge, E Zosa, A Talman, V Komulainen, V Hatanpää, ... arXiv preprint arXiv:2404.01856, 2024 | 8 | 2024 |
Multilingual and multimodal topic modelling with pretrained embeddings E Zosa, L Pivovarova arXiv preprint arXiv:2211.08057, 2022 | 8 | 2022 |
Evaluating the robustness of embedding-based topic models to OCR noise E Zosa, S Mutuvi, M Granroth-Wilding, A Doucet Towards Open and Trustworthy Digital Societies: 23rd International …, 2021 | 7 | 2021 |
Visual Topic Modelling for NewsImage Task at MediaEval 2021 L Pivovarova, E Zosa MediaEval, 2021 | 6 | 2021 |
A comparison of unsupervised methods for ad hoc cross-lingual document retrieval E Zosa, M Granroth-Wilding, L Pivovarova Proceedings of the workshop on Cross-Language Search and Summarization of …, 2020 | 6 | 2020 |
Effectiveness of Data Augmentation and Pretraining for Improving Neural Headline Generation in Low-Resource Settings M Martinc, S Montariol, L Pivovarova, E Zosa Proceedings of the Thirteenth Language Resources and Evaluation Conference …, 2022 | 5 | 2022 |
Word clustering for historical newspapers analysis L Pivovarova, J Marjanen, E Zosa Workshop on Language Technology for Digital Historical Archives: with a …, 2019 | 5 | 2019 |
Benchmarks for unsupervised discourse change detection QQ Duong, L Pivovarova, E Zosa International Workshop on Computational History, 2021 | 3 | 2021 |
Disappearing discourses: Avoiding anachronisms and teleology with data-driven methods in studying digital newspaper collections E Zosa, S Hengchen, J Marjanen, L Pivovarova, M Tolonen Digital humanities in the Nordic countries DHN 2020, Riga, Latvia, March 17–20, 2020 | 3 | 2020 |