BLOOM: A 176B-Parameter Open-Access Multilingual Language Model TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... arXiv preprint arXiv:2211.05100, 2022 | 1484 | 2022 |
Counterfactual Data Augmentation for Mitigating Gender Stereotypes in Languages with Rich Morphology R Zmigrod, SJ Mielke, H Wallach, R Cotterell arXiv preprint arXiv:1906.04571, 2019 | 304 | 2019 |
Tired of Topic Models? Clusters of Pretrained Word Embeddings Make for Fast and Good Topics too! S Sia, A Dalmia, SJ Mielke arXiv preprint arXiv:2004.14914, 2020 | 158 | 2020 |
The CoNLL–SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection R Cotterell, C Kirov, J Sylak-Glassman, G Walther, E Vylomova, ... arXiv preprint arXiv:1810.07125, 2018 | 151 | 2018 |
UniMorph 2.0: Universal Morphology. C Kirov, R Cotterell, J Sylak-Glassman, G Walther, E Vylomova, P Xia, ... LREC, 2018 | 146 | 2018 |
Reducing Conversational Agents’ Overconfidence Through Linguistic Calibration SJ Mielke, A Szlam, E Dinan, YL Boureau Transactions of the Association for Computational Linguistics 10, 857-872, 2022 | 130* | 2022 |
The SIGMORPHON 2019 shared task: Crosslinguality and context in morphology AD McCarthy, E Vylomova, S Wu, C Malaviya, L Wolf-Sonkin, G Nicolai, ... Proceedings of the 16th SIGMORPHON Workshop on Computational Research in …, 2019 | 118* | 2019 |
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP SJ Mielke, Z Alyafeai, E Salesky, C Raffel, M Dey, M Gallé, A Raja, C Si, ... arXiv preprint arXiv:2112.10508, 2021 | 114* | 2021 |
Are All Languages Equally Hard to Language-Model? R Cotterell, SJ Mielke, J Eisner, B Roark Proceedings of the 2018 Conference of the North American Chapter of the …, 2018 | 96 | 2018 |
UniMorph 3.0: Universal Morphology AD McCarthy, C Kirov, M Grella, A Nidhi, P Xia, K Gorman, E Vylomova, ... Proceedings of The 12th Language Resources and Evaluation Conference, 3922-3931, 2020 | 88 | 2020 |
What Kind of Language Is Hard to Language-Model? SJ Mielke, R Cotterell, K Gorman, B Roark, J Eisner arXiv preprint arXiv:1906.04726, 2019 | 72 | 2019 |
Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset B Roark, L Wolf-Sonkin, C Kirov, SJ Mielke, C Johny, I Demirsahin, K Hall Proceedings of The 12th Language Resources and Evaluation Conference, 2413-2423, 2020 | 71 | 2020 |
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection E Vylomova, J White, E Salesky, SJ Mielke, S Wu, E Ponti, RH Maudslay, ... arXiv preprint arXiv:2006.11572, 2020 | 60 | 2020 |
Spell once, summon anywhere: A two-level open-vocabulary language model SJ Mielke, J Eisner Proceedings of the AAAI Conference on Artificial Intelligence 33, 6843-6850, 2019 | 32 | 2019 |
SIGTYP 2020 Shared Task: Prediction of Typological Features J Bjerva, E Salesky, SJ Mielke, A Chaudhary, GGA Celano, EM Ponti, ... arXiv preprint arXiv:2010.08246, 2020 | 29 | 2020 |
SIGMORPHON 2021 shared task on morphological reinflection: Generalization across languages T Pimentel, M Ryskina, SJ Mielke, S Wu, E Chodroff, B Leonard, G Nicolai, ... Proceedings of the 18th SIGMORPHON Workshop on Computational Research in …, 2021 | 27 | 2021 |
UniMorph 4.0: Universal Morphology K Batsuren, O Goldman, S Khalifa, N Habash, W Kieraś, G Bella, ... arXiv preprint arXiv:2205.03608, 2022 | 26 | 2022 |
It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information E Bugliarello, SJ Mielke, A Anastasopoulos, R Cotterell, N Okazaki arXiv preprint arXiv:2005.02354, 2020 | 19 | 2020 |
Can you compare perplexity across different segmentations? SJ Mielke Available at: https://sjmielke.com/comparing-perplexities.htm, 2019 | 16 | 2019 |
SIGTYP 2021 shared task: Robust spoken language identification E Salesky, BM Abdullah, S Mielke, E Klyachko, O Serikov, EM Ponti, ... Proceedings of the Third Workshop on Computational Typology and Multilingual …, 2021 | 14 | 2021 |