Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805, 2023 | 2213 | 2023 |
Are Sixteen Heads Really Better than One? P Michel, O Levy, G Neubig NeurIPS 2019, 2019 | 1110 | 2019 |
Gemma: Open models based on gemini research and technology G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024 | 740 | 2024 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024 | 702 | 2024 |
Dynet: The dynamic neural network toolkit G Neubig, C Dyer, Y Goldberg, A Matthews, W Ammar, A Anastasopoulos, ... arXiv preprint arXiv:1701.03980, 2017 | 451* | 2017 |
Weight Poisoning Attacks on Pre-trained Models K Kurita, P Michel, G Neubig ACL 2020, 2020 | 430 | 2020 |
Gemma 2: Improving open language models at a practical size G Team, M Riviere, S Pathak, PG Sessa, C Hardin, S Bhupatiraju, ... arXiv preprint arXiv:2408.00118, 2024 | 203 | 2024 |
On Evaluation of Adversarial Perturbations for Sequence-to-Sequence Models P Michel, X Li, G Neubig, JM Pino NAACL 2019, 2019 | 148 | 2019 |
MTNT: A Testbed for Machine Translation of Noisy Text P Michel, G Neubig EMNLP 2018, 2018 | 147 | 2018 |
compare-mt: A Tool for Holistic Comparison of Language Generation Systems G Neubig, ZY Dou, J Hu, P Michel, D Pruthi, X Wang NAACL 2019 Demo, 2019 | 133 | 2019 |
Extreme Adaptation for Personalized Neural Machine Translation P Michel, G Neubig ACL 2018, 2018 | 114 | 2018 |
Examining and Combating Spurious Features under Distribution Shift C Zhou, X Ma, P Michel, G Neubig ICML 2021, 2021 | 69 | 2021 |
Findings of the first shared task on machine translation robustness X Li, P Michel, A Anastasopoulos, Y Belinkov, N Durrani, O Firat, P Koehn, ... WMT 2019, 2019 | 69 | 2019 |
Optimizing data usage via differentiable rewards X Wang, H Pham, P Michel, A Anastasopoulos, J Carbonell, G Neubig International Conference on Machine Learning, 9983-9995, 2020 | 65 | 2020 |
Modeling the Second Player in Distributionally Robust Optimization P Michel, T Hashimoto, G Neubig ICLR 2021, 2021 | 38 | 2021 |
Blind phoneme segmentation with temporal prediction errors P Michel, O Räsänen, R Thiolliere, E Dupoux ACL SRW 2017, 2016 | 32 | 2016 |
Should we be pre-training? an argument for end-task aware training as an alternative LM Dery, P Michel, A Talwalkar, G Neubig ICLR 2022, 2021 | 31 | 2021 |
Codegemma: Open code models based on gemma CG Team, H Zhao, J Hui, J Howland, N Nguyen, S Zuo, A Hu, ... arXiv preprint arXiv:2406.11409, 2024 | 30 | 2024 |
Findings of the WMT 2020 shared task on machine translation robustness L Specia, Z Li, J Pino, V Chaudhary, F Guzmán, G Neubig, N Durrani, ... Proceedings of the Fifth Conference on Machine Translation, 76-91, 2020 | 30 | 2020 |
Distributionally Robust Models with Parametric Likelihood Ratios P Michel, T Hashimoto, G Neubig ICLR 2022, 2022 | 23 | 2022 |