Pathologies of neural models make interpretations difficult S Feng, E Wallace, A Grissom II, M Iyyer, P Rodriguez, J Boyd-Graber arXiv preprint arXiv:1804.07781, 2018 | 238 | 2018 |
Trick me if you can: Human-in-the-loop generation of adversarial examples for question answering E Wallace, P Rodriguez, S Feng, I Yamada, J Boyd-Graber Transactions of the Association for Computational Linguistics 7, 387-401, 2019 | 103 | 2019 |
Quizbowl: The case for incremental question answering P Rodriguez, S Feng, M Iyyer, H He, J Boyd-Graber arXiv preprint arXiv:1904.04792, 2019 | 30 | 2019 |
Evaluation Examples Are Not Equally Informative: How Should That Change NLP Leaderboards? P Rodriguez, J Barrow, AM Hoyle, JP Lalor, R Jia, J Boyd-Graber Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021 | 29 | 2021 |
Mitigating noisy inputs for question answering D Peskov, J Barrow, P Rodriguez, G Neubig, J Boyd-Graber arXiv preprint arXiv:1908.02914, 2019 | 14 | 2019 |
Trick me if you can: Adversarial writing of trivia challenge questions E Wallace, J Boyd-Graber ACL Student Research Workshop, 2018 | 11 | 2018 |
Information Seeking in the Spirit of Learning: a Dataset for Conversational Curiosity P Rodriguez, P Crook, S Moon, Z Wang Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020 | 10 | 2020 |
Human-computer question answering: The case for quizbowl J Boyd-Graber, S Feng, P Rodriguez The NIPS'17 Competition: Building Intelligent Systems, 169-180, 2018 | 10 | 2018 |
Evaluation paradigms in question answering P Rodriguez, J Boyd-Graber Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021 | 6 | 2021 |
Dynatask: A framework for creating dynamic AI benchmark tasks T Thrush, K Tirumala, A Gupta, M Bartolo, P Rodriguez, T Kane, ... arXiv preprint arXiv:2204.01906, 2022 | 5 | 2022 |
py-irt: A Scalable Item Response Theory Library for Python JP Lalor, P Rodriguez INFORMS Journal on Computing 35 (1), 5-13, 2023 | 3 | 2023 |
The stability wheel: An intuitive and didactic decision-making framework PA Rodriguez, S Rodriguez, J Carielo Proceedings of the 2014 International Snow Science Workshop, 1212-1214, 2014 | 2 | 2014 |
Clustering Examples in Multi-Dataset Benchmarks with Item Response Theory P Rodriguez, PM Htut, JP Lalor, J Sedoc Proceedings of the Third Workshop on Insights from Negative Results in NLP …, 2022 | 1 | 2022 |
Introduction to NIPS 2017 Competition Track S Escalera, M Weimer, M Burtsev, V Malykh, V Logacheva, R Lowe, ... The NIPS'17 Competition: Building Intelligent Systems, 1-23, 2018 | 1 | 2018 |
Applications of low cost and low power FMCW radar in the characterization of dry snow S Rodriguez, HP Marshall, P Rodriguez Proceedings of the International Snow Science Workshop, Banff, UK, 857-861, 2014 | 1 | 2014 |
Fighting FIRe with FIRE: Assessing the Validity of Text-to-Video Retrieval Benchmarks P Rodriguez, M Azab, B Silvert, R Sanchez, L Labson, H Shah, S Moon arXiv preprint arXiv:2210.05038, 2022 | | 2022 |
Proceedings of the First Workshop on Dynamic Adversarial Data Collection M Bartolo, H Kirk, P Rodriguez, K Margatina, T Thrush, R Jia, P Stenetorp, ... Proceedings of the First Workshop on Dynamic Adversarial Data Collection, 2022 | | 2022 |
: A Scalable Item Response Theory Library for Python JP Lalor, P Rodriguez arXiv preprint arXiv:2203.01282, 2022 | | 2022 |
Evaluating Machine Intelligence With Question Answering P Rodriguez University of Maryland, College Park, 2021 | | 2021 |
Application of a K-band FMCW Radar for the temporal tracking of dry snow stratigraphy, snowpack melt-freeze states, and mixed snow-rain precipitation events C Rodriguez, HP Marshall, P Rodriguez AGU Fall Meeting Abstracts 2018, C13D-1179, 2018 | | 2018 |