Xuhui Zhou
Verified email at cs.cmu.edu
Title
Cited by
Year
Annotators with attitudes: How annotator beliefs and identities bias toxic language detection
M Sap, S Swayamdipta, L Vianna, X Zhou, Y Choi, NA Smith
Proceedings of the 2022 Conference of the North American Chapter of the …, 2021
Cited by: 245, Year: 2021
WebArena: A realistic web environment for building autonomous agents
S Zhou, FF Xu, H Zhu, X Zhou, R Lo, A Sridhar, X Cheng, T Ou, Y Bisk, ...
arXiv preprint arXiv:2307.13854, 2023
Cited by: 224, Year: 2023
Evaluating commonsense in pre-trained language models
X Zhou, Y Zhang, L Cui, D Huang
Proceedings of the AAAI conference on artificial intelligence 34 (05), 9733-9740, 2020
Cited by: 210, Year: 2020
Challenges in automated debiasing for toxic language detection
X Zhou
Proceedings of the 16th Conference of the European Chapter of the …, 2021
Cited by: 156, Year: 2021
Clever Hans or neural theory of mind? Stress testing social reasoning in large language models
N Shapira, M Levy, SH Alavi, X Zhou, Y Choi, Y Goldberg, M Sap, ...
arXiv preprint arXiv:2305.14763, 2023
Cited by: 102, Year: 2023
Sotopia: Interactive evaluation for social intelligence in language agents
X Zhou, H Zhu, L Mathur, R Zhang, H Yu, Z Qi, LP Morency, Y Bisk, ...
arXiv preprint arXiv:2310.11667, 2023
Cited by: 82, Year: 2023
FANToM: A benchmark for stress-testing machine theory of mind in interactions
H Kim, M Sclar, X Zhou, RL Bras, G Kim, Y Choi, M Sap
arXiv preprint arXiv:2310.15421, 2023
Cited by: 52, Year: 2023
Can LLMs keep a secret? Testing privacy implications of language models via contextual integrity theory
N Mireshghallah, H Kim, X Zhou, Y Tsvetkov, M Sap, R Shokri, Y Choi
arXiv preprint arXiv:2310.17884, 2023
Cited by: 51, Year: 2023
Linguistically-informed transformations (LIT): A method for automatically generating contrast sets
C Li, L Shengshuo, LZ Liu, X Wu, X Zhou, S Steinert-Threlkeld
Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting …, 2020
Cited by: 34, Year: 2020
Tianyue Ou, Yonatan Bisk, Daniel Fried, Uri Alon, and Graham Neubig
S Zhou, FF Xu, H Zhu, X Zhou, R Lo, A Sridhar, X Cheng
WebArena: A realistic web environment for building autonomous agents 2 (3), 4, 2023
Cited by: 28, Year: 2023
COBRA frames: Contextual reasoning about effects and harms of offensive statements
X Zhou, H Zhu, A Yerukola, T Davidson, JD Hwang, S Swayamdipta, ...
Proceedings of the Association for Computational Linguistics (ACL), 2023
Cited by: 23, Year: 2023
Tianyue Ou, Yonatan Bisk, Daniel Fried, Uri Alon, and Graham Neubig. 2023. WebArena: A Realistic Web Environment for Building Autonomous Agents
S Zhou, FF Xu, H Zhu, X Zhou, R Lo, A Sridhar, X Cheng
arXiv preprint arXiv:2307.13854
Cited by: 21
Multilevel text alignment with cross-document attention
X Zhou, N Pappas, NA Smith
Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020
Cited by: 20, Year: 2020
Is this the real life? Is this just fantasy? The misleading success of simulating social interactions with LLMs
X Zhou, Z Su, T Eisape, H Kim, M Sap
arXiv preprint arXiv:2403.05020, 2024
Cited by: 18, Year: 2024
Consent in crisis: The rapid decline of the AI data commons
S Longpre, R Mahari, AN Lee, CS Lund, H Oderinwale, W Brannon, ...
The Thirty-eighth Conference on Neural Information Processing Systems …, 2024
Cited by: 16, Year: 2024
Extracting and inferring personal attributes from dialogue
Z Wang
Proceedings of the 4th Workshop on NLP for Conversational AI, 2021
Cited by: 15, Year: 2021
Clever Hans or neural theory of mind
N Shapira, M Levy, SH Alavi, X Zhou, Y Choi, Y Goldberg, M Sap, ...
Stress testing social reasoning in large language models, 2023
Cited by: 12, Year: 2023
Emergent Communication Fine-tuning (EC-FT) for Pretrained Language Models
S Steinert-Threlkeld, X Zhou, Z Liu, CM Downey
Emergent Communication Workshop at ICLR 2022, 2022
Cited by: 12, Year: 2022
PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models
D Jain, P Kumar, S Gehman, X Zhou, T Hartvigsen, M Sap
arXiv preprint arXiv:2405.09373, 2024
Cited by: 9, Year: 2024
Don't Take This Out of Context! On the Need for Contextual Models and Evaluations for Stylistic Rewriting
A Yerukola, X Zhou, E Clark, M Sap
arXiv preprint arXiv:2305.14755, 2023
Cited by: 5, Year: 2023