Xuhui Zhou
Xuhui Zhou
Verified email at - Homepage
Cited by
Cited by
Evaluating commonsense in pre-trained language models
X Zhou, Y Zhang, L Cui, D Huang
Proceedings of the AAAI conference on artificial intelligence 34 (05), 9733-9740, 2020
Annotators with attitudes: How annotator beliefs and identities bias toxic language detection
M Sap, S Swayamdipta, L Vianna, X Zhou, Y Choi, NA Smith
Proceedings of the 2022 Conference of the North American Chapter of the …, 2021
Challenges in automated debiasing for toxic language detection
X Zhou
Proceedings of the 16th Conference of the European Chapter of the …, 2021
Webarena: A realistic web environment for building autonomous agents
S Zhou, FF Xu, H Zhu, X Zhou, R Lo, A Sridhar, X Cheng, Y Bisk, D Fried, ...
arXiv preprint arXiv:2307.13854, 2023
Clever hans or neural theory of mind? stress testing social reasoning in large language models
N Shapira, M Levy, SH Alavi, X Zhou, Y Choi, Y Goldberg, M Sap, ...
arXiv preprint arXiv:2305.14763, 2023
Linguistically-informed transformations (LIT): A method for automatically generating contrast sets
C Li, L Shengshuo, LZ Liu, X Wu, X Zhou, S Steinert-Threlkeld
Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting …, 2020
Sotopia: Interactive evaluation for social intelligence in language agents
X Zhou, H Zhu, L Mathur, R Zhang, H Yu, Z Qi, LP Morency, Y Bisk, ...
arXiv preprint arXiv:2310.11667, 2023
Can llms keep a secret? testing privacy implications of language models via contextual integrity theory
N Mireshghallah, H Kim, X Zhou, Y Tsvetkov, M Sap, R Shokri, Y Choi
arXiv preprint arXiv:2310.17884, 2023
Multilevel text alignment with cross-document attention
X Zhou, N Pappas, NA Smith
Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020
FANToM: A benchmark for stress-testing machine theory of mind in interactions
H Kim, M Sclar, X Zhou, RL Bras, G Kim, Y Choi, M Sap
arXiv preprint arXiv:2310.15421, 2023
Extracting and inferring personal attributes from dialogue
Z Wang
Proceedings of the 4th Workshop on NLP for Conversational AI, 2021
Cobra frames: Contextual reasoning about effects and harms of offensive statements
X Zhou, H Zhu, A Yerukola, T Davidson, JD Hwang, S Swayamdipta, ...
Proceedings of the Association for Computational Linguistics (ACL), 2023
Emergent Communication Fine-tuning (EC-FT) for Pretrained Language Models
S Steinert-Threlkeld, X Zhou, Z Liu, CM Downey
Emergent Communication Workshop at ICLR 2022, 2022
Is this the real life? is this just fantasy? the misleading success of simulating social interactions with llms
X Zhou, Z Su, T Eisape, H Kim, M Sap
arXiv preprint arXiv:2403.05020, 2024
RPD: a distance function between word embeddings
X Zhou, Z Zheng, S Huang
Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020
Don't Take This Out of Context! On the Need for Contextual Models and Evaluations for Stylistic Rewriting
A Yerukola, X Zhou, E Clark, M Sap
arXiv preprint arXiv:2305.14755, 2023
PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models
D Jain, P Kumar, S Gehman, X Zhou, T Hartvigsen, M Sap
arXiv preprint arXiv:2405.09373, 2024
Learning to translate by learning to communicate
CM Downey*, X Zhou*, LZ Liu, S Steinert-Threlkeld
EMNLP 2023 MRL, 2022
The system can't perform the operation now. Try again later.
Articles 1–18