Dylan Slack

Cited by

	All	Since 2019
Citations	1628	1623
h-index	12	12
i10-index	12	12

580

290

145

435

2020202120222023202475 202 361 568 409

Public access

View all

7 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Sameer SinghAssociate Professor, UC IrvineVerified email at uci.edu
Himabindu LakkarajuAssistant Professor, Harvard UniversityVerified email at seas.harvard.edu
Sorelle A. FriedlerShibulal Family Professor of Computer Science, Haverford CollegeVerified email at haverford.edu
Satyapriya KrishnaHarvard UniversityVerified email at g.harvard.edu
Chitradeep Dutta RoyBrown UniversityVerified email at brown.edu
Krishnaram KenthapadiFiddler AIVerified email at CS.Stanford.EDU
Carlos ScheideggerPosit PBC (fka RStudio)Verified email at posit.co
Sanjiv DasSanta Clara UniversityVerified email at scu.edu
Chenhao TanUniversity of ChicagoVerified email at chenhaot.com
Yuxin ChenUniversity of Chicago, Assistant Professor of Computer ScienceVerified email at uchicago.edu
Jens TuylsPhD Student, Princeton UniversityVerified email at princeton.edu
Gavin KerriganUC IrvineVerified email at uci.edu
Michele DoniniAmazonVerified email at amazon.com
Cédric ArchambeauHelsing, BerlinVerified email at helsing.ai
Jiaqi MaAssistant Professor, University of Illinois Urbana-ChampaignVerified email at illinois.edu
Asma GhandehariounResearch Scientist, Google ResearchVerified email at google.com
Muhammad Bilal ZafarRuhr University Bochum & Research Center for Trustworthy Data Science and SecurityVerified email at rub.de
Nathalie RauschmayrVerified email at amazon.com
Yinlam ChowResearch Scientist, Google ResearchVerified email at google.com
Nevan WichersGoogleVerified email at google.com

Dylan Slack

Google

Verified email at google.com - Homepage

deep learning natural language processing robustness


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Fooling LIME and SHAP: Adversarial Attacks on Post hoc Explanation Methods D Slack, S Hilgard, E Jia, S Singh, H Lakkaraju AAAI/ACM Conference on Artificial Intelligence, Ethics, and Society (AIES), 2020	874	2020
Reliable Post hoc Explanations: Modeling Uncertainty in Explainability D Slack, S Hilgard, S Singh, H Lakkaraju NeurIPS, 2021	187	2021
Counterfactual Explanations Can Be Manipulated D Slack, S Hilgard, H Lakkaraju, S Singh NeurIPS, 2021	134	2021
Rethinking Explainability as a Dialogue: A Practitioner's Perspective H Lakkaraju, D Slack, Y Chen, C Tan, S Singh HCAI @ NuerIPS, 2022	82	2022
Fairness Warnings and Fair-MAML: Learning Fairly with Minimal Data D Slack, S Friedler, E Givental ACM Conference on Fairness, Accountability and Transparency (FAccT), 2020	66	2020
Assessing the Local Interpretability of Machine Learning Models D Slack, SA Friedler, C Scheidegger, C Dutta Roy Workshop on Human Centric Machine Learning, NeurIPS, 2019	61*	2019
Explaining machine learning models with interactive natural language conversations using TalkToModel D Slack, S Krishna, H Lakkaraju, S Singh Nature Machine Intelligence 5 (8), 873-883, 2023	60*	2023
Differentially Private Language Models Benefit from Public Pre-training G Kerrigan, D Slack, J Tuyls EMNLP PrivateNLP Workshop, 2020	50	2020
Post hoc explanations of language models can improve language models S Krishna, J Ma, D Slack, A Ghandeharioun, S Singh, H Lakkaraju NeurIPS, 2023	39	2023
A Careful Examination of Large Language Model Performance on Grade School Arithmetic H Zhang, J Da, D Lee, V Robinson, C Wu, W Song, T Zhao, P Raja, ...	17	2024
On the Lack of Robust Interpretability of Neural Text Classifiers MB Zafar, M Donini, D Slack, C Archambeau, S Das, K Kenthapadi Findings of ACL, 2021	17	2021
Active Meta-Learning for Predicting and Selecting Perovskite Crystallization Experiments V Shekar, G Nicholas, MA Najeeb, M Zeile, V Yu, X Wang, D Slack, Z Li, ... The Journal of Chemical Physics, 2021	15	2021
Feature attributions and counterfactual explanations can be manipulated D Slack, S Hilgard, S Singh, H Lakkaraju arXiv preprint arXiv:2106.12563, 2021	8	2021
Tablet: Learning from instructions for tabular data D Slack, S Singh arXiv preprint arXiv:2304.13188, 2023	5	2023
SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition D Slack, Y Chow, B Dai, N Wichers DARL @ ICML, 2022	5	2022
Context, language modeling, and multimodal data in finance S Das, C Giggins, J He, G Karypis, S Krishnamurthy, M Mahajan, ...	3	2021
Defuse: Harnessing Unrestricted Adversarial Examples for Debugging Models Beyond Test Accuracy D Slack, N Rauschmayr, K Kenthapadi NeurIPS XAI4Debugging Workshop, 2021	3*	2021
Robust Interactions with Machine Learning Models D Slack University of California, Irvine, 2023	2	2023
Learning Goal-Conditioned Representations for Language Reward Models V Nath, D Slack, J Da, Y Ma, H Zhang, S Whitehead, S Hendryx arXiv preprint arXiv:2407.13887, 2024		2024

The system can't perform the operation now. Try again later.

Articles 1–19

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors