Dylan Slack
Title
Cited by
Year
Fooling LIME and SHAP: Adversarial Attacks on Post hoc Explanation Methods
D Slack, S Hilgard, E Jia, S Singh, H Lakkaraju
AAAI/ACM Conference on Artificial Intelligence, Ethics, and Society (AIES), 2020
Cited by: 733; Year: 2020
Reliable Post hoc Explanations: Modeling Uncertainty in Explainability
D Slack, S Hilgard, S Singh, H Lakkaraju
NeurIPS, 2021
Cited by: 153*; Year: 2021
Counterfactual Explanations Can Be Manipulated
D Slack, S Hilgard, H Lakkaraju, S Singh
NeurIPS, 2021
Cited by: 105; Year: 2021
Assessing the Local Interpretability of Machine Learning Models
D Slack, SA Friedler, C Scheidegger, C Dutta Roy
Workshop on Human Centric Machine Learning, NeurIPS, 2019
Cited by: 60*; Year: 2019
Rethinking Explainability as a Dialogue: A Practitioner's Perspective
H Lakkaraju, D Slack, Y Chen, C Tan, S Singh
HCAI @ NeurIPS, 2022
Cited by: 59; Year: 2022
Fairness Warnings and Fair-MAML: Learning Fairly with Minimal Data
D Slack, S Friedler, E Givental
ACM Conference on Fairness, Accountability and Transparency (FAccT), 2020
Cited by: 54; Year: 2020
Differentially Private Language Models Benefit from Public Pre-training
G Kerrigan, D Slack, J Tuyls
EMNLP PrivateNLP Workshop, 2020
Cited by: 40; Year: 2020
Explaining machine learning models with interactive natural language conversations using TalkToModel
D Slack, S Krishna, H Lakkaraju, S Singh
Nature Machine Intelligence 5 (8), 873-883, 2023
Cited by: 26*; Year: 2023
On the Lack of Robust Interpretability of Neural Text Classifiers
MB Zafar, M Donini, D Slack, C Archambeau, S Das, K Kenthapadi
Findings of ACL, 2021
Cited by: 19; Year: 2021
Active Meta-Learning for Predicting and Selecting Perovskite Crystallization Experiments
V Shekar, G Nicholas, MA Najeeb, M Zeile, V Yu, X Wang, D Slack, Z Li, ...
The Journal of Chemical Physics, 2021
Cited by: 15; Year: 2021
Post hoc explanations of language models can improve language models
S Krishna, J Ma, D Slack, A Ghandeharioun, S Singh, H Lakkaraju
NeurIPS, 2023
Cited by: 10; Year: 2023
Feature attributions and counterfactual explanations can be manipulated
D Slack, S Hilgard, S Singh, H Lakkaraju
arXiv preprint arXiv:2106.12563, 2021
Cited by: 7; Year: 2021
Defuse: Harnessing Unrestricted Adversarial Examples for Debugging Models Beyond Test Accuracy
D Slack, N Rauschmayr, K Kenthapadi
NeurIPS XAI4Debugging Workshop, 2021
Cited by: 4*; Year: 2021
SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition
D Slack, Y Chow, B Dai, N Wichers
DARL @ ICML, 2022
Cited by: 3; Year: 2022
Context, language modeling, and multimodal data in finance
S Das, C Goggins, J He, G Karypis, S Krishnamurthy, M Mahajan, ...
The Journal of Financial Data Science 3 (3), 52-66, 2021
Cited by: 3; Year: 2021
TABLET: Learning From Instructions For Tabular Data
D Slack, S Singh
arXiv preprint arXiv:2304.13188, 2023
Cited by: 1; Year: 2023
Robust Interactions with Machine Learning Models
D Slack
UC Irvine, 2023
Year: 2023