Dylan Slack
Cited by
Cited by
Fooling LIME and SHAP: Adversarial Attacks on Post hoc Explanation Methods
D Slack, S Hilgard, E Jia, S Singh, H Lakkaraju
AAAI/ACM Conference on Artificial Intelligence, Ethics, and Society (AIES), 2020
Reliable Post hoc Explanations: Modeling Uncertainty in Explainability
D Slack, S Hilgard, S Singh, H Lakkaraju
NeurIPS, 2021
Assessing the Local Interpretability of Machine Learning Models
D Slack, SA Friedler, C Scheidegger, C Dutta Roy
Workshop on Human Centric Machine Learning, NeurIPS, 2019
Counterfactual Explanations Can Be Manipulated
D Slack, S Hilgard, H Lakkaraju, S Singh
NeurIPS, 2021
Fairness Warnings and Fair-MAML: Learning Fairly with Minimal Data
D Slack, S Friedler, E Givental
ACM Conference on Fairness, Accountability and Transparency (FAccT), 2020
Differentially Private Language Models Benefit from Public Pre-training
G Kerrigan, D Slack, J Tuyls
EMNLP PrivateNLP Workshop, 2020
Rethinking Explainability as a Dialogue: A Practitioner's Perspective
H Lakkaraju, D Slack, Y Chen, C Tan, S Singh
HCAI @ NuerIPS, 2022
On the Lack of Robust Interpretability of Neural Text Classifiers
MB Zafar, M Donini, D Slack, C Archambeau, S Das, K Kenthapadi
Findings of ACL, 2021
Active Meta-Learning for Predicting and Selecting Perovskite Crystallization Experiments
V Shekar, G Nicholas, MA Najeeb, M Zeile, V Yu, X Wang, D Slack, Z Li, ...
The Journal of Chemical Physics, 2021
TalkToModel: Understanding Machine Learning Models With Open Ended Dialogues
D Slack, S Krishna, H Lakkaraju, S Singh
TSRML @ NeurIPS, 2022
SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition
D Slack, Y Chow, B Dai, N Wichers
DARL @ ICML, 2022
Defuse: Harnessing Unrestricted Adversarial Examples for Debugging Models Beyond Test Accuracy
D Slack, N Rauschmayr, K Kenthapadi
NeurIPS XAI4Debugging Workshop, 2021
Context, language modeling, and multimodal data in finance
S Das, C Goggins, J He, G Karypis, S Krishnamurthy, M Mahajan, ...
The Journal of Financial Data Science, 2021
Post Hoc Explanations of Language Models Can Improve Language Models
J Ma, D Slack, A Ghandeharioun, S Singh, H Lakkaraju
arXiv preprint arXiv:2305.11426, 2023
TABLET: Learning From Instructions For Tabular Data
D Slack, S Singh
arXiv preprint arXiv:2304.13188, 2023
The system can't perform the operation now. Try again later.
Articles 1–15