Evaluating Deep Taylor Decomposition for Reliability Assessment in the Wild

Research output: Contribution to journal › Conference article › Research › peer-review

We argue that we need to evaluate model interpretability methods 'in the wild', i.e., in situations where professionals make critical decisions and models can potentially assist them. We present an in-the-wild evaluation of token attribution based on Deep Taylor Decomposition, with professional journalists performing reliability assessments. We find that using this method in conjunction with RoBERTa-Large, fine-tuned on the Gossip Corpus, led to faster and better human decision-making, as well as a more critical attitude toward news sources among the journalists. We present a comparison of human and model rationales, as well as a qualitative analysis of the journalists' experiences with machine-in-the-loop decision-making.

Original language	English
Journal	Proceedings of the International AAAI Conference on Web and Social Media
Volume	16
Pages (from-to)	1368-1372
ISSN	2162-3449
DOI	https://doi.org/10.1609/icwsm.v16i1.19389
Publication status	Published - 2022
Event	16th International AAAI Conference on Web and Social Media, Atlanta, United States, 6 Jun 2022 → 9 Jun 2022

Department of Computer Science