Explaining Sentiment Classification with Synthetic Exemplars and Counter-Exemplars

We present XSPELLS, a model-agnostic local approach for explaining the decisions of a black box model for sentiment classification of short texts. The explanations consist of a set of exemplar sentences and a set of counter-exemplar sentences. The former are examples that the black box classifies with the same label as the text to explain. The latter are examples classified with a different label (a form of counterfactuals). Both are close in meaning to the text to explain, and both are meaningful sentences, although they are synthetically generated. XSPELLS generates neighbors of the text to explain in a latent space, using Variational Autoencoders to encode text and decode latent instances. A decision tree is learned from randomly generated neighbors and used to drive the selection of the exemplars and counter-exemplars. We report experiments on two datasets showing that XSPELLS outperforms the well-known LIME method in terms of quality of explanations, fidelity, and usefulness, and that it is comparable to LIME in terms of stability.
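
The neighborhood step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several assumptions: the Variational Autoencoder is omitted, so the points stay 2-D latent vectors instead of being decoded to sentences; `black_box` is a hypothetical toy classifier; and the decision tree that drives selection in the paper is replaced by a plain nearest-distance ranking. All function names and parameters are illustrative.

```python
import math
import random


def black_box(z):
    # Hypothetical stand-in for the sentiment classifier, applied directly
    # to a 2-D latent vector: label 1 if the coordinates sum to more than 0.
    return 1 if z[0] + z[1] > 0 else 0


def sample_neighbors(z, n, scale, rng):
    # Draw n random neighbors around z with Gaussian noise in latent space.
    return [[c + rng.gauss(0.0, scale) for c in z] for _ in range(n)]


def distance(a, b):
    # Euclidean distance between two latent vectors.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def explain(z, n=200, scale=1.0, k=3, seed=0):
    # Assumption: selection by distance to z replaces the decision tree
    # that the abstract says drives the selection.
    rng = random.Random(seed)
    label = black_box(z)
    neighbors = sample_neighbors(z, n, scale, rng)
    same = [p for p in neighbors if black_box(p) == label]
    other = [p for p in neighbors if black_box(p) != label]
    closeness = lambda p: distance(p, z)
    exemplars = sorted(same, key=closeness)[:k]
    counter_exemplars = sorted(other, key=closeness)[:k]
    return exemplars, counter_exemplars


exemplars, counter_exemplars = explain([0.2, 0.1])
print("exemplars:", exemplars)
print("counter-exemplars:", counter_exemplars)
```

In a fuller version, each selected latent vector would be passed through a text decoder so that the explanation is shown as sentences rather than coordinates.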

Additional Info
Field: Value
Creator: Lampridis, Orestis, lorestis@csd.auth.gr
Creator: Guidotti, Riccardo, riccardo.guidotti@unipi.it
Creator: Ruggieri, Salvatore, salvatore.ruggieri@unipi.it
DOI: https://doi.org/10.1007/978-3-030-61527-7_24
Group: Ethics and Legality
Publisher: Springer Link
Source: International Conference on Discovery Science, DS 2020: Discovery Science, pp. 357-373
Thematic Cluster: Social Network Analysis [SNA]
Thematic Cluster: Text and Social Media Mining [TSMM]
system:type: ConferencePaper
Management Info
Field: Value
Author: Wright Joanna
Maintainer: Guidotti Riccardo
Version: 1
Last Updated: 8 September 2023, 18:02 (CEST)
Created: 4 February 2021, 15:48 (CET)