Explaining black-box classifiers: Properties and functions - LAAS - Laboratoire d'Analyse et d'Architecture des Systèmes Access content directly
Journal Articles International Journal of Approximate Reasoning Year : 2023

Explaining black-box classifiers: Properties and functions


Explaining black-box classification models is a hot topic in AI, with the overall goal of improving trust in decisions made by such models. Several works have been done and diverse functions have been proposed. However, their formal properties and links have not been sufficiently studied. This paper presents four contributions: The first consists of investigating global explanations of black-box classifiers. We provide a formal and unifying framework in which such explanations are defined from the whole feature space. The framework is based on two concepts, which are seen as two types of global explanations: arguments in favour of (or pro) predictions and arguments against (or con) predictions. The second contribution consists of defining various types of local explanations (abductive explanations, counterfactuals, contrastive explanations) from the whole feature space, investigating their properties, links and differences, and showing how they relate to global explanations. The third contribution consists of analysing and defining explanation functions that generate (global, local) abductive explanations from incomplete information (i.e., from a subset of the feature space). We start by proposing two desirable properties that an explainer would satisfy, namely success and coherence. The former ensures the existence of explanations while the latter ensures their correctness. We show that in the incomplete case, the two properties cannot be satisfied together. The fourth contribution consists of proposing two functions that generate abductive explanations and which satisfy coherence at the expense of success.
Not file

Dates and versions

hal-03967209 , version 1 (01-02-2023)



Leila Amgoud. Explaining black-box classifiers: Properties and functions. International Journal of Approximate Reasoning, 2023, 155 : special issue SI: Extended papers from the Sixteenth European Conference on Symbolic and Quantitative Approaches to Reasoning with Uncertainty (ECSQARU 2021), pp.40-65. ⟨10.1016/j.ijar.2023.01.004⟩. ⟨hal-03967209⟩
2 View
0 Download



Gmail Facebook Twitter LinkedIn More