About the workshop

Safety and reliability concerns are major obstacles for the adoption of AI in practice. In addition, European regulation will make explaining the decisions of models a requirement for so-called high-risk AI applications.

When models grow in size and complexity they tend to become less interpretable. Explainability in machine learning attempts to address this problem by providing insights into the inference process of a model. It can be used as a tool to make models more trustworthy, reliable, transferable, fair, and robust. However, it is not without its own problems, with algorithms often reporting contradictory explanations for the same phenomena.

This workshop provides an overview of the state-of-the-art in explainable AI, with a focus on practical considerations. We discuss different approaches to explainability, their strengths and weaknesses, and how they can be integrated into the machine learning pipeline. We also discuss the challenges of explainability in the context of deep learning and present interpretable deep-learning architectures for important domains such as computer vision and forecasting.

Learning outcomes

Get an overview about the potential applications of explainability in machine learning.
Get to know taxonomy of explainability methods and their applicability to different types of models.
Learn how explainability considerations can be integrated into the machine learning pipeline: Exploratory data analysis, feature engineering, model selection, evaluation, visualization, and interpretation. Exploratory data analysis
Get to know model agnostic and model specific explainability methods.
Understand the difference between black-box explanations and intrinsically interpretable models.
Learn about specialized techniques for important subfields such as time series forecasting or computer vision.
Learn how to evaluate the quality of explanations.

Part 1: Introduction to explainable AI

We start the workshop with a gentle introduction into the topic of explainable AI. We discuss the motivation for explainability, the different types of explanations, and the challenges of explainability.

AI incidents and the need for explainability
Trust in AI
Overview and taxonomy of explainability methods
Intrinsically interpretable models and black box explanations

Part 2: Post-hoc explainability methods

Most machine learning models today are not intrinsically interpretable. Model agnostic post-hoc explainability methods try to provide insights into the decision process of a model with considering its internal structure. Most current approaches produce feature attributions, which are a measure of how much each feature contributes to the prediction. Explanations of this kind have special implications for the feature engineering process. We exemplify such considerations on a bike rental prediction example. Concerning the explainability methods, we present two important subclasses: Statistical approaches (e.g. partial dependence and accumulated local effects) and local surrogate models (e.g. LIME and SHAP). We discuss the different interpretations of feature attribution as well as their strengths and weaknesses.

Shap additive explanations

Partial dependence, individual conditional expectation, and accumulated local effects
Additive explanations: LIME, Shapley values, and SHAP

Part 3: Deep learning specific methods

Deep learning models are often more complex than traditional machine learning models. They are usually composed of many layers and have millions of parameters. This makes them hard to understand. However, neural networks are differentiable, which allows the use of gradient-based methods to understand their predictions. Saliency maps are among the most popular methods to understand the predictions of deep learning models. They use the gradient of the output with respect to the input to identify the most important features of the input. Another line of research is to use the gradient of the loss with respect to a weighting of the training data to determine the importance of a training point for the performance of the model. In particular, we apply gradient based methods to identify unhelpful data points in a dataset and to understand the predictions of pre-built image classifiers. Saliency map via integrated gradients

Data valuation: influence functions
Saliency maps

Part 4: Interpretable computer vision

Computer vision has long been a field of AI were it was commonly believed that one needs to sacrifice interpretability for performance. However, recent advances in explainable AI have shown that this is not necessarily the case. We will discuss the recently proposed class of interpretable prototype networks. We evaluate their performance on a bird classification task and discuss the extent to which they are interpretable.

Prototypes
Transferlab ProtoTreeNet library

Part 5: Interpretable time series forecasting

Interpretability plays an important role in time series forecasting. Since predictions are made about an intrinsically uncertain future, it is important to understand the reasons for a prediction. If an interpretable model accurately captures causalities in the data, it can provide insights to analysts that work with forecasting models. For the very same reasons, correct quantification of uncertainty is of great importance for the proper interpretability of a forecast. We will discuss interpretable probabilistic forecasting models as well as interpretable deep learning architectures like attention based transformers. We work under the analyst in the loop approach and apply interpretable models to predict the price on the spanish electricity market. Finally, we draw insights about the market from our models. Additive seasons-trend decomposition of a time series

Quantile prediction of a transformer with attention map on the history

Prophet
Neural hierarchical interpolation for time series forecasting
Attention: Temporal Fusion Transformer

Prerequisites

We assume prior exposure to machine learning and deep learning and a general understanding of the underlying mathematical concepts.
Basic knowledge of Python is required to complete the exercises. Knowledge of the python ML stack is recommended. Source: XKCD 2237

Companion Seminar

Accompanying the course, we offer a seminar covering neighbouring topics that cannot make it into the course due to time constraints. It is held online over the course of several weeks and consists of talks reviewing papers in the field of explainable AI. The seminar is informal and open to everyone: we welcome participation, both in the discussions or presenting papers.

References

[Mol22I]

Interpretable Machine Learning, Christoph Molnar.

2022

Machine learning has great potential for improving products, processes and research. But computers usually do not explain their predictions which is a barrier to the adoption of machine learning. This book is about making machine learning models and their decisions interpretable. After exploring the concepts of interpretability, you will learn about simple, interpretable models such as decision …

[Kam21E]

Explainable Artificial Intelligence: An Introduction to Interpretable Machine Learning, Uday Kamath, John Liu.

2021

In recent years, we have seen gains in adoption of machine learning and artificial intelligence applications. However, continued adoption is being constrained by several limitations. The field of Explainable AI addresses one of the largest shortcomings of machine learning and deep learning algorithms today: the interpretability and explainability of models. As algorithms become more powerful and …

[Alb22C]

Counterfactual Shapley Additive Explanations, Emanuele Albini, Jason Long, Danial Dervovic, Daniele Magazzeni.

Jun 2022

Feature attributions are a common paradigm for model explanations due to their simplicity in assigning a single numeric score for each input feature to a model. In the actionable recourse setting, wherein the goal of the explanations is to improve outcomes for model consumers, it is often unclear how feature attributions should be correctly used. With this work, we aim to strengthen and clarify …

[Apl19V]

Visualizing the Effects of Predictor Variables in Black Box Supervised Learning Models, Daniel W. Apley, Jingyu Zhu.

Aug 2019

When fitting black box supervised learning models (e.g., complex trees, neural networks, boosted trees, random forests, nearest neighbors, local kernel-weighted methods, etc.), visualizing the main effects of the individual predictor variables and their low-order interaction effects is often important, and partial dependence (PD) plots are the most popular approach for accomplishing this. However, …

[Bur21S]

A Survey on the Explainability of Supervised Machine Learning, Nadia Burkart, Marco F. Huber.

Jan 2021

Predictions obtained by, e.g., artificial neural networks have a high accuracy but humans often perceive the models as black boxes. Insights about the decision making are mostly opaque for humans. Particularly understanding the decision making in highly sensitive areas such as healthcare or finance, is of paramount importance. The decision-making behind the black boxes requires it to be more …

[Cha22N]

N-HiTS: Neural Hierarchical Interpolation for Time Series Forecasting, Cristian Challu, Kin G. Olivares, Boris N. Oreshkin, Federico Garza, Max Mergenthaler-Canseco, Artur Dubrawski.

Sep 2022

Recent progress in neural forecasting accelerated improvements in the performance of large-scale forecasting systems. Yet, long-horizon forecasting remains a very difficult task. Two common challenges afflicting the task are the volatility of the predictions and their computational complexity. We introduce N-HiTS, a model which addresses both challenges by incorporating novel hierarchical …

[Che18L]

Learning to Explain: An Information-Theoretic Perspective on Model Interpretation, Jianbo Chen, Le Song, Martin J. Wainwright, Michael I. Jordan.

Jun 2018

We introduce instancewise feature selection as a methodology for model interpretation. Our method is based on learning a function to extract a subset of features that are most informative for each given example. This feature selector is trained to maximize the mutual information between selected features and the response variable, where the conditional distribution of the response variable given …

[Cov20U]

Understanding Global Feature Contributions With Additive Importance Measures, Ian Covert, Scott Lundberg, Su-In Lee.

Oct 2020

Understanding the inner workings of complex machine learning models is a long-standing problem and most recent research has focused on local interpretability. To assess the role of individual input features in a global sense, we explore the perspective of defining feature importance through the predictive power associated with each feature. We introduce two notions of predictive power (model-based …

[Fry21S]

Shapley Values for Feature Selection: The Good, the Bad, and the Axioms, Daniel Fryer, Inga Strümke, Hien Nguyen.

2021

The Shapley value has become popular in the Explainable AI (XAI) literature, thanks, to a large extent, to a solid theoretical foundation, including four “favourable and fair” axioms for attribution in transferable utility games. The Shapley value is probably the only solution concept satisfying these axioms. In this paper, we introduce the Shapley value and draw attention to its recent uses as a …

[Gal21E]

Explaining Black-Box Algorithms Using Probabilistic Contrastive Counterfactuals, Sainyam Galhotra, Romila Pradhan, Babak Salimi.

Jun 2021

There has been a recent resurgence of interest in explainable artificial intelligence (XAI) that aims to reduce the opaqueness of AI-based decision-making systems, allowing humans to scrutinize and trust them. Prior work in this context has focused on the attribution of responsibility for an algorithm's decisions to its inputs wherein responsibility is typically approached as a purely …

[Kar20M]

Model-Agnostic Counterfactual Explanations for Consequential Decisions, Amir-Hossein Karimi, Gilles Barthe, Borja Balle, Isabel Valera.

Feb 2020

Predictive models are being increasingly used to support consequential decision making at the individual level in contexts such as pretrial bail and loan approval. As a result, there is increasing social and legal pressure to provide explanations that help the affected individuals not only to understand why a prediction was output, but also how to act to obtain a desired outcome. To this end, …

[Kum20P]

Problems with Shapley-value-based explanations as feature importance measures, I. Elizabeth Kumar, Suresh Venkatasubramanian, Carlos Scheidegger, Sorelle Friedler.

Nov 2020

Game-theoretic formulations of feature importance have become popular as a way to

[Kum21S]

Shapley Residuals: Quantifying the limits of the Shapley value for explanations, Indra Kumar, Carlos Scheidegger, Suresh Venkatasubramanian, Sorelle Friedler.

2021

[Li19L]

Learning interpretable deep state space model for probabilistic time series forecasting, Longyuan Li, Junchi Yan, Xiaokang Yang, Yaohui Jin.

Aug 2019

Probabilistic time series forecasting involves estimating the distribution of future based on its history, which is essential for risk management in downstream decision-making. We propose a deep state space model for probabilistic time series forecasting whereby the non-linear emission model and transition model are parameterized by networks and the dependency is modeled by recurrent neural nets. …

[Lim21T]

Temporal Fusion Transformers for interpretable multi-horizon time series forecasting, Bryan Lim, Sercan Ö. Arık, Nicolas Loeff, Tomas Pfister.

Oct 2021

Multi-horizon forecasting often contains a complex mix of inputs – including static (i.e. time-invariant) covariates, known future inputs, and other exogenous time series that are only observed in the past – without any prior information on how they interact with the target. Several deep learning methods have been proposed, but they are typically ‘black-box’ models that do not shed light on how …

[Mer19E]

The Explanation Game: Explaining Machine Learning Models Using Shapley Values, Luke Merrick, Ankur Taly.

Sep 2019

A number of techniques have been proposed to explain a machine learning model's prediction by attributing it to the corresponding input features. Popular among these are techniques that apply the Shapley value method from cooperative game theory. While existing papers focus on the axiomatic motivation of Shapley values, and efficient techniques for computing them, they offer little justification …

[Ore22N]

N-BEATS: Neural basis expansion analysis for interpretable time series forecasting, Boris N. Oreshkin, Dmitri Carpov, Nicolas Chapados, Yoshua Bengio.

Mar 2022

We focus on solving the univariate times series point forecasting problem using deep learning. We propose a deep neural architecture based on backward and forward residual links and a very deep stack of fully-connected layers. The architecture has a number of desirable properties, being interpretable, applicable without modification to a wide array of target domains, and fast to train. We test the …

[Ran18D]

Deep State Space Models for Time Series Forecasting, Syama Sundar Rangapuram, Matthias W Seeger, Jan Gasthaus, Lorenzo Stella, Yuyang Wang, Tim Januschowski.

2018

We present a novel approach to probabilistic time series forecasting that combines state space models with deep learning. By parametrizing a per-time-series linear state space model with a jointly-learned recurrent neural network, our method retains desired properties of state space models such as data efficiency and interpretability, while making use of the ability to learn complex patterns from …

[Rib18A]

Anchors: High-Precision Model-Agnostic Explanations, Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin.

Apr 2018

We introduce a novel model-agnostic system that explains the behavior of complex models with high-precision rules called anchors, representing local, “sufﬁcient” conditions for predictions. We propose an algorithm to efﬁciently compute these explanations for any black-box model with high-probability guarantees. We demonstrate the ﬂexibility of anchors by explaining a myriad of different models for …

[Rud19S]

Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead, Cynthia Rudin.

May 2019

Black box machine learning models are currently being used for high-stakes decision making throughout society, causing problems in healthcare, criminal justice and other domains. Some people hope that creating methods for explaining these black box models will alleviate some of the problems, but trying to explain black box models, rather than creating models that are interpretable in the first …

[Sun17A]

Axiomatic Attribution for Deep Networks, Mukund Sundararajan, Ankur Taly, Qiqi Yan.