论文标题
因果沙普利价值:利用因果知识来解释复杂模型的个人预测
Causal Shapley Values: Exploiting Causal Knowledge to Explain Individual Predictions of Complex Models
论文作者
论文摘要
Shapley的价值是可解释的人工智能中最流行的模型不足方法之一。这些值旨在将模型预测与平均基线之间的差异归因于模型输入的不同特征。 Shapley的价值基于实体游戏理论原理,独特地满足了几种理想的属性,这就是为什么它们越来越多地用于解释可能复杂且高度非线性的机器学习模型的预测。当功能独立时,Shapley值可以很好地校准用户的直觉,但是当违反独立性假设时,可能会导致不良的,违反直觉的解释。 在本文中,我们提出了一个用于计算沙普利值的新框架,该框架概括了旨在规避独立性假设的最新工作。通过使用珍珠的Do-Calculus,我们展示了如何在不牺牲其任何理想特性的情况下得出这些“因果” shapley值。此外,因果沙普利值使我们能够分离直接和间接影响的贡献。当仅提供部分信息并在现实世界示例上说明其实用性时,我们为基于因果链图计算因果沙普利值的实用实现。
Shapley values underlie one of the most popular model-agnostic methods within explainable artificial intelligence. These values are designed to attribute the difference between a model's prediction and an average baseline to the different features used as input to the model. Being based on solid game-theoretic principles, Shapley values uniquely satisfy several desirable properties, which is why they are increasingly used to explain the predictions of possibly complex and highly non-linear machine learning models. Shapley values are well calibrated to a user's intuition when features are independent, but may lead to undesirable, counterintuitive explanations when the independence assumption is violated. In this paper, we propose a novel framework for computing Shapley values that generalizes recent work that aims to circumvent the independence assumption. By employing Pearl's do-calculus, we show how these 'causal' Shapley values can be derived for general causal graphs without sacrificing any of their desirable properties. Moreover, causal Shapley values enable us to separate the contribution of direct and indirect effects. We provide a practical implementation for computing causal Shapley values based on causal chain graphs when only partial information is available and illustrate their utility on a real-world example.