论文标题
基于相似性的解释评估
Evaluation of Similarity-based Explanations
论文作者
论文摘要
解释复杂的机器学习模型做出的预测有助于用户自信地理解和接受预测的输出。一种有希望的方法是使用基于相似性的解释,该解释提供相似的实例作为支持模型预测的证据。将几个相关指标用于此目的。在这项研究中,我们研究了可以为用户提供合理解释的相关指标。具体而言,我们采用了三个测试来评估相关指标是否满足基于相似性的解释的最低要求。我们的实验表明,损失梯度的余弦相似性表现最佳,这在实践中是推荐的选择。此外,我们表明某些指标在我们的测试中的表现不佳,并分析了其失败的原因。我们希望我们的见解能够帮助从业者选择适当的相关指标,并帮助进一步的研究设计更好的相关指标来解释。
Explaining the predictions made by complex machine learning models helps users to understand and accept the predicted outputs with confidence. One promising way is to use similarity-based explanation that provides similar instances as evidence to support model predictions. Several relevance metrics are used for this purpose. In this study, we investigated relevance metrics that can provide reasonable explanations to users. Specifically, we adopted three tests to evaluate whether the relevance metrics satisfy the minimal requirements for similarity-based explanation. Our experiments revealed that the cosine similarity of the gradients of the loss performs best, which would be a recommended choice in practice. In addition, we showed that some metrics perform poorly in our tests and analyzed the reasons of their failure. We expect our insights to help practitioners in selecting appropriate relevance metrics and also aid further researches for designing better relevance metrics for explanations.