石灰的统计稳定指数：获得机器学习模型的可靠解释

论文标题

石灰的统计稳定指数：获得机器学习模型的可靠解释

Statistical stability indices for LIME: obtaining reliable explanations for Machine Learning models

论文作者

Visani, Giorgio, Bagli, Enrico, Chesani, Federico, Poluzzi, Alessandro, Capuzzo, Davide

论文摘要

如今，我们目睹了业务流程向更计算驱动的方法的转变。机器学习技术的使用不断增加是这种趋势的最清晰的例子。这种革命通常会提供优势，例如预测准确性的提高以及获得结果的时间减少。但是，这些方法是一个主要的缺点：很难理解算法采取的决定。为了解决这个问题，我们考虑使用石灰方法。然后，我们对石灰的一般背景，我们关注稳定性问题：在相同条件下使用方法重复的时间可能会产生不同的解释。提出了两个互补指数，以测量石灰稳定性。对于从业者而言，重要的是要意识到这个问题，并拥有一个用于发现问题的工具。稳定性保证石灰的解释是可靠的，因此通过拟议指数进行的稳定评估至关重要。作为案例研究，我们将机器学习和经典统计技术应用于信用风险数据。我们在机器学习算法上测试石灰并检查其稳定性。最终，我们研究了返回解释的好处。

Nowadays we are witnessing a transformation of the business processes towards a more computation driven approach. The ever increasing usage of Machine Learning techniques is the clearest example of such trend. This sort of revolution is often providing advantages, such as an increase in prediction accuracy and a reduced time to obtain the results. However, these methods present a major drawback: it is very difficult to understand on what grounds the algorithm took the decision. To address this issue we consider the LIME method. We give a general background on LIME then, we focus on the stability issue: employing the method repeated times, under the same conditions, may yield to different explanations. Two complementary indices are proposed, to measure LIME stability. It is important for the practitioner to be aware of the issue, as well as to have a tool for spotting it. Stability guarantees LIME explanations to be reliable, therefore a stability assessment, made through the proposed indices, is crucial. As a case study, we apply both Machine Learning and classical statistical techniques to Credit Risk data. We test LIME on the Machine Learning algorithm and check its stability. Eventually, we examine the goodness of the explanations returned.

下载PDF全文

下载文献需遵守相关版权规定

论文标题