因果学习算法的评估方法和措施

论文标题

因果学习算法的评估方法和措施

Evaluation Methods and Measures for Causal Learning Algorithms

论文作者

Cheng, Lu, Guo, Ruocheng, Moraffah, Raha, Sheth, Paras, Candan, K. Selcuk, Liu, Huan

论文摘要

方便地访问丰富的多面数据，鼓励机器学习研究人员重新考虑基于相关性的学习，并拥抱基于因果关系的学习机会，即因果机器学习（因果学习）。因此，近年来见证了旨在开发旨在帮助AI实现人类智能的因果学习算法的巨大努力。由于缺乏基础数据，当前因果学习研究中最大的挑战之一是算法评估。这在很大程度上阻碍了人工智能和因果推断的交叉授粉，并阻碍了这两个领域从另一个领域的进步中受益。从传统的因果推理（即基于统计方法）到通过大数据（即因果推理和机器学习的交集）到因果学习，在本调查中，我们审查了使用与传统机器相似的评估管道的普遍使用的数据集，评估方法和因果学习的措施。我们专注于两项基本因果推理任务和因果感知机器学习任务。还讨论了当前评估程序的局限性。然后，我们检查了流行的因果推理工具/软件包，并以大数据时代的因果学习算法为基准的主要挑战和机会结束。该调查旨在将公开可用的基准和建立共识标准用于因果学习评估，并带有与观察数据的因果学习评估的紧迫性。在此过程中，我们希望扩大讨论并促进合作，以推动因果学习的创新和应用。

The convenient access to copious multi-faceted data has encouraged machine learning researchers to reconsider correlation-based learning and embrace the opportunity of causality-based learning, i.e., causal machine learning (causal learning). Recent years have therefore witnessed great effort in developing causal learning algorithms aiming to help AI achieve human-level intelligence. Due to the lack-of ground-truth data, one of the biggest challenges in current causal learning research is algorithm evaluations. This largely impedes the cross-pollination of AI and causal inference, and hinders the two fields to benefit from the advances of the other. To bridge from conventional causal inference (i.e., based on statistical methods) to causal learning with big data (i.e., the intersection of causal inference and machine learning), in this survey, we review commonly-used datasets, evaluation methods, and measures for causal learning using an evaluation pipeline similar to conventional machine learning. We focus on the two fundamental causal-inference tasks and causality-aware machine learning tasks. Limitations of current evaluation procedures are also discussed. We then examine popular causal inference tools/packages and conclude with primary challenges and opportunities for benchmarking causal learning algorithms in the era of big data. The survey seeks to bring to the forefront the urgency of developing publicly available benchmarks and consensus-building standards for causal learning evaluation with observational data. In doing so, we hope to broaden the discussions and facilitate collaboration to advance the innovation and application of causal learning.

下载PDF全文

下载文献需遵守相关版权规定

论文标题