论文标题
描述有向无环图内的确定性变量(DAGS):识别和解释涉及重言式关联,组成数据和复合变量的因果效应的帮助
Depicting deterministic variables within directed acyclic graphs (DAGs): An aid for identifying and interpreting causal effects involving tautological associations, compositional data, and composite variables
论文作者
论文摘要
确定性变量是由一个或多个父变量完全解释的变量。当变量是由一个或多个父变量(如复合变量)以及组成数据中的代数构建时,它们通常会产生,其中“整个”变量是从其“部分”确定的。 本文介绍了如何在定向的无环图(DAG)中描述确定性变量,以帮助识别和解释涉及重言式关联,组成数据和复合变量的因果效应。我们提出了一种两步的方法,其中最初考虑所有变量,然后做出明确的选择,无论是专注于确定性变量还是确定的父母。 描绘DAG中的确定性变量带来了一些好处。更容易识别和避免误解重言式关联,即具有共享代数父变量的变量之间的自我实现关联。在组成数据中,更容易理解对“整体”变量的调理的后果,并正确识别总和因果关系。对于综合变量,它鼓励对目标估计和更大的审查进行更大的考虑,并对一致性和交换性假设进行更大的审查。 具有确定变量的DAG是计划和解释涉及重言式关联,组成数据和/或复合变量的有用帮助。
Deterministic variables are variables that are fully explained by one or more parent variables. They commonly arise when a variable has been algebraically constructed from one or more parent variables, as with composite variables, and in compositional data, where the 'whole' variable is determined from its 'parts'. This article introduces how deterministic variables may be depicted within directed acyclic graphs (DAGs) to help with identifying and interpreting causal effects involving tautological associations, compositional data, and composite variables. We propose a two-step approach in which all variables are initially considered, and an explicit choice is then made whether to focus on the deterministic variable(s) or the determining parents. Depicting deterministic variables within DAGs bring several benefits. It is easier to identify and avoid misinterpreting tautological associations, i.e., self-fulfilling associations between variables with shared algebraic parent variables. In compositional data, it is easier to understand the consequences of conditioning on the 'whole' variable, and correctly identify total and relative causal effects. For composite variables, it encourages greater consideration of the target estimand and greater scrutiny of the consistency and exchangeability assumptions. DAGs with deterministic variables are a useful aid for planning and interpreting analyses involving tautological associations, compositional data, and/or composite variables.