论文标题
DeepCoda:组成健康数据的个性化解释性
DeepCoDA: personalized interpretability for compositional health data
论文作者
论文摘要
可解释性使域 - 专家可以直接评估模型的相关性和可靠性,这种实践可提供保证并建立信任。在医疗保健环境中,可解释的模型应牵涉到与数据预处理等技术因素无关的相关生物学机制。我们将个性化的解释性定义为衡量特定特征特征归因的量度,并将其视为精确健康模型以证明其结论合理的最低要求。一些健康数据,尤其是由高通量测序实验产生的数据,具有损害精度健康模型及其解释的细微差别。这些数据是组成的,这意味着每个功能都有条件取决于所有其他功能。我们提出了深层组成数据分析(DEEPCODA)框架,以将精确的健康建模扩展到高维组成数据,并通过特定于患者的权重提供个性化的可解释性。我们的体系结构在25个现实世界数据集中保持了最先进的性能,同时产生了个性化且完全连贯的构图数据的解释。
Interpretability allows the domain-expert to directly evaluate the model's relevance and reliability, a practice that offers assurance and builds trust. In the healthcare setting, interpretable models should implicate relevant biological mechanisms independent of technical factors like data pre-processing. We define personalized interpretability as a measure of sample-specific feature attribution, and view it as a minimum requirement for a precision health model to justify its conclusions. Some health data, especially those generated by high-throughput sequencing experiments, have nuances that compromise precision health models and their interpretation. These data are compositional, meaning that each feature is conditionally dependent on all other features. We propose the Deep Compositional Data Analysis (DeepCoDA) framework to extend precision health modelling to high-dimensional compositional data, and to provide personalized interpretability through patient-specific weights. Our architecture maintains state-of-the-art performance across 25 real-world data sets, all while producing interpretations that are both personalized and fully coherent for compositional data.