论文标题
嵌套:通过弱监督隔离共同因素
NestedVAE: Isolating Common Factors via Weak Supervision
论文作者
论文摘要
公平,公正的机器学习是一个重要且积极的研究领域,因为决策过程越来越多地由从数据中学习的模型驱动。不幸的是,数据中存在的任何偏差都可以通过模型来学到,从而不适当地将这些偏见转移到决策过程中。我们确定了减少偏差的任务与域之间共同的隔离因素的联系,同时鼓励域特定的不变性。为了隔离常见因素,我们将深层变量模型的理论与信息瓶颈理论结合在一起,以使数据自然地跨域进行配对,并且不需要其他监督。结果是嵌套的变异自动编码器(NestedVae)。两个具有共同权重的外部VAE试图重建输入并推断潜在空间,而嵌套的VAE试图从其配对图像的潜在表示中重建一个图像的潜在表示。这样一来,嵌套的vae隔离了常见的潜在因素/原因,并成为成对图像之间未共享的不必要因素。我们还提出了一个新的指标,以提供一种平衡的方法,以评估跨域的一致性和分类器性能,我们称为调整后的平等度量指标。在域和属性不变性,变化检测和学习生物性别预测的共同因素上的嵌套评估表明,嵌套显着优于替代方法。
Fair and unbiased machine learning is an important and active field of research, as decision processes are increasingly driven by models that learn from data. Unfortunately, any biases present in the data may be learned by the model, thereby inappropriately transferring that bias into the decision making process. We identify the connection between the task of bias reduction and that of isolating factors common between domains whilst encouraging domain specific invariance. To isolate the common factors we combine the theory of deep latent variable models with information bottleneck theory for scenarios whereby data may be naturally paired across domains and no additional supervision is required. The result is the Nested Variational AutoEncoder (NestedVAE). Two outer VAEs with shared weights attempt to reconstruct the input and infer a latent space, whilst a nested VAE attempts to reconstruct the latent representation of one image, from the latent representation of its paired image. In so doing, the nested VAE isolates the common latent factors/causes and becomes invariant to unwanted factors that are not shared between paired images. We also propose a new metric to provide a balanced method of evaluating consistency and classifier performance across domains which we refer to as the Adjusted Parity metric. An evaluation of NestedVAE on both domain and attribute invariance, change detection, and learning common factors for the prediction of biological sex demonstrates that NestedVAE significantly outperforms alternative methods.