估计面部模型的结构差异

论文标题

估计面部模型的结构差异

Estimating Structural Disparities for Face Models

论文作者

Ardeshir, Shervin, Segalin, Cristina, Kallus, Nathan

论文摘要

在机器学习中，通常通过测量模型的性能或结果的差异，跨数据点的不同子选项（组）来定义差异指标。因此，差异量化的输入包括模型的预测$ \ hat {y} $，预测$ y $的地面真相标签以及数据点的组标签$ g $。每个组模型的性能是通过比较特定组中数据点的$ \ hat {y} $和$ y $来计算的，结果，可以计算各个组的性能差异。但是，在许多现实世界中，在培训和验证时间内可能无法大规模提供小组标签（$ g $），或者收集它们可能是不可行或不可取的，因为它们通常是敏感的信息。结果，评估跨分类组的差异指标是不可行的。另一方面，在许多情况下，可以使用某种形式的代理可以获得嘈杂的分组，这将允许在子人群中测量差异指标。在这里，我们探索对在人脸上训练的计算机视觉模型以及诸如面部属性预测和影响估计等任务的计算机视觉模型进行此类分析。我们的实验表明，由现成的面部识别模型产生的嵌入可以有意义地作为此类估计的代理。

In machine learning, disparity metrics are often defined by measuring the difference in the performance or outcome of a model, across different sub-populations (groups) of datapoints. Thus, the inputs to disparity quantification consist of a model's predictions $\hat{y}$, the ground-truth labels for the predictions $y$, and group labels $g$ for the data points. Performance of the model for each group is calculated by comparing $\hat{y}$ and $y$ for the datapoints within a specific group, and as a result, disparity of performance across the different groups can be calculated. In many real world scenarios however, group labels ($g$) may not be available at scale during training and validation time, or collecting them might not be feasible or desirable as they could often be sensitive information. As a result, evaluating disparity metrics across categorical groups would not be feasible. On the other hand, in many scenarios noisy groupings may be obtainable using some form of a proxy, which would allow measuring disparity metrics across sub-populations. Here we explore performing such analysis on computer vision models trained on human faces, and on tasks such as face attribute prediction and affect estimation. Our experiments indicate that embeddings resulting from an off-the-shelf face recognition model, could meaningfully serve as a proxy for such estimation.

下载PDF全文

下载文献需遵守相关版权规定

论文标题