哪些神经网络记住什么以及原因：通过影响估计发现长尾

论文标题

哪些神经网络记住什么以及原因：通过影响估计发现长尾

What Neural Networks Memorize and Why: Discovering the Long Tail via Influence Estimation

论文作者

Feldman, Vitaly, Zhang, Chiyuan

论文摘要

深度学习算法众所周知，具有很好地拟合训练数据的倾向，并且通常适合离群值和标签错误的数据点。这种拟合需要记忆培训数据标签，这一现象引起了重大的研究兴趣，但到目前为止尚未得到令人信服的解释。费尔德曼（Feldman，2019年）的最新作品提出了基于两个见解的结合对这种现象的理论解释。首先，自然图像和数据分布（非正式）已知是长尾巴，这是罕见和非典型示例的很大一部分。其次，在简单的理论模型中，这种记忆对于在长期尾部分布时实现近距离的概括误差是必不可少的。但是，没有给出这种解释的直接经验证据，甚至没有给出获得此类证据的方法。在这项工作中，我们设计了实验来测试该理论中的关键思想。实验需要估计每个训练示例对每个测试示例准确性的影响以及训练示例的记忆值。直接估计这些数量是计算上的过度效果，但我们表明，密切相关的子采样影响和记忆值可以更有效地估计。我们的实验证明了记忆对几个标准基准的概括的重大好处。他们还为在中提出的理论提供了定量和视觉上令人信服的证据（Feldman，2019年）。

Deep learning algorithms are well-known to have a propensity for fitting the training data very well and often fit even outliers and mislabeled data points. Such fitting requires memorization of training data labels, a phenomenon that has attracted significant research interest but has not been given a compelling explanation so far. A recent work of Feldman (2019) proposes a theoretical explanation for this phenomenon based on a combination of two insights. First, natural image and data distributions are (informally) known to be long-tailed, that is have a significant fraction of rare and atypical examples. Second, in a simple theoretical model such memorization is necessary for achieving close-to-optimal generalization error when the data distribution is long-tailed. However, no direct empirical evidence for this explanation or even an approach for obtaining such evidence were given. In this work we design experiments to test the key ideas in this theory. The experiments require estimation of the influence of each training example on the accuracy at each test example as well as memorization values of training examples. Estimating these quantities directly is computationally prohibitive but we show that closely-related subsampled influence and memorization values can be estimated much more efficiently. Our experiments demonstrate the significant benefits of memorization for generalization on several standard benchmarks. They also provide quantitative and visually compelling evidence for the theory put forth in (Feldman, 2019).

下载PDF全文

下载文献需遵守相关版权规定

论文标题