论文标题
通过判别模型评估生成对抗网络
On the Evaluation of Generative Adversarial Networks By Discriminative Models
论文作者
论文摘要
生成对抗网络(GAN)可以准确地对复杂的多维数据进行建模并生成逼真的样本。但是,由于它们对数据分布的隐式估计,其评估是一项艰巨的任务。与解决此问题相关的大多数研究工作都通过定性视觉评估来验证。这种方法并不能超越图像域。由于许多评估指标是提出并绑定到视觉领域的,因此它们很难应用于其他域。必须采取定量措施,以更好地指导不同gan模型的训练和比较。在这项工作中,我们利用暹罗神经网络提出了一个域 - 不合稳定的评估度量:(1)具有与人类评估相一致的定性评估,(2)相对于常见的GAN问题,例如模式下降和发明,((3))不需要任何预告片的分类器。本文的经验结果证明了该方法的优势与流行的发起评分相比,并且与FID分数具有竞争力。
Generative Adversarial Networks (GANs) can accurately model complex multi-dimensional data and generate realistic samples. However, due to their implicit estimation of data distributions, their evaluation is a challenging task. The majority of research efforts associated with tackling this issue were validated by qualitative visual evaluation. Such approaches do not generalize well beyond the image domain. Since many of those evaluation metrics are proposed and bound to the vision domain, they are difficult to apply to other domains. Quantitative measures are necessary to better guide the training and comparison of different GANs models. In this work, we leverage Siamese neural networks to propose a domain-agnostic evaluation metric: (1) with a qualitative evaluation that is consistent with human evaluation, (2) that is robust relative to common GAN issues such as mode dropping and invention, and (3) does not require any pretrained classifier. The empirical results in this paper demonstrate the superiority of this method compared to the popular Inception Score and are competitive with the FID score.