论文标题
关于通过扩散模型生成的合成图像的检测
On the detection of synthetic images generated by diffusion models
论文作者
论文摘要
在过去的十年中,创建合成媒体取得了巨大进展,这主要归功于基于生成的对抗网络(GAN)的强大方法的发展。最近,基于扩散模型(DM)的方法已引起人们的关注。除了提供令人印象深刻的光真实主义水平之外,它们还可以创建基于文本的视觉内容,在从艺术到视频游戏的许多不同应用领域中开辟了新的令人兴奋的机会。另一方面,该属性是恶意用户手中的另一项资产,他们可以完美地适应其攻击,并向媒体法医社区提出新的挑战。通过这项工作,我们试图了解将扩散模型与原始模型产生的合成图像以及当前最新检测器是否适合任务是多么困难。为此,首先,我们揭示了扩散模型留下的取证痕迹,然后研究为GAN生成图像开发的当前检测器如何在这些新的合成图像上执行,尤其是在挑战涉及图像压缩和调整大小的社交网络场景中。数据集和代码可在github.com/grip-unina/dmimimagedetection上找到。
Over the past decade, there has been tremendous progress in creating synthetic media, mainly thanks to the development of powerful methods based on generative adversarial networks (GAN). Very recently, methods based on diffusion models (DM) have been gaining the spotlight. In addition to providing an impressive level of photorealism, they enable the creation of text-based visual content, opening up new and exciting opportunities in many different application fields, from arts to video games. On the other hand, this property is an additional asset in the hands of malicious users, who can generate and distribute fake media perfectly adapted to their attacks, posing new challenges to the media forensic community. With this work, we seek to understand how difficult it is to distinguish synthetic images generated by diffusion models from pristine ones and whether current state-of-the-art detectors are suitable for the task. To this end, first we expose the forensics traces left by diffusion models, then study how current detectors, developed for GAN-generated images, perform on these new synthetic images, especially in challenging social-networks scenarios involving image compression and resizing. Datasets and code are available at github.com/grip-unina/DMimageDetection.