论文标题

meta-sim2:综合数据生成场景结构的无监督学习

Meta-Sim2: Unsupervised Learning of Scene Structure for Synthetic Data Generation

论文作者

Devaranjan, Jeevan, Kar, Amlan, Fidler, Sanja

论文摘要

程序模型被广泛用于合成图形,游戏和创建(标记)ML(标签)合成数据集的场景。为了产生现实和多样化的场景,必须由专家仔细调整有关程序模型的许多参数。这些参数控制着要生成的场景的结构(例如,场景中有多少辆汽车)以及将对象放在有效配置中的参数。 Meta-SIM旨在以无监督的方式自动调整实际图像的目标集合。在Meta-Sim2中,我们旨在除了参数外学习场景结构,这是一个充满挑战的问题,由于其离散的性质。 Meta-SIM2通过学习从给定的概率场景语法中依次样本规则扩展来进行。由于问题的离散性质,我们使用强化学习来训练我们的模型,并在合成的图像和目标图像之间设计功能空间差异,这是成功培训的关键。实际驾驶数据集的实验表明,在没有任何监督的情况下,我们可以成功学习生成数据,以在实际图像中捕获对象的离散结构统计,例如其频率。我们还表明,这导致了在生成的数据集上训练的对象检测器的性能的下游改进,而不是其他基线仿真方法。项目页面:https://nv-tlabs.github.io/meta-sim-scructure/。

Procedural models are being widely used to synthesize scenes for graphics, gaming, and to create (labeled) synthetic datasets for ML. In order to produce realistic and diverse scenes, a number of parameters governing the procedural models have to be carefully tuned by experts. These parameters control both the structure of scenes being generated (e.g. how many cars in the scene), as well as parameters which place objects in valid configurations. Meta-Sim aimed at automatically tuning parameters given a target collection of real images in an unsupervised way. In Meta-Sim2, we aim to learn the scene structure in addition to parameters, which is a challenging problem due to its discrete nature. Meta-Sim2 proceeds by learning to sequentially sample rule expansions from a given probabilistic scene grammar. Due to the discrete nature of the problem, we use Reinforcement Learning to train our model, and design a feature space divergence between our synthesized and target images that is key to successful training. Experiments on a real driving dataset show that, without any supervision, we can successfully learn to generate data that captures discrete structural statistics of objects, such as their frequency, in real images. We also show that this leads to downstream improvement in the performance of an object detector trained on our generated dataset as opposed to other baseline simulation methods. Project page: https://nv-tlabs.github.io/meta-sim-structure/.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源