论文标题
从文本中对人姿势的对抗性综合
Adversarial Synthesis of Human Pose from Text
论文作者
论文摘要
这项工作的重点是从人类水平的文本描述中综合人类姿势。我们提出了一个基于条件生成对抗网络的模型。它旨在生成以人写的文本描述为条件的2D人姿势。使用可可数据集对该模型进行了训练和评估,该数据集由捕获各种人类姿势的复杂日常场景的图像组成。我们通过定性和定量结果表明,该模型能够合成与给定文本相匹配的合理姿势,这表明可以生成与给定语义特征一致的姿势,尤其是对于具有独特姿势的动作。
This work focuses on synthesizing human poses from human-level text descriptions. We propose a model that is based on a conditional generative adversarial network. It is designed to generate 2D human poses conditioned on human-written text descriptions. The model is trained and evaluated using the COCO dataset, which consists of images capturing complex everyday scenes with various human poses. We show through qualitative and quantitative results that the model is capable of synthesizing plausible poses matching the given text, indicating that it is possible to generate poses that are consistent with the given semantic features, especially for actions with distinctive poses.