SPACE: Speech-driven Portrait Animation with Controllable Expression

Paper Title

SPACE: Speech-driven Portrait Animation with Controllable Expression

Authors

Siddharth Gururani, Arun Mallya, Ting-Chun Wang, Rafael Valle, Ming-Yu Liu

Abstract

Animating portraits using speech has received growing attention in recent years, with various creative and practical use cases. An ideal generated video should have good lip sync with the audio, natural facial expressions and head motions, and high frame quality. In this work, we present SPACE, which uses speech and a single image to generate high-resolution, expressive videos with realistic head pose, without requiring a driving video. It uses a multi-stage approach, combining the controllability of facial landmarks with the high-quality synthesis power of a pretrained face generator. SPACE also allows for the control of emotions and their intensities. Our method outperforms prior methods in objective metrics for image quality and facial motions and is strongly preferred by users in pairwise comparisons. The project website is available at https://deepimagination.cc/SPACE/


Paper Title