论文标题
HLVU:测试对电影的深入了解的新挑战
HLVU : A New Challenge to Test Deep Understanding of Movies the Way Humans do
论文作者
论文摘要
在本文中,我们在高级视频理解领域提出了一个新的评估挑战和方向。我们提出的挑战旨在测试自动视频分析和理解,以及系统如何准确地了解电影,实体,事件及其彼此之间的关系。为人类评估者收集了开源电影的高级视频理解(HLVU)数据集,以构建代表每个人的知识图。一组查询将从知识图派生到有关检索参与者之间关系的测试系统,以及推理和检索非视觉概念。目的是基于计算机系统可以像人类观看相同电影时“不明确”但明显的关系“理解”不明显但明显的关系。这是文本域中正在解决的长期存在的问题,该项目与视频域相似。这种性质的工作是未来视频分析和视频理解技术的基础。流媒体服务和广播公司希望为客户提供与视频内容互动和消费视频内容的更多直观方式。
In this paper we propose a new evaluation challenge and direction in the area of High-level Video Understanding. The challenge we are proposing is designed to test automatic video analysis and understanding, and how accurately systems can comprehend a movie in terms of actors, entities, events and their relationship to each other. A pilot High-Level Video Understanding (HLVU) dataset of open source movies were collected for human assessors to build a knowledge graph representing each of them. A set of queries will be derived from the knowledge graph to test systems on retrieving relationships among actors, as well as reasoning and retrieving non-visual concepts. The objective is to benchmark if a computer system can "understand" non-explicit but obvious relationships the same way humans do when they watch the same movies. This is long-standing problem that is being addressed in the text domain and this project moves similar research to the video domain. Work of this nature is foundational to future video analytics and video understanding technologies. This work can be of interest to streaming services and broadcasters hoping to provide more intuitive ways for their customers to interact with and consume video content.