论文标题
szloca:在互动艺术的背景下,通过单个相机进行完整3D跟踪的框架
Szloca: towards a framework for full 3D tracking through a single camera in context of interactive arts
论文作者
论文摘要
实时的物体和人类存在在大面积的虚拟虚拟数据是在人工智能的技术发展中实现许多经验和应用的重要关键,随着人工智能的技术发展的指数增长,计算机视觉扩大了跟踪和分类的可能性。 容量。在应用程序开发中使用计算机视觉的好处很大,因为它增加了传统的输入源(例如视频流),并且可以集成在许多环境和平台中。在新的媒体互动艺术的背景下,基于身体运动并在大面积或狂热的情况下扩展,这项研究提出了一种新颖的方式和一个框架,以获取对象/人的数据和虚拟表示,例如三维位置,Skeltons/pose和单个RGB摄像机的面具。通过最近的一些发展和计算机视觉领域的先前研究来研究艺术的状态,本文还提出了一种从单眼图像获得三维位置数据的原始方法,该模型不依赖计算机视觉系统的复杂培训,而是结合了先前的计算机视觉研究并增加了代表Z Depth的能力,IETO代表了2D Intup的3 Axis中的世界位置。
Realtime virtual data of objects and human presence in a large area holds a valuable key in enabling many experiences and applications in various industries and with exponential rise in the technological development of artificial intelligence, computer vision has expanded the possibilities of tracking and classifying things through just video inputs, which is also surpassing the limitations of most popular and common hardware setups known traditionally to detect human pose and position, such as low field of view and limited tracking capacity. The benefits of using computer vision in application development is large as it augments traditional input sources (like video streams) and can be integrated in many environments and platforms. In the context of new media interactive arts, based on physical movements and expanding over large areas or gallaries, this research presents a novel way and a framework towards obtaining data and virtual representation of objects/people - such as three-dimensional positions, skeltons/pose and masks from a single rgb camera. Looking at the state of art through some recent developments and building on prior research in the field of computer vision, the paper also proposes an original method to obtain three dimensional position data from monocular images, the model does not rely on complex training of computer vision systems but combines prior computer vision research and adds a capacity to represent z depth, ieto represent a world position in 3 axis from a 2d input source.