论文标题
通过建筑工地的理解来改善工人的安全和进度监控建筑工地
Towards Improving Workers' Safety and Progress Monitoring of Construction Sites Through Construction Site Understanding
论文作者
论文摘要
计算机视觉研究的重要组成部分是对象检测。近年来,建筑工地图像的研究取得了巨大进展。但是,施工对象检测存在明显的问题,包括复杂的背景,大小的对象和成像质量差。在最先进的方法中,开发了精心设计的注意机制来处理时空功能,但很少解决渠道功能调整的重要性。我们提出了一个轻巧的优化定位(OP)模块,以改善基于全局特征亲和力关联的通道关系,该关联可用于确定每个通道的优化权重。 OP首先通过将每个通道与给定特征图集的剩余通道进行比较来计算中间优化位置。然后将使用所有通道的加权聚合来表示每个通道。 OP-NET模块是一个通用的深神经网络模块,可以插入任何深神经网络。利用深度学习的算法已经证明了它们几乎实时从图像中识别出广泛对象的能力。机器智能可以通过使用与建筑图像相关的算法自动分析生产率和监视安全性,从而有可能使建筑行业受益。预防危险时,现场自动监测的好处是巨大的。一旦正确识别施工对象,也可以自动化施工监控任务。对施工现场图像中的对象检测任务进行了广泛的实验,以证明其功效和有效性。使用苏打水的基准测试表明,我们的OP-NET能够在保持合理的计算开销的同时,在准确性方面实现新的最新性能。
An important component of computer vision research is object detection. In recent years, there has been tremendous progress in the study of construction site images. However, there are obvious problems in construction object detection, including complex backgrounds, varying-sized objects, and poor imaging quality. In the state-of-the-art approaches, elaborate attention mechanisms are developed to handle space-time features, but rarely address the importance of channel-wise feature adjustments. We propose a lightweight Optimized Positioning (OP) module to improve channel relation based on global feature affinity association, which can be used to determine the Optimized weights adaptively for each channel. OP first computes the intermediate optimized position by comparing each channel with the remaining channels for a given set of feature maps. A weighted aggregation of all the channels will then be used to represent each channel. The OP-Net module is a general deep neural network module that can be plugged into any deep neural network. Algorithms that utilize deep learning have demonstrated their ability to identify a wide range of objects from images nearly in real time. Machine intelligence can potentially benefit the construction industry by automatically analyzing productivity and monitoring safety using algorithms that are linked to construction images. The benefits of on-site automatic monitoring are immense when it comes to hazard prevention. Construction monitoring tasks can also be automated once construction objects have been correctly recognized. Object detection task in construction site images is experimented with extensively to demonstrate its efficacy and effectiveness. A benchmark test using SODA demonstrated that our OP-Net was capable of achieving new state-of-the-art performance in accuracy while maintaining a reasonable computational overhead.