Backbones-Review: Feature Extraction Networks for Deep Learning and Deep Reinforcement Learning Approaches

Paper Title

Backbones-Review: Feature Extraction Networks for Deep Learning and Deep Reinforcement Learning Approaches

Authors

Omar Elharrouss, Younes Akbari, Noor Almaadeed, Somaya Al-Maadeed

Abstract

Artificial Intelligence (AI) is currently the most widely used technique for understanding the real world from various types of data. Its main task is to find patterns within the analyzed data. This is done through a feature extraction step, traditionally carried out with statistical algorithms or specific filters. However, selecting useful features from large-scale data has been a crucial challenge. With the development of convolutional neural networks (CNNs), feature extraction has become more automatic and easier. CNNs can process large-scale data and cover different scenarios for a specific task. In computer vision tasks, convolutional networks are also used to extract features for the other parts of a deep learning (DL) model. Selecting a suitable network for feature extraction, or for the other parts of a DL model, is not arbitrary: the choice depends on the target task and on the network's computational complexity. Many networks have been proposed and have become widely used in DL models across AI tasks. When such a network is used for feature extraction at the beginning of a DL model, it is called a backbone. A backbone is a known network that has previously been trained on many other tasks and has demonstrated its effectiveness. This paper gives an overview of existing backbones, e.g. VGGs, ResNets, and DenseNet, with detailed descriptions. It also discusses several computer vision tasks by reviewing the backbones used for each. In addition, it compares performance based on the backbone used for each task.
