舞蹈：可区分的加速器/网络共同探索

论文标题

舞蹈：可区分的加速器/网络共同探索

DANCE: Differentiable Accelerator/Network Co-Exploration

论文作者

Choi, Kanghyun, Hong, Deokki, Yoon, Hojae, Yu, Joonsang, Kim, Youngsok, Lee, Jinho

论文摘要

为了应对DNN执行的计算需求不断增加，最近的神经体系结构搜索（NAS）算法考虑了硬件成本指标，例如GPU延迟。为了进一步追求快速，有效的执行，正在为多种目的而设计DNN专业的硬件加速器，这远远超出了GPU的效率。但是，这些与硬件相关的指标已被证明可以与网络体系结构表现出非线性关系。因此，针对加速器优化网络或针对网络优化加速器，成为鸡和蛋的问题。在这种情况下，这项工作提出了舞蹈，这是一种可区分的方法，用于共同探索硬件加速器和网络架构设计。舞蹈的核心是一个可区分的评估者网络。通过使用神经网络对硬件评估软件进行建模，加速器体系结构和硬件指标之间的关系变得可以微分，从而可以通过反向传播执行搜索。与现有的天真方法相比，我们的方法在短时间内进行了共同探索，同时实现了卓越的准确性和硬件成本指标。

To cope with the ever-increasing computational demand of the DNN execution, recent neural architecture search (NAS) algorithms consider hardware cost metrics into account, such as GPU latency. To further pursue a fast, efficient execution, DNN-specialized hardware accelerators are being designed for multiple purposes, which far-exceeds the efficiency of the GPUs. However, those hardware-related metrics have been proven to exhibit non-linear relationships with the network architectures. Therefore it became a chicken-and-egg problem to optimize the network against the accelerator, or to optimize the accelerator against the network. In such circumstances, this work presents DANCE, a differentiable approach towards the co-exploration of the hardware accelerator and network architecture design. At the heart of DANCE is a differentiable evaluator network. By modeling the hardware evaluation software with a neural network, the relation between the accelerator architecture and the hardware metrics becomes differentiable, allowing the search to be performed with backpropagation. Compared to the naive existing approaches, our method performs co-exploration in a significantly shorter time, while achieving superior accuracy and hardware cost metrics.

下载PDF全文

下载文献需遵守相关版权规定

论文标题