14UJ/决策关键字发现加速器，并通过SRAM计算和芯片学习进行自定义

论文标题

14UJ/决策关键字发现加速器，并通过SRAM计算和芯片学习进行自定义

A 14uJ/Decision Keyword Spotting Accelerator with In-SRAM-Computing and On Chip Learning for Customization

论文作者

Chiang, Yu-Hsiang, Chang, Tian-Sheuan, Jou, Shyh Jye

论文摘要

近年来，关键字斑点已成为一种与消费者设备互动的自然方式。但是，由于其始终持续的性质和各种语音，因此需要进行低功率设计以及用户定制。本文介绍了使用基于SRAM的内存计算（IMC）和用于用户自定义的芯片学习的低功率，节能关键字点斑点加速器。但是，IMC受到宏观大小，精度有限和非理想效应的约束。为了解决上述问题，本文提出了使用IMC感知模型设计的偏差补偿和微调。此外，由于使用低精度边缘设备学习导致零误差和梯度值，因此本文提出了误差缩放和较小的梯度积累，以达到与理想模型训练相同的准确性。仿真结果表明，通过用户自定义，我们可以通过补偿和微调将准确性损失从51.08 \％\％恢复到89.76 \％，并通过自定义进一步提高到96.71 \％。芯片实施可以成功运行该模型，每个决定只有14 $ uj $。与最先进的作品相比，提出的设计具有更高的能源效率，具有额外的芯片模型自定义功能，以提高准确性。

Keyword spotting has gained popularity as a natural way to interact with consumer devices in recent years. However, because of its always-on nature and the variety of speech, it necessitates a low-power design as well as user customization. This paper describes a low-power, energy-efficient keyword spotting accelerator with SRAM based in-memory computing (IMC) and on-chip learning for user customization. However, IMC is constrained by macro size, limited precision, and non-ideal effects. To address the issues mentioned above, this paper proposes bias compensation and fine-tuning using an IMC-aware model design. Furthermore, because learning with low-precision edge devices results in zero error and gradient values due to quantization, this paper proposes error scaling and small gradient accumulation to achieve the same accuracy as ideal model training. The simulation results show that with user customization, we can recover the accuracy loss from 51.08\% to 89.76\% with compensation and fine-tuning and further improve to 96.71\% with customization. The chip implementation can successfully run the model with only 14$uJ$ per decision. When compared to the state-of-the-art works, the presented design has higher energy efficiency with additional on-chip model customization capabilities for higher accuracy.

下载PDF全文

下载文献需遵守相关版权规定

论文标题