论文标题

用于机器学习的光子张量核心

Photonic tensor cores for machine learning

论文作者

Miscuglio, Mario, Sorger, Volker J.

论文摘要

随着计算硬件朝着增加异质性的持续趋势,域特异性的处理器正在成为集中式范式的替代方案。张量核心单元(TPU)已证明,通过较高的信号和能源效率,几乎由较高的信号启用了几乎3个幅度的图形过程单位。在这种情况下,光子具有许多协同的物理特性,而相变材料则允许在这些新兴分布的非Van-Neumann架构中局部非易失性助记符功能。虽然已经探索了几种光子神经网络设计,但执行矩阵矢量乘法和求和的光子TPU尚未出色。 Here we introduced an integrated photonics-based TPU by strategically utilizing a) photonic parallelism via wavelength division multiplexing, b) high 2 Peta-operations-per second throughputs enabled by 10s of picosecond-short delays from optoelectronics and compact photonic integrated circuitry, and c) zero power-consuming novel photonic multi-state memories based on phase-change materials featuring vanishing losses in the无定形状态。结合了材料,功能和系统的这些物理协同作用,我们表明,与电气TPU相比,该8位光子TPU的性能可以高2-3个订单,同时具有相似的芯片区域。这项工作表明,光子专用处理器有潜力增强电子系统,并且可能在迫在眉睫的5G网络及其他地区的网络边缘设备中表现出色。

With an ongoing trend in computing hardware towards increased heterogeneity, domain-specific co-processors are emerging as alternatives to centralized paradigms. The tensor core unit (TPU) has shown to outperform graphic process units by almost 3-orders of magnitude enabled by higher signal throughout and energy efficiency. In this context, photons bear a number of synergistic physical properties while phase-change materials allow for local nonvolatile mnemonic functionality in these emerging distributed non van-Neumann architectures. While several photonic neural network designs have been explored, a photonic TPU to perform matrix vector multiplication and summation is yet outstanding. Here we introduced an integrated photonics-based TPU by strategically utilizing a) photonic parallelism via wavelength division multiplexing, b) high 2 Peta-operations-per second throughputs enabled by 10s of picosecond-short delays from optoelectronics and compact photonic integrated circuitry, and c) zero power-consuming novel photonic multi-state memories based on phase-change materials featuring vanishing losses in the amorphous state. Combining these physical synergies of material, function, and system, we show that the performance of this 8-bit photonic TPU can be 2-3 orders higher compared to an electrical TPU whilst featuring similar chip areas. This work shows that photonic specialized processors have the potential to augment electronic systems and may perform exceptionally well in network-edge devices in the looming 5G networks and beyond.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源