进化神经结构搜索算法的性能预测指标的新型培训方案

论文标题

进化神经结构搜索算法的性能预测指标的新型培训方案

A Novel Training Protocol for Performance Predictors of Evolutionary Neural Architecture Search Algorithms

论文作者

Sun, Yanan, Sun, Xian, Fang, Yuhan, Yen, Gary

论文摘要

进化神经体系结构搜索（ENA）可以使用进化计算算法自动设计深神经网络（DNN）的体系结构。但是，大多数ENA算法都需要密集的计算资源，这不一定可供感兴趣的用户使用。性能预测因素是一种回归模型，可以帮助完成搜索，同时而无需提供大量计算资源。尽管设计了各种性能预测因素，但他们采用相同的培训协议来构建回归模型：1）以训练数据集为绩效，2）使用均方误差标准训练模型，以及3）预测ENAS期间新生成的DNN的性能。在本文中，我们指出，构成培训方案的三个步骤并不是通过直观和说明性的例子出来的。此外，我们提出了一种新的培训协议来解决这些问题，包括设计成对的排名指标来构建培训目标，建议使用逻辑回归来适合培训样本，并开发一种差异方法来构建培训实例。为了验证拟议的培训协议的有效性，已经选择了机器学习领域的四个广泛使用的回归模型来对两个基准数据集进行比较。所有比较的实验结果表明，提出的培训方案可以显着提高针对传统培训方案的性能预测准确性。

Evolutionary Neural Architecture Search (ENAS) can automatically design the architectures of Deep Neural Networks (DNNs) using evolutionary computation algorithms. However, most ENAS algorithms require intensive computational resource, which is not necessarily available to the users interested. Performance predictors are a type of regression models which can assist to accomplish the search, while without exerting much computational resource. Despite various performance predictors have been designed, they employ the same training protocol to build the regression models: 1) sampling a set of DNNs with performance as the training dataset, 2) training the model with the mean square error criterion, and 3) predicting the performance of DNNs newly generated during the ENAS. In this paper, we point out that the three steps constituting the training protocol are not well though-out through intuitive and illustrative examples. Furthermore, we propose a new training protocol to address these issues, consisting of designing a pairwise ranking indicator to construct the training target, proposing to use the logistic regression to fit the training samples, and developing a differential method to building the training instances. To verify the effectiveness of the proposed training protocol, four widely used regression models in the field of machine learning have been chosen to perform the comparisons on two benchmark datasets. The experimental results of all the comparisons demonstrate that the proposed training protocol can significantly improve the performance prediction accuracy against the traditional training protocols.

下载PDF全文

下载文献需遵守相关版权规定

论文标题