Paper Title

Incremental Task Learning with Incremental Rank Updates

Paper Authors

Rakib Hyder, Ken Shao, Boyu Hou, Panos Markopoulos, Ashley Prater-Bennette, M. Salman Asif

Paper Abstract

Incremental Task Learning (ITL) is a category of continual learning that seeks to train a single network for multiple tasks (one after another), where training data for each task is only available during the training of that task. Neural networks tend to forget older tasks when they are trained on newer tasks; this property is often known as catastrophic forgetting. To address this issue, ITL methods use episodic memory, parameter regularization, masking and pruning, or extensible network structures. In this paper, we propose a new incremental task learning framework based on low-rank factorization. In particular, we represent the network weights for each layer as a linear combination of several rank-1 matrices. To update the network for a new task, we learn a rank-1 (or low-rank) matrix and add it to the weights of every layer. We also introduce an additional selector vector that assigns different weights to the low-rank matrices learned for the previous tasks. We show that our approach performs better than the current state-of-the-art methods in terms of accuracy and forgetting. Our method also offers better memory efficiency compared to episodic memory- and mask-based approaches. Our code will be available at https://github.com/CSIPlab/task-increment-rank-update.git
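Below is a minimal PyTorch sketch of the idea described in the abstract, not the authors' released code: the class name, initialization scale, and single-layer scope are illustrative assumptions. Each call to add_task freezes all previously learned factors, appends one trainable rank-1 pair (for simplicity; the paper also allows rank > 1 updates), and creates a per-task selector vector that re-weights every factor accumulated so far, so the weights used for any earlier task can be recomposed exactly at inference.

```python
import torch
import torch.nn as nn


class RankOneIncrementalLinear(nn.Module):
    """Sketch of a linear layer whose weight is a weighted sum of per-task
    rank-1 factors: W_t = sum_i alpha_t[i] * (u_i @ v_i). Factors from earlier
    tasks are frozen; each new task adds one trainable rank-1 pair and its own
    trainable selector vector alpha_t over all factors learned so far."""

    def __init__(self, in_features, out_features):
        super().__init__()
        self.in_features = in_features
        self.out_features = out_features
        self.us = nn.ParameterList()         # left factors, one per task (out x 1)
        self.vs = nn.ParameterList()         # right factors, one per task (1 x in)
        self.selectors = nn.ParameterList()  # per-task selector vectors alpha_t

    def add_task(self):
        """Freeze everything learned so far, then add one rank-1 update and a
        selector vector spanning all factors accumulated up to this task."""
        for p in self.parameters():
            p.requires_grad_(False)
        self.us.append(nn.Parameter(0.02 * torch.randn(self.out_features, 1)))
        self.vs.append(nn.Parameter(0.02 * torch.randn(1, self.in_features)))
        self.selectors.append(nn.Parameter(torch.ones(len(self.us))))

    def weight(self, task_id):
        """Recompose the weight matrix for a given task from its selector."""
        alpha = self.selectors[task_id]
        return sum(alpha[i] * (self.us[i] @ self.vs[i])
                   for i in range(task_id + 1))

    def forward(self, x, task_id):
        return x @ self.weight(task_id).t()


# Hypothetical usage: add a task, train it, then move on; old weights stay
# recoverable because each task keeps its own frozen selector.
layer = RankOneIncrementalLinear(784, 256)
layer.add_task()   # before training task 0
layer.add_task()   # before training task 1 (task-0 factors are now frozen)
out = layer(torch.randn(8, 784), task_id=0)  # recomposes task-0 weights
```

Since each task stores only a rank-1 pair and a selector vector per layer (roughly m + n + t extra numbers for an m-by-n weight matrix at task t), no episodic memory or per-weight mask is required, which is the intuition behind the memory-efficiency claim in the abstract.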
