论文标题
视觉变压器的数据级彩票票证假设
Data Level Lottery Ticket Hypothesis for Vision Transformers
论文作者
论文摘要
传统的彩票假说(LTH)声称,密集的神经网络中存在一个稀疏的子网和称为获胜票的适当随机初始化方法,因此可以从scratch训练到几乎与密集的对应物一样好。同时,几乎无法评估VISION变压器(VIT)中LTH的研究。在本文中,我们首先表明,很难通过现有方法在VIT的重量水平上找到传统的获胜票。然后,我们将VIT的LTH推广到输入数据,该输入数据由受VIT的输入依赖性启发的图像补丁组成。也就是说,存在一部分输入图像贴片,以便可以通过仅使用这一子集贴片来从头开始训练VIT,并实现与使用所有图像贴片训练的VIT相似的精度。我们称此输入补丁的这子集em获奖票,该票代表了输入数据中的大量信息。我们使用票务选择器根据各种VIT的补丁信息(包括DEIT,LV-VIT和SWIN Transformers)的补丁信息来生产获奖门票。实验表明,通过获奖门票训练的模型的性能与随机选择的子集之间存在明显的差异,这验证了我们提出的理论。我们详细介绍了我们提出的数据lth-vits和常规LTH之间的类似性相似性,以进一步验证我们理论的完整性。源代码可在https://github.com/shawnricecake/vit-lottery-ticket input中找到。
The conventional lottery ticket hypothesis (LTH) claims that there exists a sparse subnetwork within a dense neural network and a proper random initialization method called the winning ticket, such that it can be trained from scratch to almost as good as the dense counterpart. Meanwhile, the research of LTH in vision transformers (ViTs) is scarcely evaluated. In this paper, we first show that the conventional winning ticket is hard to find at the weight level of ViTs by existing methods. Then, we generalize the LTH for ViTs to input data consisting of image patches inspired by the input dependence of ViTs. That is, there exists a subset of input image patches such that a ViT can be trained from scratch by using only this subset of patches and achieve similar accuracy to the ViTs trained by using all image patches. We call this subset of input patches the em winning tickets, which represent a significant amount of information in the input data. We use a ticket selector to generate the winning tickets based on the informativeness of patches for various types of ViT, including DeiT, LV-ViT, and Swin Transformers. The experiments show that there is a clear difference between the performance of models trained with winning tickets and randomly selected subsets, which verifies our proposed theory. We elaborate on the analogical similarity between our proposed Data-LTH-ViTs and the conventional LTH to further verify the integrity of our theory. The Source codes are available at https://github.com/shawnricecake/vit-lottery-ticket-input.