论文标题

在依赖关系和培训时间的界面上,加利西亚BERT模型的句法能力的计算心理语言评估

A computational psycholinguistic evaluation of the syntactic abilities of Galician BERT models at the interface of dependency resolution and training time

论文作者

de-Dios-Flores, Iria, Garcia, Marcos

论文摘要

本文探讨了变压器模型捕获加利西亚人中捕获主题 - 动词和名词形容性一致性的能力。我们进行了一系列单词预测实验,其中我们操纵依赖性长度以及具有吸引人的吸引子名词的存在。首先,我们评估了加利西亚人现有的单语和多语言模型的整体性能。其次,为了观察训练过程的效果,我们比较了不同训练点上两个单语言模型的不同程度。我们还发布了他们的检查站,并提出了替代评估度量。我们的结果证实了使用“协议预测任务”的类似作品的先前发现,并为变压器模型所需的培训步骤数量提供了有趣的见解,以解决长距离依赖关系。

This paper explores the ability of Transformer models to capture subject-verb and noun-adjective agreement dependencies in Galician. We conduct a series of word prediction experiments in which we manipulate dependency length together with the presence of an attractor noun that acts as a lure. First, we evaluate the overall performance of the existing monolingual and multilingual models for Galician. Secondly, to observe the effects of the training process, we compare the different degrees of achievement of two monolingual BERT models at different training points. We also release their checkpoints and propose an alternative evaluation metric. Our results confirm previous findings by similar works that use the agreement prediction task and provide interesting insights into the number of training steps required by a Transformer model to solve long-distance dependencies.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源