论文标题

从形式和含义中预测偏角类别

Predicting Declension Class from Form and Meaning

论文作者

Williams, Adina, Pimentel, Tiago, McCarthy, Arya D., Blix, Hagen, Chodroff, Eleanor, Cotterell, Ryan

论文摘要

许多自然语言的名词词典被分为具有特征形态特性的几种偏僻类别。班级成员资格远非确定性,但是名词和/或其含义的语音形式通常可以提供不完美的线索。在这里,我们研究了这些线索的强度。更具体地说,我们通过测量多少信息(在位置)可以从了解名词的形式和/或含义中收集到一些信息来实现这一目标。我们知道,形式和含义通常也表明语法性别 - 正如我们进行了量化验证的那样,它们本身可以与dectresension类共享信息 - 因此我们也可以控制性别。我们发现,两种印度 - 欧洲语言(捷克语和德语)分别形成和含义与类别共享大量信息(并在性别上方和超出性别之外的其他信息)。类,形式和意义之间的三向相互作用(给定性别)也很重要。我们的研究很重要,有两个原因:首先,我们引入了一种新方法,该方法为经典的语言发现提供了额外的定量支持,即形式和含义与将名词分类为变为变化有关。其次,我们不仅表明单个校正课程在语言中的线索的强度上有所不同,而且这些变化本身在各种语言中也有所不同。

The noun lexica of many natural languages are divided into several declension classes with characteristic morphological properties. Class membership is far from deterministic, but the phonological form of a noun and/or its meaning can often provide imperfect clues. Here, we investigate the strength of those clues. More specifically, we operationalize this by measuring how much information, in bits, we can glean about declension class from knowing the form and/or meaning of nouns. We know that form and meaning are often also indicative of grammatical gender---which, as we quantitatively verify, can itself share information with declension class---so we also control for gender. We find for two Indo-European languages (Czech and German) that form and meaning respectively share significant amounts of information with class (and contribute additional information above and beyond gender). The three-way interaction between class, form, and meaning (given gender) is also significant. Our study is important for two reasons: First, we introduce a new method that provides additional quantitative support for a classic linguistic finding that form and meaning are relevant for the classification of nouns into declensions. Secondly, we show not only that individual declensions classes vary in the strength of their clues within a language, but also that these variations themselves vary across languages.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源