论文标题
对注释的观察
Observations on Annotations
论文作者
论文摘要
文本信息的注释是语言学和计算语言学中的基本活动。本文介绍了有关注释的各种观察。它从高文本,计算语言学和语言技术,人工智能和开放科学在内的几个角度接近该主题。可以沿着不同的维度检查注释。就复杂性而言,从实验到标准化的成熟度方面,它们的成熟度可以从微不足道到高度成熟。可以使用更多抽象注释对注释进行注释。主要的研究数据,例如,可以同时在不同层上注释文本文档,这些数据是独立的,但可以使用多层查询来利用。标准保证数据集的互操作性和可重复使用性。本章以四个最终观察为结束,以研究问题为例,或者就注释研究的现状提出了挑衅性的评论。
The annotation of textual information is a fundamental activity in Linguistics and Computational Linguistics. This article presents various observations on annotations. It approaches the topic from several angles including Hypertext, Computational Linguistics and Language Technology, Artificial Intelligence and Open Science. Annotations can be examined along different dimensions. In terms of complexity, they can range from trivial to highly sophisticated, in terms of maturity from experimental to standardised. Annotations can be annotated themselves using more abstract annotations. Primary research data such as, e.g., text documents can be annotated on different layers concurrently, which are independent but can be exploited using multi-layer querying. Standards guarantee interoperability and reusability of data sets. The chapter concludes with four final observations, formulated as research questions or rather provocative remarks on the current state of annotation research.