论文标题
视觉分析上下文化嵌入
Visually Analyzing Contextualized Embeddings
论文作者
论文摘要
在本文中,我们介绍了一种方法,用于视觉分析由深神经网络模型产生的上下文化嵌入。我们的方法灵感来自自然语言处理的语言探针,在该探针中,任务旨在探测语言结构的语言模型,例如词性词性和指定实体。这些方法在很大程度上是确认性的,但是仅使用户能够测试已知的信息。在这项工作中,我们避开了监督探查任务,并主张无监督的探针,再加上视觉探索技术,以评估语言模型所学的内容。具体而言,我们聚集了由大型文本语料库产生的上下文化嵌入,并基于这种聚类和文本结构引入可视化设计 - 群集共发生,群集跨度和群集字成员资格 - 以帮助激发单个簇之间的功能和关系。用户反馈突出了我们设计在发现不同类型的语言结构方面的好处。
In this paper we introduce a method for visually analyzing contextualized embeddings produced by deep neural network-based language models. Our approach is inspired by linguistic probes for natural language processing, where tasks are designed to probe language models for linguistic structure, such as parts-of-speech and named entities. These approaches are largely confirmatory, however, only enabling a user to test for information known a priori. In this work, we eschew supervised probing tasks, and advocate for unsupervised probes, coupled with visual exploration techniques, to assess what is learned by language models. Specifically, we cluster contextualized embeddings produced from a large text corpus, and introduce a visualization design based on this clustering and textual structure - cluster co-occurrences, cluster spans, and cluster-word membership - to help elicit the functionality of, and relationship between, individual clusters. User feedback highlights the benefits of our design in discovering different types of linguistic structures.