论文标题

推进对话的跨学科科学:来自大型人类演讲的洞察力

Advancing an Interdisciplinary Science of Conversation: Insights from a Large Multimodal Corpus of Human Speech

论文作者

Reece, Andrew, Cooney, Gus, Bull, Peter, Chung, Christine, Dawson, Bryn, Fitzpatrick, Casey, Glazer, Tamara, Knox, Dean, Liebscher, Alex, Marin, Sebastian

论文摘要

人们花了很大一部分时间从事对话,但是我们对谈话的科学理解仍处于起步阶段。在本报告中,我们提出了一门跨学科的对话科学,其中包括一个大型,新颖的多模式语料库,其中1,656篇以英语为单位的对话。这个7+百万个单词,850小时的语料库总计超过1TB的音频,视频和成绩单,并进行了瞬间的声音,面部和语义表达的衡量,以及对演讲者的对话后对话的广泛调查。我们利用语料库的相当大的范围来扩展文献中的关键发现,例如人类转弯的合作性; (2)定义新的算法程序,以将语音分割为对话转弯; (3)在各种文本,听觉和视觉功能上应用机器学习见解来分析使对话成功或失败的原因; (4)探讨对话与整个生命周期的幸福感如何相关。我们还报告(5)基于定量分析和对每种录音的定性审查的全面混合方法报告,它展示了来自不同背景的个人如何改变其沟通模式并找到联系方式。最后,我们讨论了这个大型公共数据集如何为未来的研究提供新的方向,尤其是跨学科界限,因为来自各个领域的学者似乎对对话的研究越来越感兴趣。

People spend a substantial portion of their lives engaged in conversation, and yet our scientific understanding of conversation is still in its infancy. In this report we advance an interdisciplinary science of conversation, with findings from a large, novel, multimodal corpus of 1,656 recorded conversations in spoken English. This 7+ million word, 850 hour corpus totals over 1TB of audio, video, and transcripts, with moment-to-moment measures of vocal, facial, and semantic expression, along with an extensive survey of speaker post conversation reflections. We leverage the considerable scope of the corpus to (1) extend key findings from the literature, such as the cooperativeness of human turn-taking; (2) define novel algorithmic procedures for the segmentation of speech into conversational turns; (3) apply machine learning insights across various textual, auditory, and visual features to analyze what makes conversations succeed or fail; and (4) explore how conversations are related to well-being across the lifespan. We also report (5) a comprehensive mixed-method report, based on quantitative analysis and qualitative review of each recording, that showcases how individuals from diverse backgrounds alter their communication patterns and find ways to connect. We conclude with a discussion of how this large-scale public dataset may offer new directions for future research, especially across disciplinary boundaries, as scholars from a variety of fields appear increasingly interested in the study of conversation.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源