论文标题

自动网格术语建议在系统评论文献搜索中有效查询配方

Automated MeSH Term Suggestion for Effective Query Formulation in Systematic Reviews Literature Search

论文作者

Wang, Shuai, Scells, Harrisen, Koopman, Bevan, Zuccon, Guido

论文摘要

高质量的医学系统评价需要全面的文献搜索,以确保建议和结果足够可靠。确实,寻找相关的医学文献是构建系统评价的关键阶段,并且通常涉及域(医学研究人员)和搜索(信息专家)专家开发搜索查询。基于布尔逻辑,在这种情况下的查询非常复杂,包括标准化术语(例如,医学主题标题(网格)词库)的自由文本项和索引项,并且难以构建。特别是显示网格术语的使用可以提高搜索结果的质量。但是,确定正确的网格术语以在查询中包含很难:信息专家通常不熟悉网格数据库,并且不确定查询网格条款的适当性。自然地,网格术语的全部价值通常不会完全利用。本文研究了基于最初的布尔查询,建议仅包含自由文本术语的方法。在这种情况下,我们设计了基于语言模型的词汇和预训练的方法。这些方法有望自动确定高效的网格术语,以包含在系统的审查查询中。我们的研究对几种网格术语建议方法进行了经验评估。我们进一步对每种方法的网格术语建议进行了广泛的分析,以及这些建议如何影响布尔查询的有效性。

High-quality medical systematic reviews require comprehensive literature searches to ensure the recommendations and outcomes are sufficiently reliable. Indeed, searching for relevant medical literature is a key phase in constructing systematic reviews and often involves domain (medical researchers) and search (information specialists) experts in developing the search queries. Queries in this context are highly complex, based on Boolean logic, include free-text terms and index terms from standardised terminologies (e.g., the Medical Subject Headings (MeSH) thesaurus), and are difficult and time-consuming to build. The use of MeSH terms, in particular, has been shown to improve the quality of the search results. However, identifying the correct MeSH terms to include in a query is difficult: information experts are often unfamiliar with the MeSH database and unsure about the appropriateness of MeSH terms for a query. Naturally, the full value of the MeSH terminology is often not fully exploited. This article investigates methods to suggest MeSH terms based on an initial Boolean query that includes only free-text terms. In this context, we devise lexical and pre-trained language models based methods. These methods promise to automatically identify highly effective MeSH terms for inclusion in a systematic review query. Our study contributes an empirical evaluation of several MeSH term suggestion methods. We further contribute an extensive analysis of MeSH term suggestions for each method and how these suggestions impact the effectiveness of Boolean queries.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源