论文标题

使用数字痕迹来建立预期和实时的县级预警系统,以预测美国的Covid-19爆发

Using digital traces to build prospective and real-time county-level early warning systems to anticipate COVID-19 outbreaks in the United States

论文作者

Stolerman, Lucas M., Clemente, Leonardo, Poirier, Canelle, Parag, Kris V., Majumder, Atreyee, Masyn, Serge, Resch, Bernd, Santillana, Mauricio

论文摘要

持续的19日大流行继续影响世界各地的社区。迄今为止,近600万人死于19日,造成了19日,据估计,全世界已感染了十亿人中的四分之一以上。适当,及时缓解的策略以遏制这种疾病和未来疾病爆发的影响需要密切监测其时空轨迹。我们提出了机器学习方法,以预测美国县的COVID-19活动的急剧增加。我们的方法利用了基于Internet的数字痕迹 - 例如,与疾病相关的互联网搜索活动,来自一般人群和临床医生,与疾病相关的Twitter微博以及来自相邻位置的爆发轨迹 - 来监控人口水平健康趋势的潜在变化。由于需要更精细的空间分辨率流行病学见解以改善当地决策的动机,我们以先前在州一级和大流行的早期构想的回顾性研究工作为基础。我们的方法 - 在美国分布的97个县的子集中进行实时和样本外测试 - 在局部暴发发作开始前1-6周,经常预期的是,经常预期的是急剧增加(定义为有效的复制号$ r_t $ r_t $变大于1的时间)。鉴于Covid-19的持续出现了关注的变体(例如最近的一种,Omicron),以及多个国家没有完全使用疫苗的事实,我们提出的框架虽然为美国的县级构想,但在美国有帮助的国家中可能会有所帮助。

The ongoing COVID-19 pandemic continues to affect communities around the world. To date, almost 6 million people have died as a consequence of COVID-19, and more than one-quarter of a billion people are estimated to have been infected worldwide. The design of appropriate and timely mitigation strategies to curb the effects of this and future disease outbreaks requires close monitoring of their spatio-temporal trajectories. We present machine learning methods to anticipate sharp increases in COVID-19 activity in US counties in real-time. Our methods leverage Internet-based digital traces -- e.g., disease-related Internet search activity from the general population and clinicians, disease-relevant Twitter micro-blogs, and outbreak trajectories from neighboring locations -- to monitor potential changes in population-level health trends. Motivated by the need for finer spatial-resolution epidemiological insights to improve local decision-making, we build upon previous retrospective research efforts originally conceived at the state level and in the early months of the pandemic. Our methods -- tested in real-time and in an out-of-sample manner on a subset of 97 counties distributed across the US -- frequently anticipated sharp increases in COVID-19 activity 1-6 weeks before the onset of local outbreaks (defined as the time when the effective reproduction number $R_t$ becomes larger than 1 consistently). Given the continued emergence of COVID-19 variants of concern -- such as the most recent one, Omicron -- and the fact that multiple countries have not had full access to vaccines, the framework we present, while conceived for the county-level in the US, could be helpful in countries where similar data sources are available.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源