论文标题
台风强度预测的基于语义的端到端学习
Semantic-based End-to-End Learning for Typhoon Intensity Prediction
论文作者
论文摘要
灾难预测是灾难监视和准备的最关键任务之一。现有技术采用不同的机器学习方法来预测历史环境数据的传入灾难。但是,对于短期灾难(例如地震),仅历史数据的预测能力有限。因此,需要其他警告来进行准确的预测来源。除了历史环境数据之外,我们认为社交媒体是知识的补充来源。但是,社交媒体帖子(例如推文)非常非正式,仅包含有限的内容。为了减轻这些局限性,我们提出了富含语义的单词嵌入模型的组合,以用传统word2vec计算出其语义表示的推文中代表实体。此外,我们研究社交媒体帖子与台风之间的相关性是如何在推文的数量和情感的条件下(也称为强度)的。基于这些见解,我们提出了一个基于端到端的框架,该框架从与灾难相关的推文和环境数据中学习,以改善台风强度预测。本文是我们最初在K-CAP 2019中发表的作品的扩展[32]。我们通过使用最先进的深层神经模型构建框架来扩展本文,并通过新的台风和他们的推文及其推文及其推文,并基于我们针对灾难预测中最近基线的方法。我们的实验结果表明,我们的方法在F1得分方面优于最先进的基线的准确性(CNN By12.1%和BilstM By 3.1%)与上次实验相比
Disaster prediction is one of the most critical tasks towards disaster surveillance and preparedness. Existing technologies employ different machine learning approaches to predict incoming disasters from historical environmental data. However, for short-term disasters (e.g., earthquakes), historical data alone has a limited prediction capability. Therefore, additional sources of warnings are required for accurate prediction. We consider social media as a supplementary source of knowledge in addition to historical environmental data. However, social media posts (e.g., tweets) is very informal and contains only limited content. To alleviate these limitations, we propose the combination of semantically-enriched word embedding models to represent entities in tweets with their semantic representations computed with the traditionalword2vec. Moreover, we study how the correlation between social media posts and typhoons magnitudes (also called intensities)-in terms of volume and sentiments of tweets-. Based on these insights, we propose an end-to-end based framework that learns from disaster-related tweets and environmental data to improve typhoon intensity prediction. This paper is an extension of our work originally published in K-CAP 2019 [32]. We extended this paper by building our framework with state-of-the-art deep neural models, up-dated our dataset with new typhoons and their tweets to-date and benchmark our approach against recent baselines in disaster prediction. Our experimental results show that our approach outperforms the accuracy of the state-of-the-art baselines in terms of F1-score with (CNN by12.1%and BiLSTM by3.1%) improvement compared with last experiments