论文标题

使用多个实例学习进行可解释的太阳耀斑预测

Using Multiple Instance Learning for Explainable Solar Flare Prediction

论文作者

Huwyler, Cédric, Melchior, Martin

论文摘要

在这项工作中,我们利用了来自NASAS IRIS卫星的光谱数据的弱标记数据集,用于使用多个实例学习(MIL)范式预测太阳耀斑。尽管标准监督的学习模型期望每种情况都有标签,但MIL会放松此标签,并且仅考虑将一些实例标记为标签。这是理想情况下适合使用虹膜数据的耀斑预测的,虹膜数据包括沿仪器缝的时间序列的紫外光谱袋。特别是,我们考虑围绕MG II H&K线的读数窗口,该窗口编码有关太阳能球体动力学的信息。我们的MIL型号不仅能够预测下一个$ \ sim $ 25分钟内是否出现耀斑,精度约为90%,而且还可以解释哪些频谱轮廓对于其行李级预测特别重要。该信息可用于突出实时虹膜观察结果中感兴趣的区域,并确定典型耀斑前体光谱谱的候选者。我们使用K-均值聚类来提取与耀斑预测相关的光谱轮廓组。以前的作品发现,回收的组显示出高强度,三胞胎红翼发射和单峰H和K线。它们似乎与小规模爆炸事件有关,这些事件发生在爆发前数十分钟。

In this work we leverage a weakly-labeled dataset of spectral data from NASAs IRIS satellite for the prediction of solar flares using the Multiple Instance Learning (MIL) paradigm. While standard supervised learning models expect a label for every instance, MIL relaxes this and only considers bags of instances to be labeled. This is ideally suited for flare prediction with IRIS data that consists of time series of bags of UV spectra measured along the instrument slit. In particular, we consider the readout window around the Mg II h&k lines that encodes information on the dynamics of the solar chromosphere. Our MIL models are not only able to predict whether flares occur within the next $\sim$25 minutes with accuracies of around 90%, but are also able to explain which spectral profiles were particularly important for their bag-level prediction. This information can be used to highlight regions of interest in ongoing IRIS observations in real-time and to identify candidates for typical flare precursor spectral profiles. We use k-means clustering to extract groups of spectral profiles that appear relevant for flare prediction. The recovered groups show high intensity, triplet red wing emission and single-peaked h and k lines, as found by previous works. They seem to be related to small-scale explosive events that have been reported to occur tens of minutes before a flare.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源