论文标题
Sensorscan:在化学过程中进行自我监督的学习和深层聚类诊断
SensorSCAN: Self-Supervised Learning and Deep Clustering for Fault Diagnosis in Chemical Processes
论文作者
论文摘要
现代工业设施在生产过程中产生了大量的原始传感器数据。该数据用于监视和控制过程,可以分析以检测和预测过程异常。通常,必须由专家注释数据才能用于预测建模。但是,在工业环境中,大量数据的手动注释可能很困难。 在本文中,我们提出了Sensorscan,这是一种用于工业化学过程监测的新型方法,用于无监督的故障检测和诊断。我们在田纳西州伊士曼进程的两个公开数据集上演示了模型的性能。结果表明,我们的方法显着优于现有方法(固定FPR的+0.2-0.3 TPR),并有效地检测了大多数过程故障而没有专家注释。此外,我们表明该模型在一小部分的标记数据上进行了微调几乎达到了在完整数据集中训练的SOTA模型的性能。我们还证明我们的方法适用于未提前不知道故障数量的现实世界应用。该代码可从https://github.com/airi-institute/sensorscan获得。
Modern industrial facilities generate large volumes of raw sensor data during the production process. This data is used to monitor and control the processes and can be analyzed to detect and predict process abnormalities. Typically, the data has to be annotated by experts in order to be used in predictive modeling. However, manual annotation of large amounts of data can be difficult in industrial settings. In this paper, we propose SensorSCAN, a novel method for unsupervised fault detection and diagnosis, designed for industrial chemical process monitoring. We demonstrate our model's performance on two publicly available datasets of the Tennessee Eastman Process with various faults. The results show that our method significantly outperforms existing approaches (+0.2-0.3 TPR for a fixed FPR) and effectively detects most of the process faults without expert annotation. Moreover, we show that the model fine-tuned on a small fraction of labeled data nearly reaches the performance of a SOTA model trained on the full dataset. We also demonstrate that our method is suitable for real-world applications where the number of faults is not known in advance. The code is available at https://github.com/AIRI-Institute/sensorscan.