论文标题
5G无人机识别的准确可靠的方法,并具有校准的不确定性
Accurate and Reliable Methods for 5G UAV Jamming Identification With Calibrated Uncertainty
论文作者
论文摘要
只有在不考虑不确定性的情况下提高准确性可能会对深层神经网络(DNN)决策产生负面影响,并降低其可靠性。本文提出了针对时间序列的二进制分类问题的五种组合预处理和后处理方法,这些方法同时提高了5G无人机安全数据集中应用的DNN输出的准确性和可靠性。这些技术使用DNN输出作为输入参数,并以不同的方式处理它们。两种方法使用著名的机器学习(ML)算法作为补充,而其他三种仅使用DNN估计的置信值。我们比较了七个不同的指标,例如预期校准误差(ECE),最大校准误差(MCE),平均信心(MC),平均准确性(MA),归一化的负对数可能性(NLL),BRIER得分损失(BSL)和可靠性评分(RS)和可靠性评分(RS)以及它们之间的折衷,以评估拟议的杂种杂种杂志。首先,我们表明,在本工作呈现的条件下,极端梯度提升(XGB)分类器可能对二进制分类不可靠。其次,我们证明至少一种潜在方法可以比DNN软磁层中的分类获得更好的结果。最后,我们表明,基于RS决定MC和MA指标之间差异的假设,前瞻性方法可以通过更好的不确定性校准提高准确性和可靠性,并且该差异应为零以提高可靠性。例如,方法3即使与XGB分类器相比,最佳RS也是0.65,即XGB分类器(达到7.22卢比)。
Only increasing accuracy without considering uncertainty may negatively impact Deep Neural Network (DNN) decision-making and decrease its reliability. This paper proposes five combined preprocessing and post-processing methods for time-series binary classification problems that simultaneously increase the accuracy and reliability of DNN outputs applied in a 5G UAV security dataset. These techniques use DNN outputs as input parameters and process them in different ways. Two methods use a well-known Machine Learning (ML) algorithm as a complement, and the other three use only confidence values that the DNN estimates. We compare seven different metrics, such as the Expected Calibration Error (ECE), Maximum Calibration Error (MCE), Mean Confidence (MC), Mean Accuracy (MA), Normalized Negative Log Likelihood (NLL), Brier Score Loss (BSL), and Reliability Score (RS) and the tradeoffs between them to evaluate the proposed hybrid algorithms. First, we show that the eXtreme Gradient Boosting (XGB) classifier might not be reliable for binary classification under the conditions this work presents. Second, we demonstrate that at least one of the potential methods can achieve better results than the classification in the DNN softmax layer. Finally, we show that the prospective methods may improve accuracy and reliability with better uncertainty calibration based on the assumption that the RS determines the difference between MC and MA metrics, and this difference should be zero to increase reliability. For example, Method 3 presents the best RS of 0.65 even when compared to the XGB classifier, which achieves RS of 7.22.