论文标题

研究多标签分类问题的班级难度因素

Investigating Class-level Difficulty Factors in Multi-label Classification Problems

论文作者

Marsden, Mark, McGuinness, Kevin, Antony, Joseph, Wei, Haolin, Redzic, Milan, Tang, Jian, Hu, Zhilan, Smeaton, Alan, O'Connor, Noel E

论文摘要

这项工作首次研究了多标签分类问题中类级难度因素的使用。提出了四个级别的难度因素:频率,视觉变化,语义抽象和类同时出现。一旦为给定的多标签分类数据集进行了计算,这些难度因素被证明具有多种潜在的应用程序,包括在数据集中预测类级级别的性能以及通过难度加权优化来提高预测性能。对于两个具有挑战性的多标签数据集(WWW人群和视觉基因组),观察到了MAP和AUC性能的重大改进,其中包括难度加权优化。所提出的技术在训练或推理过程中不需要任何其他计算复杂性,并且可以随着时间的流逝而延长其他类别的难度因素。

This work investigates the use of class-level difficulty factors in multi-label classification problems for the first time. Four class-level difficulty factors are proposed: frequency, visual variation, semantic abstraction, and class co-occurrence. Once computed for a given multi-label classification dataset, these difficulty factors are shown to have several potential applications including the prediction of class-level performance across datasets and the improvement of predictive performance through difficulty weighted optimisation. Significant improvements to mAP and AUC performance are observed for two challenging multi-label datasets (WWW Crowd and Visual Genome) with the inclusion of difficulty weighted optimisation. The proposed technique does not require any additional computational complexity during training or inference and can be extended over time with inclusion of other class-level difficulty factors.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源