论文标题
使用传输转换进行数据分析和机器学习的传输转换分区类别
Partitioning signal classes using transport transforms for data analysis and machine learning
论文作者
论文摘要
一组相对较新的基于运输的变换(CDT,R-CDT,LOT)显示了它们在各种图像和数据处理任务中的强度和巨大潜力,例如参数信号估计,分类,癌症检测等。因此,值得阐明一些数学属性,这些数学属性可以解释这些变换的成功,当它们用作数据分析,信号处理或数据分类的工具时。特别是,我们提供条件下,由代数生成模型创建的一类信号通过传输变换转换为凸集。当在变换域中查看时,这些类的这种凸化简化了分类以及其他数据分析和处理问题。更具体地说,我们研究了代数生成建模框架下这些变换的凸介构化能力的程度和局限性。我们希望本文将作为这些转换的介绍,并鼓励数学家和其他研究人员进一步探索理论的基础和算法工具,这些工具将有助于了解这些转换的成功,并为进一步的成功应用奠定基础。
A relatively new set of transport-based transforms (CDT, R-CDT, LOT) have shown their strength and great potential in various image and data processing tasks such as parametric signal estimation, classification, cancer detection among many others. It is hence worthwhile to elucidate some of the mathematical properties that explain the successes of these transforms when they are used as tools in data analysis, signal processing or data classification. In particular, we give conditions under which classes of signals that are created by algebraic generative models are transformed into convex sets by the transport transforms. Such convexification of the classes simplify the classification and other data analysis and processing problems when viewed in the transform domain. More specifically, we study the extent and limitation of the convexification ability of these transforms under an algebraic generative modeling framework. We hope that this paper will serve as an introduction to these transforms and will encourage mathematicians and other researchers to further explore the theoretical underpinnings and algorithmic tools that will help understand the successes of these transforms and lay the groundwork for further successful applications.