论文标题
通过使用Crikit从有限观察到的有限观察中推断出冰盖伤害模型:本构关系推理工具包
Inferring ice sheet damage models from limited observations using CRIKit: the Constitutive Relation Inference Toolkit
论文作者
论文摘要
我们检查了从观察数据中学习冰盖损害模型的前景。我们在Crikit(构型关系推理工具包)中实现的方法是为损伤的材料时间导数建模为框架不变的神经网络,并通过模拟冰圆顶的模拟来优化模型的参数。使用Albrecht和Levermann的模型作为基础真理来产生合成观察,我们测量了该模型中优化的神经网络模型的差异,以试图了解该过程如何生成模型,然后将模型转移到其他冰盖模拟中。 在其他学科中,使用所谓的“深度学习”模型用于构成方程,状态方程,子网格规模的过程以及出现在PDES系统中的其他重点关系,但我们的推论设置具有一些混乱的因素。首先是可用的观察类型:我们比较当数值模拟的丢失包括整个冰中的观察不正确时,在实际环境中无法实现的观察错误与仅包含表面和钻孔观察的组合的损失。第二个混杂因素是冰盖损伤的演变,这是占主导地位的。损坏模型中扰动的非本地效应导致损失功能具有许多局部最小值和许多参数配置,而系统无法解析。 我们的经验表明,基本的神经网络具有几种影响优化模型质量的缺陷。我们建议将其他归纳偏见纳入神经网络中的几种方法,这可能会导致未来的工作表现更好。
We examine the prospect of learning ice sheet damage models from observational data. Our approach, implemented in CRIKit (the Constitutive Relation Inference Toolkit), is to model the material time derivative of damage as a frame-invariant neural network, and to optimize the parameters of the model from simulations of the flow of an ice dome. Using the model of Albrecht and Levermann as the ground truth to generate synthetic observations, we measure the difference of optimized neural network models from that model to try to understand how well this process generates models that can then transfer to other ice sheet simulations. The use of so-called "deep-learning" models for constitutive equations, equations of state, sub-grid-scale processes, and other pointwise relations that appear in systems of PDEs has been successful in other disciplines, yet our inference setting has some confounding factors. The first is the type of observations that are available: we compare the quality of the inferred models when the loss of the numerical simulations includes observation misfits throughout the ice, which is unobtainable in real settings, to losses that include only combinations of surface and borehole observations. The second confounding factor is the evolution of damage in an ice sheet, which is advection dominated. The non-local effect of perturbations in a damage models results in loss functions that have both many local minima and many parameter configurations for which the system is unsolvable. Our experience suggests that basic neural networks have several deficiencies that affect the quality of the optimized models. We suggest several approaches to incorporating additional inductive biases into neural networks which may lead to better performance in future work.