论文标题
对CFD模拟的ML-Surogates的数据需求进行神秘面纱
Demystifying the Data Need of ML-surrogates for CFD Simulations
论文作者
论文摘要
计算流体动力学(CFD)模拟是各种工程应用中的关键工具,通常需要大量的时间和计算能力来预测流量特性。与CFD模拟相关的高计算成本大大限制了设计空间探索的范围,并限制了它们在计划和运营控制中的使用。为了解决这个问题,已经提出了基于机器的替代模型作为计算有效的工具,以加速CFD模拟。但是,缺乏对CFD数据要求的清晰度通常会挑战设计工程师和CFD从业人员中基于ML的替代物的广泛采用。在这项工作中,我们提出了一个基于ML的替代模型,以在各种操作条件下预测乘用车机舱内的温度分布,并使用它来证明预测性能和培训数据集大小之间的权衡。我们的结果表明,即使训练尺寸从2000年逐渐降低到200,预测的准确性也很高且稳定。基于ML的替代物还将计算时间从〜30分钟降低到约9毫秒左右。此外,即使仅使用50个CFD模拟进行训练,ML-Surrogate预测的温度趋势(例如,热/冷区域的位置)与CFD模拟的结果非常吻合。
Computational fluid dynamics (CFD) simulations, a critical tool in various engineering applications, often require significant time and compute power to predict flow properties. The high computational cost associated with CFD simulations significantly restricts the scope of design space exploration and limits their use in planning and operational control. To address this issue, machine learning (ML) based surrogate models have been proposed as a computationally efficient tool to accelerate CFD simulations. However, a lack of clarity about CFD data requirements often challenges the widespread adoption of ML-based surrogates among design engineers and CFD practitioners. In this work, we propose an ML-based surrogate model to predict the temperature distribution inside the cabin of a passenger vehicle under various operating conditions and use it to demonstrate the trade-off between prediction performance and training dataset size. Our results show that the prediction accuracy is high and stable even when the training size is gradually reduced from 2000 to 200. The ML-based surrogates also reduce the compute time from ~30 minutes to around ~9 milliseconds. Moreover, even when only 50 CFD simulations are used for training, the temperature trend (e.g., locations of hot/cold regions) predicted by the ML-surrogate matches quite well with the results from CFD simulations.