论文标题
通过专家强盗反馈调整多变量模型预测控制器
Tuning of multivariable model predictive controllersthrough expert bandit feedback
论文作者
论文摘要
对于某些工业控制应用程序,不可用的功能捕获封闭循环绩效中竞争目标之间的非平凡权衡的明确功能。在这种情况下,通常使用人类先天能力隐式学习这种关系并手动调整相应的控制器以实现理想的闭环性能。这种方法由于经验水平和偏好而没有明确的校准度量,因此这种方法具有缺陷。此外,随着基础系统和/或控制器的复杂性增加,为了实现更好的性能,调谐时间和相关的调整成本也是如此。为了降低整体调整成本,本文提出了一个调整框架,从而使用监督的机器学习来提取人类学习的成本功能和可以有效处理大量变量的优化算法,用于优化提取的成本功能。鉴于对许多工业领域的实施以及相应的调整过程中存在的相关高度自由度的兴趣,对柴油发动机中的空气路径控制应用的模型预测控制器进行了调整,目的是为了证明框架的潜力。
For certain industrial control applications an explicit function capturing the nontrivial trade-off between competing objectives in closed loop performance is not available. In such scenarios it is common practice to use the human innate ability to implicitly learn such a relationship and manually tune the corresponding controller to achieve the desirable closed loop performance. This approach has its deficiencies because of individual variations due to experience levels and preferences in the absence of an explicit calibration metric. Moreover, as the complexity of the underlying system and/or the controller increase, in the effort to achieve better performance, so does the tuning time and the associated tuning cost. To reduce the overall tuning cost, a tuning framework is proposed herein, whereby a supervised machine learning is used to extract the human-learned cost function and an optimization algorithm that can efficiently deal with a large number of variables, is used for optimizing the extracted cost function. Given the interest in the implementation across many industrial domains and the associated high degree of freedom present in the corresponding tuning process, a Model Predictive Controller applied to air path control in a diesel engine is tuned for the purpose of demonstrating the potential of the framework.