Approximations of MPC: Dynamic KKT & Learning based Approaches

107 Views

February 18, 26

#制御工学 #モデル予測制御 #機械学習 #最適化

スライド概要

セビリア大学での招待講義の資料です。

Masaki Inoue

@minoue227

スライド一覧

制御工学の研究者です。

またはPlayer版

埋め込む »CMSなどでJSが使えない場合

ダウンロード

関連スライド

理系プレゼン技法

プレゼン教育物理情報工学科

Masaki Inoue 23K

現代制御論のすゝめ

制御工学現代制御最適レギュレータリアプノフ安定論状態方程式物理情報工学科

Masaki Inoue 22.9K

制御で支えるAIの未来：Physical AIからSocial AIまで

制御工学 ai フィジカルai ソーシャルai

Masaki Inoue 2.8K

受動性再訪：モデル集合のみでのシステムの記述から解析・設計まで

制御工学大規模複雑系受動性消散性

Masaki Inoue 224

人間の行動・意思決定を含むシステムに対する制御理論の開拓と展開

制御工学 human-in-the-loop制御弱い制御内部モデル制御（imc）

Masaki Inoue >100

学振特別研究員になるために～2025年度申請版

学振 dc1 dc2 jsps pd

大上雅史 787.7K

各ページのテキスト

Approximations of MPC: Dynamic KKT & Learning based Approaches Masaki Inoue Keio University [email protected] Seminar, University of Seville May 30, 2022

Various Control Applications Air Traffic Management Energy Management fuel efficiency safety Light Guidance Control reduced discomfort complex driver behavior 2 stability economic dispatch Smart Agriculture reduced workload preferred environment

Model Predictive Control (MPC) • Plant Model • extended vector • Control Law ex. model-based state prediction: • find optimal action based on the measured state • handle complex design specifications • require real-time solution of optimization problem 3

Approximation of Control Law 4 ➢Control Law in MPC ➢Optimality Condition: inequality constraints are omitted in this talk (can be extended) • Karush–Kuhn–Tucker (KKT) condition essence of high computational burden • control law is given by implicit function like ➢Aim of this talk: Approximation of control law by explicit function like

Two Approaches MPC i. 5 approx MPC Approximation by dynamic KKT (instant MPC) • original work: [K. Yoshida et al, IEEE LCSS, 2019] • extensions: [M. Figura et al, ACC, 2020], [K. Sato et al, SICE AC, 2020] ii. Approximation by learning technique • [K. Hara et al, IFAC WC, 2020], [K. Hara et al, Automatica (conditionally accepted)] • Others by many researchers • explicit MPC: [Bemporad et al, Automatica, 2002] etc • neural network based MPC: [T. Parisini & R. Zoppoli, Automatica, 1995] etc

Outline 1. Preliminary: Dissipation-based System Analysis • tool for closed-loop stability of approximated MPC 2. Dynamic KKT-based Approximation of MPC 3. Learning-based Approximation of MPC 6

1. On System Description ➢ State equation • basic description of dynamical systems • express detailed behavior of IO dynamical system • systems interconnection increases state space dimension state equation is not always tractable for large-scale system analysis & design

1. Differential Equation & Inequality (revisit) ➢ 1-dim differential equation ➢ 1-dim differential inequality • is a description of a set of dynamical systems • express rough behavior of dynamical system ➢ Model “Reduction” of state equation finding energy function reduced to 1-dim diff inequality

1. Description by Dissipation Inequality ➢ Dissipation inequality (Willems) • special case: QSR dissipativity (Hill & Moylan) upper bound of energy increase • Im Example 1: bounded L2 gain Im 0 • Example 2: passivity Re Re 0 Nyquist diagram

10.

1. Feedback Connection ー ➢ Assumption: dissipation in subsystems connected as ➢ Dissipation in feedback system • evaluated by sum of energies

11.

1. Stability Theorem ー ➢ Dissipation in feedback system ➢ Theorem: Suppose (*) is negative (semi)definite. Then, the origin of FB system is asymptotically (Lyapunov) stable • small-gain theorem: , • passivity theorem: , • only few parameters are required for stability analysis (NO detailed model like state equation)

12.

Outline 12 1. Preliminary: Dissipation-based System Analysis • tool for closed-loop stability of approximated MPC 2. Dynamic KKT-based Approximation of MPC • Main Reference: [K. Yoshida et al, LCSS, 2019] 3. Learning-based Approximation of MPC MPC approx MPC

13.

2. Dynamic KKT ➢ KKT condition (recall) optimization problem ➢ dynamic KKT condition • reduced to KKT condition at steady state ( • primal-dual gradient algorithm: ) [K. J. Arrow et al, Studies in Linear and Non-Linear Programming, 1958] • extension to handle inequality constraints: [A. Cherukuri et al, SCL, 2016], [A. Adegbege, IFAC WC, 2020]

14.

2. Instant MPC ➢ Instant MPC: dynamic controller realized by optimization algorithm associated with MPC approx MPC • less approx. error for slow dynamics • No error at DC gain • explicit control law like • sketch for discrete-time algorithm case MPC iMPC objective function

15.

2. Demonstration of instant MPC ➢ Attitude control of aircraft [K. M. Sobel et al, JGCD, 1985] ➢ Simulation results • suboptimal performance • constraint violation for some algorithm • avoided by other PDG algorithm [A. Adegbege, IFAC WC, 2020] • significantly reduce computational burden MPC（cplex） 18.13 ms iMPC 0.2070 ms

16.

2. Stability Analysis of instant MPC ➢ Assumption: dissipativity of plant system iMPC ➢ Extension of instant MPC • add tunable parameters that do not shift the steady state Note: at steady state ➢ Theorem: existence of parameters s.t. • dissipativity of iMPC: • close-loop stability:

17.

2. Short Summary on instant MPC MPC iMPC ➢ Instant MPC: suboptimal control realized by solution algorithm Main Reference: [K. Yoshida et al, LCSS, 2019] • guarantee optimality at steady state • related works: [A. Jokic et al, TAC, 2009], [L. Lawrence, TAC, 2021] • guarantee closed-loop stability based on dissipation theory

18.

Outline 18 1. Preliminary: Dissipation-based System Analysis • tool for closed-loop stability of approximated MPC 2. Dynamic KKT based Approximation of MPC • main reference: [K. Yoshida et al, LCSS, 2019] 3. Learning-based Approximation of MPC • main reference: [K. Hara et al, IFAC WC, 2020], [K. Hara et al, Automatica (conditionally accepted)] MPC approx MPC

19.

3. Learning MPC 19 ➢Idea of data-driven approximation • collect input-output data on optimal control law offline • apply data-driven modeling to find explicit control law MPC data collection approx MPC modeling ➢Related (classic) works • neural network-based approximation of optimal control: [D. Psaltis et al, CSM, 1988], [T. Parisini el al, Automatica, 1995] ➢This work: stability-preserving data-driven approximation

20.

3. Controller Structure 20 ➢Approximation of observer-based MPC • find suitable lifted state • observer MPC e.g. radial basis function approximation • state equation for controller • focus trajectory of lifted state • nonlinear in state • linear in system matrices , not of state & lifted state ➢Preliminary for data-driven approximation • perform offline simulation of MPC to generate • transform to lifted state & input

21.

3. Dissipativity-Constrained Learning ➢Preliminary 2: • impose dissipation specification approx MPC ➢Formulation: dissipativity-constrained learning control performance closed-loop stability • nonlinear matrix inequality is reduced to LMI [Y. Abe et al, ECC, 2016] • idea for reduced conservativeness [K. Hara et al, IFAC WC, 2020] etc 21

22.

3. Demonstration 22 ➢Nonlinear plant: • passive [H. Zekari & P. Antsaklis, IJC, 2019] ➢Suboptimal controller design: Simulation result: • MPC & observer design • IO data collection • passivity-constrained learning ーpassivity-constrained ーno-constrained closed-loop stability

23.

Summary: Approximation of MPC 1. Dynamic KKT-based Approximation • instant MPC is realized by optimization algorithm associated with MPC • guarantee optimality at steady-state • stability analysis by dissipativity theory 2. Learning-based Approximation • guarantee stability by dissipativity-constrained learning • suboptimality analysis by Koopman operator theory (omitted) • present LMI-based solution method (omitted) MPC approx MPC 23