Menu

Numerics.DimRed

Burkhard Schmidt
Attachments
Control_1.gif (4804 bytes)
Error_1.gif (8355 bytes)
Error_2.gif (5573 bytes)
Error_3.gif (3237 bytes)
Lyapunov_3.gif (6644 bytes)
Transfer_1.gif (2217 bytes)
Transfer_2.gif (2682 bytes)
Transfer_3.gif (4035 bytes)
Transfer_4.gif (12502 bytes)
WCO_1.gif (6805 bytes)
WCO_2.gif (4484 bytes)
WCO_3.gif (7710 bytes)
WCO_4.gif (3291 bytes)
There is a newer version of this page. You can find it here.

Dimension reduction: From linear to bilinear control systems

Introduction

Before discussing the dimension reduction itself, we shall first shed some light on the transfer function/matrix (and its H2 norm). The transfer function/matrix describes how for every input/control u(t), the linear/bilinear control systems responds with an output/observation y(t). It is the task of dimension reduction to find lower-dimensional (reduced) systems which approximate this input-output behavior as closely as possible on any compact time interval [0;T]. We note that this problem is closely connected to the concepts of controllability and observability.

Transfer function/matrix and H2 norm

In linear time-invariant (LTI) system theory, the input equation of a control system describes the evolution of a high-dimensional system in terms of its state vector x(t)∈Cn. The field-free evolution is described by linear ODEs with real symmetric matrix A∈Rn×n (or complex Hermitian matrix A∈Cn×n) with simple eigenvalue 0. The (linear!) interaction with a low-dimensional control u(t)∈Rm, m<<n is given by input matrix B∈Rn×m. The output equation defines the low-dimensional observables y(t)∈Rp, p<<n in terms of output matrix C ∈Rp×n.

\begin{array}{lll}\dot{x}(t)&=&Ax(t)+Bu(t),\,x(0)=x_0klzzwxh:0042y(t)&=&Cx(t)\end{array}

where we have dropped the imaginary unit from our previous control equations because it does not affect the calculations of the Gramians discussed below. Upon Legendre transform, the input-output behavior of the system is simply given by a linear mapping

Y(s)=G(s)U(s)

where U(s) and Y(s) are the Legendre transforms of input u(t) and y(t), respectively, and where the transfer function/matrix

G(s)=C\left(sI_{n\times n}-A\right)^{-1}B

is the Legendre transform of matrix g(t) consisting of elements

g_{ij}(t)=\leftklzzwxh:0049\begin{matrix}C_ie^{At}B_j,&t\rangle 0klzzwxh:00500,&t\leq 0\end{matrix}\right.

which can be considered as output yi component generated by a δ-like (impulse) input uj component.

Then the H2 norm of the control system can be obtained as

\begin{array}{lll} ||G(\cdot)||_{\mathcal{H}_2}^2 &\equiv& \int_0^\infty \sum_{i=1}^p\sum_{j=1}^m|g_{ij}|^2(t)dtklzzwxh:0059&=&\int_0^\infty tr (g^\ast(t)g(t))dtklzzwxh:0060&=&\frac{1}{2\pi}\int_{-\infty}^{\infty}tr(G^\ast(i\omega)G(i\omega))d\omegaklzzwxh:0061&=& tr (B^\ast W_oB)klzzwxh:0062&=&tr(CW_cC^\ast)\end{array}

where tr denote the trace of a matrix and where the Gramians Wc and Wo are explained in the following.

Controllability and observability

Ttwo important properties of control systems are the controllability (aka reachability) and the observability which can be regarded as dual aspects of the same problem. For the case of linear control systems, the corresponding controllability Gramian WC (often referred to as P in the literature) and the observability Gramian WO (often referred to as Q in the literature) are defined as

\begin{matrix}W_C&=&\int_0^\infty \exp(At)BB^\ast\exp(A^\ast t)dtklzzwxh:0076W_O&=&\int_0^\infty \exp(A^\ast t)C^\ast C\exp(At)dt\end{matrix}

If the pair (A,B) is controllable, i.e., if the controllability matrix [ B AB A2B ... An-1B ] has full rank n, then the controllability Gramian WC is positive definite. Qualitatively, the quadratic form x* WCx is a measure for the "input energy" (in terms of the integral over |u(t)|²) that is needed to drive the system to a state x. For a generalization to bilinear systems (involving multiple nested time integrals), see Ref. [1,2]

Conversely, if the pair (A,C) is observable, i.e., if the observability matrix [ CT ATCT (AT)2CT ... (AT)n-1CT ] has full rank n, then the observability Gramian WO is positive definite. Qualitatively, the quadratic form x* WOx is a measure for the "output" energy (time integral over |y(t)|²) that can be extracted from the system when u=0 and the system has been initialized at state x.

In practice, evaluation of the above integrals is avoided; instead the Gramians WC and WO for the linear case are computed as the symmetric positive semi-definite solutions of the Lyapunov equations

\begin{matrix} AW_C+W_C A^\ast+BB^\ast&=&0klzzwxh:0126A^\ast W_0+W_0A+C^\ast C&=&0\end{matrix}

which have unique solutions if A is stable. The effort for numerical solution scales as O(n³). For the case of a bilinear input relation, this has to be replaced by the following generalized Lyapunov equations [1,2]

\begin{matrix}AW_C+W_C A^\ast +\sum_{k=1}^mN_kW_CN_k^\ast+BB^\ast&=&0klzzwxh:0132A^\ast W_0+W_0A+\sum_{k=1}^m N_k^\ast W_0N_k+C^\ast C&=&0\end{matrix}

While direct methods for solving such equations have a numerical complexity of O(n6), the iterative method introduced in [3] can be used instead which requires the solution of a standard Lyapunov equation in each step.

While for linear control systems the Lyapunov equations can always be solved if matrix A is stable (see here), for the generalized Lyapunov equations we recall that a system is called stable when the system matrix A has only eigenvalues in the open left half complex plane (i.e., excluding the imaginary axis). Stability thus means that there are constants λ , a > 0 such that ||exp(At)|| ≤ λ exp(−at), where ||•|| is a suitable matrix norm. If moreover

\frac{\lambda^2}{2a}\sum_{k=1}^m||N_k||^2\langle 1

then controllability and observability Gramians exist. This can be achieved by a suitable scaling u→ηu, N→N/η, B →B/η with real η>1 which leaves the equations of motion invariant but, clearly, not the Gramians. Hence, by increasing η, we drive the system to its linear counterpart. For the limit η→∞, the system matrix N vanishes and we obtain a linear system. For this reason, η should not be chosen too large. Note that also the shift of the spectrum of A may serve to render the generalized Lyapunov equations solvable.

Dimension reduction

The need for efficient numerical treatment of control problems leads to the problem of model order reduction, i.e. the approximation of large-scale systems by significantly smaller ones. While model reduction of linear systems has been studied for several years now, and a well established theory including error bounds and structure-preserving properties fulfilled by a reduced-order model exists, the situation is less favorable for non-linear systems (which occur, e.g. in control of quantum problems). There, the mathematical techniques developed to linear systems are more difficult and much less general. Here we will deal with following two classes of methods, representing generalizations of their linear counterparts:

References

  1. L. Zhang, J. Lam: Automatica 38, 205 (1988)

  2. Z. Bai, D. Skoogh: Lin. Alg. Appl. 415, 406 (2006) (dx.doi.org)

  3. E. Wachspress: Appl. Math. Lett. 1, 87 (1988) (dx.doi.org)

For a review of the theory of linear control systems, see e.g. the excellent lecture notes of Umea university and/or the books by Kemin Zhou.


MongoDB Logo MongoDB