Optimal Projection Equations
   HOME

TheInfoList



OR:

In
control theory Control theory is a field of mathematics that deals with the control of dynamical systems in engineered processes and machines. The objective is to develop a model or algorithm governing the application of system inputs to drive the system to a ...
, optimal projection equations constitute
necessary and sufficient conditions In logic and mathematics, necessity and sufficiency are terms used to describe a conditional or implicational relationship between two statements. For example, in the conditional statement: "If then ", is necessary for , because the truth of ...
for a locally optimal reduced-order LQG controller. The linear-quadratic-Gaussian (LQG) control problem is one of the most fundamental
optimal control Optimal control theory is a branch of mathematical optimization that deals with finding a control for a dynamical system over a period of time such that an objective function is optimized. It has numerous applications in science, engineering and ...
problems. It concerns uncertain
linear system In systems theory, a linear system is a mathematical model of a system based on the use of a linear operator. Linear systems typically exhibit features and properties that are much simpler than the nonlinear case. As a mathematical abstraction o ...
s disturbed by
additive white Gaussian noise Additive white Gaussian noise (AWGN) is a basic noise model used in information theory to mimic the effect of many random processes that occur in nature. The modifiers denote specific characteristics: * ''Additive'' because it is added to any nois ...
, incomplete state information (i.e. not all the state variables are measured and available for feedback) also disturbed by additive white Gaussian noise and quadratic
cost In production, research, retail, and accounting, a cost is the value of money that has been used up to produce something or deliver a service, and hence is not available for use anymore. In business, the cost may be one of acquisition, in which ...
s. Moreover, the solution is unique and constitutes a linear dynamic feedback control law that is easily computed and implemented. Finally the LQG controller is also fundamental to the optimal perturbation control of non-linear systems. The LQG controller itself is a dynamic system like the system it controls. Both systems have the same state dimension. Therefore, implementing the LQG controller may be problematic if the dimension of the system state is large. The reduced-order LQG problem (fixed-order LQG problem) overcomes this by fixing a-priori the number of states of the LQG controller. This problem is more difficult to solve because it is no longer separable. Also the solution is no longer unique. Despite these facts numerical algorithms are available Associated software download from Matlab Central
.
Associated software download from Matlab Central
.
to solve the associated optimal projection equations.


Mathematical problem formulation and solution


Continuous-time

The reduced-order LQG control problem is almost identical to the conventional full-order LQG control problem. Let \hat_r(t) represent the state of the reduced-order LQG controller. Then the only difference is that the state dimension n_r=dim(\hat_r(t)) of the LQG controller is a-priori fixed to be smaller than n=dim((t)) , the state dimension of the controlled system. The reduced-order LQG controller is represented by the following equations: : \dot_r(t) = A_r(t)\hat_r(t) + B_r(t)(t)+K_r(t) \left( (t)-C_r(t)\hat_r(t) \right),\hat_r(0)=_r(0), : (t)= -L_r(t) \hat_r(t). These equations are deliberately stated in a format that equals that of the conventional full-order LQG controller. For the reduced-order LQG control problem it is convenient to rewrite them as : \dot_r(t) = F_r(t)\hat_r(t) + K_r(t) (t),\hat_r(0)=_r(0), : (t)= -L_r(t) \hat_r(t), where : F_r(t)=A_r(t)-B_r(t)L_r(t)-K_r(t)C_r(t). The matrices F_r(t), K_r(t), L_r(t) and _r(0) of the reduced-order LQG controller are determined by the so-called optimal projection equations (OPE). The square optimal projection matrix \tau(t) with dimension n is central to the OPE. The rank of this matrix is almost everywhere equal to n_r. The associated projection is an oblique projection: \tau^2(t)=\tau(t). The OPE constitute four matrix differential equations. The first two equations listed below are generalizations of the matrix Riccati differential equations associated to the conventional full-order LQG controller. In these equations \tau_\perp(t) denotes I_n-\tau(t) where I_n is the identity matrix of dimension n. : \begin \dot(t) = & A(t)P(t)+P(t)A'(t)-P(t)C'(t)W^(t) C(t)P(t)+V(t) \\ pt& +\tau_\perp (t)P(t)C'(t)W^(t) C(t)P(t)\tau'_\perp (t), \\ ptP(0)= & E \left((0)'(0) \right), \\ pt& -\dot(t) = A'(t)S(t)+S(t)A(t)-S(t)B(t)R^(t)B'(t)S(t)+Q(t) \\ pt& + \tau'_\perp (t)S(t)B(t)R^(t)B'(t)S(t) \tau_\perp (t), \end : S(T) = F. If the dimension of the LQG controller is not reduced, that is if n=n_r, then \tau(t)=I_n, \tau_\perp(t)=0 and the two equations above become the uncoupled matrix Riccati differential equations associated to the conventional full-order LQG controller. If n_r the two equations are coupled by the oblique projection \tau(t). This reveals why the reduced-order LQG problem is not separable. The oblique projection \tau(t) is determined from two additional matrix differential equations which involve rank conditions. Together with the previous two matrix differential equations these are the OPE. To state the additional two matrix differential equations it is convenient to introduce the following two matrices: : \Psi_1(t)=(A(t)-B(t)R^(t)B'(t)S(t))\hat(t)+\hat(t) (A(t)-B(t)R^(t)B'(t)S(t))' :::: +P(t)C'(t)W^(t)C(t)P(t), : \Psi_2(t)=(A(t)-P(t)C'(t)W^(t) C(t))'\hat(t)+\hat(t)(A(t)-P(t)C'(t)W^(t)C(t)) :::: +S(t)B(t)R^(t)B'(t)S(t). Then the two additional matrix differential equations that complete the OPE are as follows: : \dot(t)=1/2 \left(\tau(t)\Psi_1(t)+\Psi_1(t)\tau'(t) \right),\hat(0)=E((0))E((0))', \operatorname(\hat(t))=n_r almost everywhere, : -\dot(t)=1/2 \left(\tau'(t)\Psi_2(t)+\Psi_2(t)\tau(t) \right),\hat(T)=0, \operatorname(\hat(t))=n_r almost everywhere, with : \tau(t)= \hat(t) \hat(t) \left( \hat(t) \hat(t) \right)^*. Here * denotes the group generalized inverse or
Drazin inverse In mathematics, the Drazin inverse, named after Michael P. Drazin, is a kind of generalized inverse of a matrix. Let ''A'' be a square matrix. The index of ''A'' is the least nonnegative integer ''k'' such that rank(''A'k''+1) = rank(''A'k'') ...
that is unique and given by : A^*=A(A^3)^+A. where + denotes the Moore–Penrose pseudoinverse. The matrices P(t),S(t),\hat(t),\hat(t) must all be nonnegative symmetric. Then they constitute a solution of the OPE that determines the reduced-order LQG controller matrices F_r(t), K_r(t), L_r(t) and _r(0): : F_r(t)=H(t)\left( A(t)-P(t)C'(t)W^(t) C(t)-B(t)R^(t)B'(t)S(t) \right)G(t)+\dot(t)G'(t), : K_r(t)=H(t)P(t)C'(t)W^(t), : L_r(t)=R^(t)B'(t)S(t)G'(t), : _r(0)=H(0)E((0)). In the equations above the matrices G(t),H(t) are two matrices with the following properties: : G'(t)H(t)=\tau(t),G(t)H'(t)=I_ almost everywhere. They can be obtained from a projective factorization of \hat(t)\hat(t). The OPE can be stated in many different ways that are all equivalent. To identify the equivalent representations the following identities are especially useful: : \tau(t)\hat(t)=\hat(t)\tau'(t)=\hat(t), \tau'(t)\hat(t)=\hat(t)\tau(t)=\hat(t) Using these identities one may for instance rewrite the first two of the optimal projection equations as follows: : \dot(t) = A(t)P(t)+P(t)A'(t)-P(t)C'(t)W^(t)C(t)P(t)+V(t)+\tau_\perp(t)\Psi_1(t)\tau'_\perp (t), : P(0)= E \left((0)'(0) \right), : -\dot(t) = A'(t)S(t)+S(t)A(t)-S(t)B(t)R^(t)B'(t)S(t)+Q(t)+\tau'_\perp\Psi_2(t)\tau_\perp(t), : S(T) = F. This representation is both relatively simple and suitable for numerical computations. If all the matrices in the reduced-order LQG problem formulation are time-invariant and if the horizon T tends to infinity, the optimal reduced-order LQG controller becomes time-invariant and so do the OPE. In that case the derivatives on the left hand side of the OPE are zero.


Discrete-time

Similar to the continuous-time case, in the discrete-time case the difference with the conventional discrete-time full-order LQG problem is the a-priori fixed reduced-order n_r of the LQG controller state dimension. As in continuous-time, to state the discrete-time OPE it is convenient to introduce the following two matrices: : \Psi^1_i=\left(A_i-B_i(B'_iS_B_i+R_i)^B'_iS_A_i)\right)\hat_i \left(A_i-B_i(B'_iS_B_i+R_i)^B'_iS_A_i)\right)' :::: +A_iP_iC'_i(C_iP_iC'_i+W_i)^C_iP_iA'_i : \Psi^2_=\left(A_i-A_iP_iC'_i(C_iP_C'_i+W_i)^C_i\right)'\hat_ \left(A_i-A_iP_iC'_i(C_iP_C'_i+W_i)^C_i\right) :::: +A'_iS_B_i(B'_iS_B_i+R_i)^B'_iS_A_i Then the discrete-time OPE is : P_ = A_i \left( P_i - P_i C'_i \left( C_i P_i C'_i+W_i \right)^ C_i P_i \right) A'_i+V_i+\tau_\Psi^1_i \tau'_, P_0=E \left( _0_0 \right). : S_i = A'_i \left( S_ - S_B_i \left( B'_iS_B_i+R_i \right)^ B'_i S_ \right) A_i+Q_i+\tau'_\Psi^2_ \tau_, S_N=F. : \hat_=1/2(\tau_\Psi_i^1+\Psi_i^1\tau'_),\hat_0=E((0)) E((0))', \operatorname(\hat_i)=n_r almost everywhere, : \hat_=1/2(\tau'_i \Psi_^2+\Psi_^2\tau_i),\hat_N=0, \operatorname(\hat_i)=n_r almost everywhere. The oblique projection matrix is given by : \tau_i=\hat_i\hat_i \left(\hat_i\hat_i \right)^*. The nonnegative symmetric matrices P_i,S_i,\hat_i,\hat_i that solve the discrete-time OPE determine the reduced-order LQG controller matrices F_i^r, K_i^r, L_i^r and _0^r: : F_i^r=H_\left( A_i-P_i C'_i \left( C_i P_i C'_i+W_i \right)^C_i-B_i\left( B'_iS_B_i+R_i \right)^ B'_i S_\right)G'_i, : K_i^r=H_P_i C'_i \left( C_i P_i C'_i+W_i \right)^, : L_i^r=\left( B'_iS_B_i+R_i \right)^ B'_i S_G'_i, : _0^r=H_0E(_0). In the equations above the matrices G_i,H_i are two matrices with the following properties: : G'_iH_i=\tau_i, G_iH'_i=I_ almost everywhere. They can be obtained from a projective factorization of \hat_i\hat_i. To identify equivalent representations of the discrete-time OPE the following identities are especially useful: : \tau_i\hat_i=\hat_i\tau'_i=\hat_i, \tau'_i\hat_i=\hat_i\tau_i=\hat{S}_i As in the continuous-time case if all the matrices in the problem formulation are time-invariant and if the horizon N tends to infinity the reduced-order LQG controller becomes time-invariant. Then the discrete-time OPE converge to a steady state solution that determines the time-invariant reduced-order LQG controller. The discrete-time OPE apply also to discrete-time systems with variable state, input and output dimensions (discrete-time systems with time-varying dimensions). Such systems arise in the case of digital controller design if the sampling occurs asynchronously.


References

Optimal control Control theory Stochastic control