control theory Control theory is a field of mathematics that deals with the control of dynamical systems in engineered processes and machines. The objective is to develop a model or algorithm governing the application of system inputs to drive the system to a ...

, a state observer or state estimator is a system that provides an estimate of the internal state of a given real system, from measurements of the input and output of the real system. It is typically computer-implemented, and provides the basis of many practical applications. Knowing the system state is necessary to solve many

problems; for example, stabilizing a system using state feedback. In most practical cases, the physical state of the system cannot be determined by direct observation. Instead, indirect effects of the internal state are observed by way of the system outputs. A simple example is that of vehicles in a tunnel: the rates and velocities at which vehicles enter and leave the tunnel can be observed directly, but the exact state inside the tunnel can only be estimated. If a system is

observable In physics, an observable is a physical quantity that can be measured. Examples include position and momentum. In systems governed by classical mechanics, it is a real-valued "function" on the set of all possible system states. In quantum phy ...

, it is possible to fully reconstruct the system state from its output measurements using the state observer.

Typical observer model

Linear, delayed, sliding mode, high gain, Tau, homogeneity-based, extended and cubic observers are among several observer structures used for state estimation of linear and nonlinear systems. A linear observer structure is described in the following sections.

Discrete-time case

The state of a linear, time-invariant physical discrete-time system is assumed to satisfy :

x(k+1) = A x(k) + B u(k)

y(k) = C x(k) + D u(k)

where, at time

k

x(k)

is the plant's state;

u(k)

is its inputs; and

y(k)

is its outputs. These equations simply say that the plant's current outputs and its future state are both determined solely by its current states and the current inputs. (Although these equations are expressed in terms of

discrete Discrete may refer to: *Discrete particle or quantum in physics, for example in quantum theory *Discrete device, an electronic component with just one circuit element, either passive or active, other than an integrated circuit *Discrete group, a g ...

time steps, very similar equations hold for

continuous Continuity or continuous may refer to: Mathematics * Continuity (mathematics), the opposing concept to discreteness; common examples include ** Continuous probability distribution or random variable in probability and statistics ** Continuous ...

systems). If this system is

then the output of the plant,

y(k)

, can be used to steer the state of the state observer. The observer model of the physical system is then typically derived from the above equations. Additional terms may be included in order to ensure that, on receiving successive measured values of the plant's inputs and outputs, the model's state converges to that of the plant. In particular, the output of the observer may be subtracted from the output of the plant and then multiplied by a matrix

L

; this is then added to the equations for the state of the observer to produce a so-called '' Luenberger observer'', defined by the equations below. Note that the variables of a state observer are commonly denoted by a "hat":

\hat(k)

and

\hat(k)

to distinguish them from the variables of the equations satisfied by the physical system. :

\hat(k+1) = A \hat(k) + L \left (k) - \hat(k)\right + B u(k)

\hat(k) = C \hat(k) + D u(k)

The observer is called asymptotically stable if the observer error

e(k) = \hat(k) - x(k)

converges to zero when

k \to \infty

. For a Luenberger observer, the observer error satisfies

e(k+1) = (A - LC) e(k)

. The Luenberger observer for this discrete-time system is therefore asymptotically stable when the matrix

A - LC

has all the eigenvalues inside the unit circle. For control purposes the output of the observer system is fed back to the input of both the observer and the plant through the gains matrix

K

. :

u(k)= -K \hat(k)

The observer equations then become: :

\hat(k+1) = A \hat(k) + L \left(y(k) - \hat(k)\right) - B K \hat(k)

\hat(k) = C \hat(k) - D K \hat(k)

or, more simply, :

\hat(k+1) = \left(A - B K \right) \hat(k) + L \left(y(k) - \hat(k)\right)

\hat(k) = \left(C - D K\right) \hat(k)

Due to the

separation principle In control theory, a separation principle, more formally known as a principle of separation of estimation and control, states that under some assumptions the problem of designing an optimal feedback controller for a stochastic system can be solved b ...

we know that we can choose

K

and

L

independently without harm to the overall stability of the systems. As a rule of thumb, the poles of the observer

A-LC

are usually chosen to converge 10 times faster than the poles of the system

A-BK

Continuous-time case

The previous example was for an observer implemented in a discrete-time LTI system. However, the process is similar for the continuous-time case; the observer gains

L

are chosen to make the continuous-time error dynamics converge to zero asymptotically (i.e., when

A-LC

is a

Hurwitz matrix In mathematics, a Hurwitz matrix, or Routh–Hurwitz matrix, in engineering stability matrix, is a structured real square matrix constructed with coefficients of a real polynomial. Hurwitz matrix and the Hurwitz stability criterion Namely, given a ...

). For a continuous-time linear system :

\dot = A x + B u,

y = C x + D u,

where

x \in \mathbb^n, u \in \mathbb^m ,y \in \mathbb^r

, the observer looks similar to discrete-time case described above: :

\dot = A \hat+ B u + L \left(y - \hat\right)

. :

\hat = C \hat + D u,

The observer error

e=x-\hat

satisfies the equation :

\dot = (A - LC) e

. The eigenvalues of the matrix

A-LC

can be chosen arbitrarily by appropriate choice of the observer gain

L

when the pair

,C /math> is observable, i.e.

observability Observability is a measure of how well internal states of a system can be inferred from knowledge of its external outputs. In control theory, the observability and controllability of a linear system are mathematical duals. The concept of observ ...

condition holds. In particular, it can be made Hurwitz, so the observer error

e(t) \to 0

when

t \to \infty

Peaking and other observer methods

When the observer gain

L

is high, the linear Luenberger observer converges to the system states very quickly. However, high observer gain leads to a peaking phenomenon in which initial estimator error can be prohibitively large (i.e., impractical or unsafe to use). As a consequence, nonlinear high-gain observer methods are available that converge quickly without the peaking phenomenon. For example,

sliding mode control In control systems, sliding mode control (SMC) is a nonlinear control method that alters the dynamics of a nonlinear system by applying a discontinuous control signal (or more rigorously, a set-valued control signal) that forces the system to ...

can be used to design an observer that brings one estimated state's error to zero in finite time even in the presence of measurement error; the other states have error that behaves similarly to the error in a Luenberger observer after peaking has subsided. Sliding mode observers also have attractive noise resilience properties that are similar to a

Kalman filter For statistics and control theory, Kalman filtering, also known as linear quadratic estimation (LQE), is an algorithm that uses a series of measurements observed over time, including statistical noise and other inaccuracies, and produces estima ...

. Another approach is to apply multi observer, that significantly improves transients and reduces observer overshoot. Multi-observer can be adapted to every system where high-gain observer is applicable.

State observers for nonlinear systems

High gain, sliding mode and extended observers are the most common observers for nonlinear systems. To illustrate the application of sliding mode observers for nonlinear systems, first consider the no-input non-linear system: :

\dot = f(x)

where

x \in \mathbb^n

. Also assume that there is a measurable output

y \in \mathbb

given by :

y = h(x).

There are several non-approximate approaches for designing an observer. The two observers given below also apply to the case when the system has an input. That is, :

\dot = f(x) + B(x) u

y = h(x).

Linearizable error dynamics

One suggestion by Krener and Isidori and Krener and Respondek can be applied in a situation when there exists a linearizing transformation (i.e., a

diffeomorphism In mathematics, a diffeomorphism is an isomorphism of smooth manifolds. It is an invertible function that maps one differentiable manifold to another such that both the function and its inverse are differentiable. Definition Given two ...

, like the one used in

feedback linearization Feedback linearization is a common strategy employed in nonlinear control to control nonlinear systems. Feedback linearization techniques may be applied to nonlinear control systems of the form where x(t) \in \mathbb^n is the state, u_1(t), ...

)

z=\Phi(x)

such that in new variables the system equations read :

\dot = A z+ \phi(y),

y = Cz.

The Luenberger observer is then designed as :

\dot = A \hat+ \phi(y) - L \left(C \hat-y \right)

. The observer error for the transformed variable

e=\hat-z

satisfies the same equation as in classical linear case. :

\dot = (A - LC) e

. As shown by Gauthier, Hammouri, and Othman and Hammouri and Kinnaert, if there exists transformation

z=\Phi(x)

such that the system can be transformed into the form :

\dot = A(u(t)) z+ \phi(y,u(t) ),

y = Cz,

then the observer is designed as :

\dot = A(u(t)) \hat+ \phi(y,u(t) ) - L(t) \left(C \hat-y \right)

, where

L(t)

is a time-varying observer gain. Ciccarella, Dalla Mora, and Germani obtained more advanced and general results, removing the need for a nonlinear transform and proving global asymptotic convergence of the estimated state to the true state using only simple assumptions on regularity.

Sliding mode observer

As discussed for the linear case above, the peaking phenomenon present in Luenberger observers justifies the use of a sliding mode observer. The sliding mode observer uses non-linear high-gain feedback to drive estimated states to a

hypersurface In geometry, a hypersurface is a generalization of the concepts of hyperplane, plane curve, and surface. A hypersurface is a manifold or an algebraic variety of dimension , which is embedded in an ambient space of dimension , generally a Euclidea ...

where there is no difference between the estimated output and the measured output. The non-linear gain used in the observer is typically implemented with a scaled switching function, like the signum (i.e., sgn) of the estimated – measured output error. Hence, due to this high-gain feedback, the vector field of the observer has a crease in it so that observer trajectories ''slide along'' a curve where the estimated output matches the measured output exactly. So, if the system is

from its output, the observer states will all be driven to the actual system states. Additionally, by using the sign of the error to drive the sliding mode observer, the observer trajectories become insensitive to many forms of noise. Hence, some sliding mode observers have attractive properties similar to the

but with simpler implementation. As suggested by Drakunov, a sliding mode observer can also be designed for a class of non-linear systems. Such an observer can be written in terms of original variable estimate

\hat

and has the form :

M(\hat) \sgn( V(t) - H(\hat) )

where: * The

\sgn(\mathord)

vector extends the scalar

signum function In mathematics, the sign function or signum function (from '' signum'', Latin for "sign") is an odd mathematical function that extracts the sign of a real number. In mathematical expressions the sign function is often represented as . To avo ...

n

dimensions. That is, *::

\sgn(z) = \begin
\sgn(z_1)\\
\sgn(z_2)\\
\vdots\\
\sgn(z_i)\\
\vdots\\
\sgn(z_n)
\end

*: for the vector

z \in \mathbb^n

. * The vector

H(x)

has components that are the output function

h(x)

and its repeated Lie derivatives. In particular, *::

H(x) \triangleq
\begin
h_1(x)\\
h_2(x)\\
h_3(x)\\
\vdots\\
h_n(x)
\end
\triangleq
\begin
h(x)\\
L_h(x)\\
L_^2 h(x)\\
\vdots\\
L_^h(x)
\end

*: where

L^i_f h

is the ''i''^th

Lie derivative In differential geometry, the Lie derivative ( ), named after Sophus Lie by Władysław Ślebodziński, evaluates the change of a tensor field (including scalar functions, vector fields and one-forms), along the flow defined by another vector fi ...

of output function

h

along the vector field

f

(i.e., along

x

trajectories of the non-linear system). In the special case where the system has no input or has a relative degree of ''n'',

H(x(t))

is a collection of the output

y(t)=h(x(t))

and its

n-1

derivatives. Because the inverse of the Jacobian linearization of

H(x)

must exist for this observer to be well defined, the transformation

H(x)

is guaranteed to be a local

. * The

diagonal matrix In linear algebra, a diagonal matrix is a matrix in which the entries outside the main diagonal are all zero; the term usually refers to square matrices. Elements of the main diagonal can either be zero or nonzero. An example of a 2×2 diagonal m ...

M(\hat)

of gains is such that *::

M(\hat) \triangleq
\operatorname( m_1(\hat), m_2(\hat), \ldots, m_n(\hat) )
=
\begin
m_1(\hat) & & & & & \\
& m_2(\hat) & & & & \\
& & \ddots & & & \\
& & & m_i(\hat) & &\\
& & & & \ddots &\\
& & & & & m_n(\hat)
\end

*: where, for each

i \in \

, element

m_i(\hat) > 0

and suitably large to ensure reachability of the sliding mode. * The observer vector

V(t)

is such that *::

V(t)
\triangleq
\beginv_(t)\\
v_2(t)\\
v_3(t)\\
\vdots\\
v_i(t)\\
\vdots\\
v_(t)
\end
\triangleq
\begin
y(t)\\
\_\\
\_\\
\vdots\\
\_\\
\vdots\\
\_
\end

*: where

\sgn(\mathord)

here is the normal

defined for scalars, and

\_

denotes an "equivalent value operator" of a discontinuous function in sliding mode. The idea can be briefly explained as follows. According to the theory of sliding modes, in order to describe the system behavior, once sliding mode starts, the function

\sgn( v_(t)\!-\! h_(\hat(t)) )

should be replaced by equivalent values (see ''equivalent control'' in the theory of sliding modes). In practice, it switches (chatters) with high frequency with slow component being equal to the equivalent value. Applying appropriate lowpass filter to get rid of the high frequency component on can obtain the value of the equivalent control, which contains more information about the state of the estimated system. The observer described above uses this method several times to obtain the state of the nonlinear system ideally in finite time. The modified observation error can be written in the transformed states

e=H(x)-H(\hat)

. In particular, :

\begin
\dot
&=
\frac H(x)
-
\frac H(\hat)\\
&=
\frac H(x)
-
M(\hat) \, \sgn( V(t) - H(\hat(t)) ),
\end

and so :

\begin
\begin
\dot_1\\
\dot_2\\
\vdots\\
\dot_i\\
\vdots\\
\dot_\\
\dot_n
\end
&=
\mathord
-
\mathord
=
\begin
h_2(x)\\
h_3(x)\\
\vdots\\
h_(x)\\
\vdots\\
h_n(x)\\
L_f^n h(x)
\end
-
\begin
m_1 \sgn( v_1(t) - h_1(\hat(t)) )\\
m_2 \sgn( v_2(t) - h_2(\hat(t)) )\\
\vdots\\
m_i \sgn( v_i(t) - h_i(\hat(t)) )\\
\vdots\\
m_ \sgn( v_(t) - h_(\hat(t)) )\\
m_n \sgn( v_n(t) - h_n(\hat(t)) )
\end\\
&=
\begin
h_2(x) - m_1(\hat) \sgn( \mathord )\\
h_3(x) - m_2(\hat) \sgn( v_2(t) - h_2(\hat(t)) )\\
\vdots\\
h_(x) - m_i(\hat) \sgn( v_i(t) - h_i(\hat(t)) )\\
\vdots\\
h_n(x) - m_(\hat) \sgn( v_(t) - h_(\hat(t)) )\\
L_f^n h(x) - m_n(\hat) \sgn( v_n(t) - h_n(\hat(t)) )
\end.
\end

So: # As long as

m_1(\hat) \geq , h_2(x(t)),

, the first row of the error dynamics,

\dot_1 = h_2(\hat) - m_1(\hat) \sgn( e_1 )

, will meet sufficient conditions to enter the

e_1 = 0

sliding mode in finite time. # Along the

e_1 = 0

surface, the corresponding

v_2(t) = \_

equivalent control will be equal to

h_2(x)

, and so

v_2(t) - h_2(\hat) = h_2(x) - h_2(\hat) = e_2

. Hence, so long as

m_2(\hat) \geq , h_3(x(t)),

, the second row of the error dynamics,

\dot_2 = h_3(\hat) - m_2(\hat) \sgn( e_2 )

, will enter the

e_2 = 0

sliding mode in finite time. # Along the

e_i = 0

surface, the corresponding

v_(t) = \_

equivalent control will be equal to

h_(x)

. Hence, so long as

m_(\hat) \geq , h_(x(t)),

, the

(i+1)

^th row of the error dynamics,

\dot_ = h_(\hat) - m_(\hat) \sgn( e_ )

, will enter the

e_ = 0

sliding mode in finite time. So, for sufficiently large

m_i

gains, all observer estimated states reach the actual states in finite time. In fact, increasing

m_i

allows for convergence in any desired finite time so long as each

, h_i(x(0)),

function can be bounded with certainty. Hence, the requirement that the map

H:\mathbb^n \to \mathbb^n

is a

(i.e., that its Jacobian linearization is invertible) asserts that convergence of the estimated output implies convergence of the estimated state. That is, the requirement is an observability condition. In the case of the sliding mode observer for the system with the input, additional conditions are needed for the observation error to be independent of the input. For example, that :

\frac B(x)

does not depend on time. The observer is then :

M(\hat) \sgn(V(t) - H(\hat))+B(\hat)u.

Multi-observer

Multi-observer extends the high-gain observer structure from single to multi observer, with many models working simultaneously. This has two layers: the first consists of multiple high-gain observers with different estimation states, and the second determines the importance weights of the first layer observers. The algorithm is simple to implement and does not contain any risky operations like differentiation. The idea of multiple models was previously applied to obtain information in adaptive control. Multi observer.png, Multi-observer schema Assuming that the number of high-gain observers equals

n+1

, :

\dot_k(t) = A \hat(t)+ B \phi_0(\hat(t), u(t)) - L (\hat(t)-y(t))

\hat(t) = C \hat(t)

where

k = 1, \dots, n + 1

is the observer index. The first layer observers consists of the same gain

L

but they differ with the initial state

x_k(0)

. In the second layer all

x_k(t)

from

k = 1...n + 1

observers are combined into one to obtain single state vector estimation :

\hat(t) = \sum\limits_^ \alpha_k(t) \hat(t)

where

\alpha_k \in \mathbb

are weight factors. These factors are changed to provide the estimation in the second layer and to improve the observation process. Let assume that :

\sum\limits_^ \alpha_k(t) \xi_k(t) = 0

and :

\sum\limits_^ \alpha_k(t) = 1

where

\xi_k \in \mathbb^

is some vector that depends on

kth

observer error

e_k(t)

. Some transformation yields to linear regression problem :

T \begin \alpha_1(t)\\ \vdots \\ \alpha_k(t)\\ \vdots\\ \alpha_n(t) \end

This formula gives possibility to estimate

\alpha_k (t)

. To construct manifold we need mapping

m: \mathbb^ \to \mathbb^

between

\xi_k (t) = m(e_k(t))

and ensurance that

\xi_k (t)

is calculable relying on measurable signals. First thing is to eliminate parking phenomenon for

\alpha_k(t)

from observer error :

e_(t) =  \sum\limits_^ \alpha_k(t) e_k(t)

. Calculate

n

times derivative on

\eta_k(t)=\hat y_k (t) - y(t)

to find mapping m lead to

\xi_k(t)

defined as :

\xi_k (t) = \begin 
1 & 0 & 0 & \cdots & 0 \\ 
CL & 1 & 0 & \cdots & 0 \\
CAL & CL & 1 & \cdots & 0 \\
CA^L & CAL & CL & \cdots & 0 \\
\vdots & \vdots & \vdots & \ddots \\ 
CA^L & CA^L & CA^L & \cdots & 1
\end 
\begin 
\int\limits^t_  \int\limits^t_ \eta_k(\tau) d\tau\\ 
\vdots \\
\eta(t) - \eta(t-(n-1)t_d)
\end

where

t_d > 0

is some time constant. Note that

\xi_k(t)

relays on both

\eta_k(t)

and its integrals hence it is easily available in the control system. Further

\alpha_k(t)

is specified by estimation law; and thus it proves that manifold is measurable. In the second layer

\hat\alpha_k(t)

for

k = 1 \dots n + 1

is introduced as estimates of

\alpha_k(t)

coefficients. The mapping error is specified as :

e_\xi(t) = \sum\limits_^ \hat\alpha_k(t) \xi_k(t)

where

e_\xi(t) \in \mathbb^, \hat\alpha_k(t) \in \mathbb

. If coefficients

\hat\alpha(t)

are equal to

\alpha_k(t)

, then mapping error

e_\xi(t) = 0

Now it is possible to calculate

\hat x

from above equation and hence the peaking phenomenon is reduced thanks to properties of manifold. The created mapping gives a lot of flexibility in the estimation process. Even it is possible to estimate the value of

x(t)

in the second layer and to calculate the state

x

Bounding observers

Bounding or interval observers constitute a class of observers that provide two estimations of the state simultaneously: one of the estimations provides an upper bound on the real value of the state, whereas the second one provides a lower bound. The real value of the state is then known to be always within these two estimations. These bounds are very important in practical applications, as they make possible to know at each time the precision of the estimation. Mathematically, two Luenberger observers can be used, if

L

is properly selected, using, for example,

positive systems Positive systemsT. Kaczorek. Positive 1D and 2D Systems. Springer- Verlag, 2002 constitute a class of systems that has the important property that its state variables are never negative, given a positive initial state. These systems appear frequentl ...

properties: one for the upper bound

\hat_U(k)

(that ensures that

e(k) = \hat_U(k) - x(k)

converges to zero from above when

k \to \infty

, in the absence of noise and

uncertainty Uncertainty refers to epistemic situations involving imperfect or unknown information. It applies to predictions of future events, to physical measurements that are already made, or to the unknown. Uncertainty arises in partially observable ...

), and a lower bound

\hat_L(k)

(that ensures that

e(k) = \hat_L(k) - x(k)

converges to zero from below). That is, always

\hat_U(k) \ge x(k) \ge \hat_L(k)

References

; In-line references ; General references * {{refend

External links

Kalman Filter Explained Simply
Step-by-Step Tutorial of the Kalman Filter with Equations Classical control theory Time series

Typical observer model

Discrete-time case

Continuous-time case

Peaking and other observer methods

State observers for nonlinear systems

Linearizable error dynamics

Sliding mode observer

Multi-observer

Bounding observers

See also

References

External links