The field of system identification uses
statistical method
Statistics (from German: ', "description of a state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a scientific, industrial, or social ...
s to build
mathematical model
A mathematical model is an abstract and concrete, abstract description of a concrete system using mathematics, mathematical concepts and language of mathematics, language. The process of developing a mathematical model is termed ''mathematical m ...
s of
dynamical system
In mathematics, a dynamical system is a system in which a Function (mathematics), function describes the time dependence of a Point (geometry), point in an ambient space, such as in a parametric curve. Examples include the mathematical models ...
s from measured data. System identification also includes the
optimal design of experiments
The design of experiments (DOE), also known as experiment design or experimental design, is the design of any task that aims to describe and explain the variation of information under conditions that are hypothesized to reflect the variation. ...
for efficiently generating informative data for
fitting such models as well as model reduction. A common approach is to start from measurements of the behavior of the system and the external influences (inputs to the system) and try to determine a mathematical relation between them without going into many details of what is actually happening inside the system; this approach is called
black box
In science, computing, and engineering, a black box is a system which can be viewed in terms of its inputs and outputs (or transfer characteristics), without any knowledge of its internal workings. Its implementation is "opaque" (black). The te ...
system identification.
Overview
A dynamic mathematical model in this context is a mathematical description of the dynamic behavior of a
system
A system is a group of interacting or interrelated elements that act according to a set of rules to form a unified whole. A system, surrounded and influenced by its open system (systems theory), environment, is described by its boundaries, str ...
or process in either the time or frequency domain. Examples include:
*
physical processes such as the movement of a falling body under the influence of
gravity
In physics, gravity (), also known as gravitation or a gravitational interaction, is a fundamental interaction, a mutual attraction between all massive particles. On Earth, gravity takes a slightly different meaning: the observed force b ...
;
*
economic
An economy is an area of the Production (economics), production, Distribution (economics), distribution and trade, as well as Consumption (economics), consumption of Goods (economics), goods and Service (economics), services. In general, it is ...
processes such as international
trade
Trade involves the transfer of goods and services from one person or entity to another, often in exchange for money. Economists refer to a system or network that allows trade as a market.
Traders generally negotiate through a medium of cr ...
markets that react to external influences.
One of the many possible applications of system identification is in
control systems
A control system manages, commands, directs, or regulates the behavior of other devices or systems using control loops. It can range from a single home heating controller using a thermostat controlling a domestic boiler to large industrial co ...
. For example, it is the basis for modern
data-driven control system
Data-driven control systems are a broad family of Control theory, control systems, in which the System identification, identification of the process model and/or the design of the controller are based entirely on ''experimental data'' collected fro ...
s, in which concepts of system identification are integrated into the controller design, and lay the foundations for formal controller optimality proofs.
Input-output vs output-only
System identification techniques can utilize both input and output data (e.g.
eigensystem realization algorithm) or can include only the output data (e.g.
frequency domain decomposition). Typically an input-output technique would be more accurate, but the input data is not always available. In addition, the final estimated responses from arbitrary inputs can be analyzed by investigating their correlation and spectral properties.
Optimal design of experiments
The quality of system identification depends on the quality of the inputs, which are under the control of the systems engineer. Therefore, systems engineers have long used the principles of the
design of experiments
The design of experiments (DOE), also known as experiment design or experimental design, is the design of any task that aims to describe and explain the variation of information under conditions that are hypothesized to reflect the variation. ...
. In recent decades, engineers have increasingly used the theory of
optimal experimental design to specify inputs that yield
maximally precise estimator
In statistics, an estimator is a rule for calculating an estimate of a given quantity based on Sample (statistics), observed data: thus the rule (the estimator), the quantity of interest (the estimand) and its result (the estimate) are distinguish ...
s.
White-, grey-, and black-box

One could build a
white-box model based on
first principles
In philosophy and science, a first principle is a basic proposition or assumption that cannot be deduced from any other proposition or assumption. First principles in philosophy are from first cause attitudes and taught by Aristotelians, and nuan ...
, e.g. a model for a physical process from the
Newton equations, but in many cases, such models will be overly complex and possibly even impossible to obtain in reasonable time due to the complex nature of many systems and processes.
A more common approach is therefore to start from measurements of the behavior of the system and the external influences (inputs to the system) and try to determine a mathematical relation between them without going into the details of what is actually happening inside the system. This approach is called system identification. Two types of models are common in the field of system identification:
* grey box model: although the peculiarities of what is going on inside the system are not entirely known, a certain model based on both insight into the system and experimental data is constructed. This model does however still have a number of unknown free
parameter
A parameter (), generally, is any characteristic that can help in defining or classifying a particular system (meaning an event, project, object, situation, etc.). That is, a parameter is an element of a system that is useful, or critical, when ...
s which can be estimated using system identification.
One example uses the
Monod saturation model for microbial growth. The model contains a simple hyperbolic relationship between substrate concentration and growth rate, but this can be justified by molecules binding to a substrate without going into detail on the types of molecules or types of binding. Grey box modeling is also known as semi-physical modeling.
*
black box
In science, computing, and engineering, a black box is a system which can be viewed in terms of its inputs and outputs (or transfer characteristics), without any knowledge of its internal workings. Its implementation is "opaque" (black). The te ...
model: No prior model is available. Most system identification algorithms are of this type.
In the context of
nonlinear system identification System identification is a method of identifying or measuring the mathematical model of a system from measurements of the system inputs and outputs. The applications of system identification include any system where the inputs and outputs can be mea ...
Jin et al. describe grey-box modeling by assuming a model structure a priori and then estimating the model parameters. Parameter estimation is relatively easy if the model form is known but this is rarely the case. Alternatively, the structure or model terms for both linear and highly complex nonlinear models can be identified using
NARMAX methods. This approach is completely flexible and can be used with grey box models where the algorithms are primed with the known terms, or with completely black-box models where the model terms are selected as part of the identification procedure. Another advantage of this approach is that the algorithms will just select linear terms if the system under study is linear, and nonlinear terms if the system is nonlinear, which allows a great deal of flexibility in the identification.
Identification for control
In
control systems
A control system manages, commands, directs, or regulates the behavior of other devices or systems using control loops. It can range from a single home heating controller using a thermostat controlling a domestic boiler to large industrial co ...
applications, the objective of engineers is to obtain a
good performance of the
closed-loop system, which is the one comprising the physical system, the feedback loop and the controller. This performance is typically achieved by designing the control law relying on a model of the system, which needs to be identified starting from experimental data. If the model identification procedure is aimed at control purposes, what really matters is not to obtain the best possible model that fits the data, as in the classical system identification approach, but to obtain a model satisfying enough for the closed-loop performance. This more recent approach is called identification for control, or I4C in short.
The idea behind I4C can be better understood by considering the following simple example. Consider a system with ''true''
transfer function
In engineering, a transfer function (also known as system function or network function) of a system, sub-system, or component is a function (mathematics), mathematical function that mathematical model, models the system's output for each possible ...
:
:
and an identified model
:
:
From a classical system identification perspective,
is ''not'', in general, a ''good'' model for
. In fact, modulus and phase of
are different from those of
at low frequency. What is more, while
is an
asymptotically stable system,
is a simply stable system. However,
may still be a model good enough for control purposes. In fact, if one wants to apply a
purely proportional negative feedback
Negative feedback (or balancing feedback) occurs when some function (Mathematics), function of the output of a system, process, or mechanism is feedback, fed back in a manner that tends to reduce the fluctuations in the output, whether caused ...
controller with high gain
, the closed-loop transfer function from the reference to the output is, for
:
and for
:
Since
is very large, one has that
. Thus, the two closed-loop transfer functions are indistinguishable. In conclusion,
is a ''perfectly acceptable'' identified model for the ''true'' system if such feedback control law has to be applied. Whether or not a model is ''appropriate'' for control design depends not only on the plant/model mismatch but also on the controller that will be implemented. As such, in the I4C framework, given a control performance objective, the control engineer has to design the identification phase in such a way that the performance achieved by the model-based controller on the ''true'' system is as high as possible.
Sometimes, it is even more convenient to design a controller without explicitly identifying a model of the system, but directly working on experimental data. This is the case of ''direct''
data-driven control system
Data-driven control systems are a broad family of Control theory, control systems, in which the System identification, identification of the process model and/or the design of the controller are based entirely on ''experimental data'' collected fro ...
s.
Forward model
A common understanding in Artificial Intelligence is that the
controller has to generate the next move for a
robot
A robot is a machine—especially one Computer program, programmable by a computer—capable of carrying out a complex series of actions Automation, automatically. A robot can be guided by an external control device, or the robot control, co ...
. For example, the robot starts in the maze and then the robot decides to move forward. Model predictive control determines the next action indirectly. The term
"model" is referencing to a forward model which doesn't provide the correct action but simulates a scenario. A forward model is equal to a
physics engine
A physics engine is computer software that provides an approximate simulation of certain physical systems, typically classical dynamics, including rigid body dynamics (including collision detection), soft body dynamics, and fluid dynamics. I ...
used in game programming. The model takes an input and calculates the future state of the system.
The reason why dedicated forward models are constructed is because it allows one to divide the overall control process. The first question is how to predict the future states of the system. That means, to simulate a
plant
Plants are the eukaryotes that form the Kingdom (biology), kingdom Plantae; they are predominantly Photosynthesis, photosynthetic. This means that they obtain their energy from sunlight, using chloroplasts derived from endosymbiosis with c ...
over a timespan for different input values. And the second task is to search for a
sequence
In mathematics, a sequence is an enumerated collection of objects in which repetitions are allowed and order matters. Like a set, it contains members (also called ''elements'', or ''terms''). The number of elements (possibly infinite) is cal ...
of input values which brings the plant into a goal state. This is called predictive control.
The forward model is the most important aspect of a
MPC-controller. It has to be created before the
solver
A solver is a piece of mathematical software, possibly in the form of a stand-alone computer program or as a Library (computing), software library, that 'solves' a mathematical problem. A solver takes problem descriptions in some sort of generic ...
can be realized. If it's unclear what the behavior of a system is, it's not possible to search for meaningful actions. The workflow for creating a forward model is called system identification. The idea is to
formalize a system in a set of equations which will behave like the original system. The error between the real system and the forward model can be measured.
There are many techniques available to create a forward model:
ordinary differential equation
In mathematics, an ordinary differential equation (ODE) is a differential equation (DE) dependent on only a single independent variable (mathematics), variable. As with any other DE, its unknown(s) consists of one (or more) Function (mathematic ...
s is the classical one which is used in
physics engine
A physics engine is computer software that provides an approximate simulation of certain physical systems, typically classical dynamics, including rigid body dynamics (including collision detection), soft body dynamics, and fluid dynamics. I ...
s like
Box2D. A more recent technique is a
neural network
A neural network is a group of interconnected units called neurons that send signals to one another. Neurons can be either biological cells or signal pathways. While individual neurons are simple, many of them together in a network can perfor ...
for creating the forward model.
See also
*
Black box model of power converter
*
Black box
In science, computing, and engineering, a black box is a system which can be viewed in terms of its inputs and outputs (or transfer characteristics), without any knowledge of its internal workings. Its implementation is "opaque" (black). The te ...
*
Data-driven control system
Data-driven control systems are a broad family of Control theory, control systems, in which the System identification, identification of the process model and/or the design of the controller are based entirely on ''experimental data'' collected fro ...
*
Generalized filtering
*
Grey box completion and validation
*
Hysteresis
Hysteresis is the dependence of the state of a system on its history. For example, a magnet may have more than one possible magnetic moment in a given magnetic field, depending on how the field changed in the past. Plots of a single component of ...
*
Linear time-invariant system theory
*
Model order reduction
*
Model selection
Model selection is the task of selecting a model from among various candidates on the basis of performance criterion to choose the best one.
In the context of machine learning and more generally statistical analysis, this may be the selection of ...
*
Nonlinear autoregressive exogenous model
*
Open system (systems theory)
An open system is a system that has external interactions. Such interactions can take the form of information, energy, or material transfers into or out of the system boundary, depending on the discipline which defines the concept. An open syste ...
*
Parameter estimation
*
Pattern recognition
Pattern recognition is the task of assigning a class to an observation based on patterns extracted from data. While similar, pattern recognition (PR) is not to be confused with pattern machines (PM) which may possess PR capabilities but their p ...
*
Structural identifiability
In the area of system identification, a dynamical system is structurally identifiable if it is possible to infer its unknown parameters by measuring its output over time. This problem arises in many branch of applied mathematics, since dynamical sy ...
*
System dynamics
*
System realization
*
Systems theory
Systems theory is the Transdisciplinarity, transdisciplinary study of systems, i.e. cohesive groups of interrelated, interdependent components that can be natural or artificial. Every system has causal boundaries, is influenced by its context, de ...
References
Further reading
*
* Daniel Graupe: ''Identification of Systems'', Van Nostrand Reinhold, New York, 1972 (2nd ed., Krieger Publ. Co., Malabar, FL, 1976)
* Eykhoff, Pieter: ''System Identification – Parameter and System Estimation'', John Wiley & Sons, New York, 1974.
*
Lennart Ljung: ''System Identification — Theory For the User'', 2nd ed, PTR
Prentice Hall
Prentice Hall was a major American publishing#Textbook_publishing, educational publisher. It published print and digital content for the 6–12 and higher-education market. It was an independent company throughout the bulk of the twentieth cen ...
, Upper Saddle River, N.J., 1999.
* Jer-Nan Juang: ''Applied System Identification'', Prentice-Hall, Upper Saddle River, N.J., 1994.
*
* Oliver Nelles: ''Nonlinear System Identification'', Springer, 2001.
* T. Söderström,
P. Stoica, System Identification, Prentice Hall, Upper Saddle River, N.J., 1989.
* R. Pintelon, J. Schoukens, ''System Identification: A Frequency Domain Approach'', 2nd Edition, IEEE Press, Wiley, New York, 2012.
* Spall, J. C. (2003), ''Introduction to Stochastic Search and Optimization: Estimation, Simulation, and Control'', Wiley, Hoboken, NJ.
*
External links
L. Ljung: Perspectives on System Identification, July 2008System Identification and Model Reduction via Empirical Gramians
{{Statistics, applications, state=collapsed
Classical control theory
Dynamical systems
Engineering statistics
Identification
Identification
Biological models