
Finite element method (FEM) is a popular method for numerically solving
differential equations arising in engineering and
mathematical modeling
A mathematical model is an abstract and concrete, abstract description of a concrete system using mathematics, mathematical concepts and language of mathematics, language. The process of developing a mathematical model is termed ''mathematical m ...
. Typical problem areas of interest include the traditional fields of
structural analysis
Structural analysis is a branch of solid mechanics which uses simplified models for solids like bars, beams and shells for engineering decision making. Its main objective is to determine the effect of loads on physical structures and their c ...
,
heat transfer
Heat transfer is a discipline of thermal engineering that concerns the generation, use, conversion, and exchange of thermal energy (heat) between physical systems. Heat transfer is classified into various mechanisms, such as thermal conduction, ...
,
fluid flow
In physics, physical chemistry and engineering, fluid dynamics is a subdiscipline of fluid mechanics that describes the flow of fluids – liquids and gases. It has several subdisciplines, including (the study of air and other gases in motion ...
, mass transport, and
electromagnetic potential. Computers are usually used to perform the calculations required. With high-speed
supercomputer
A supercomputer is a type of computer with a high level of performance as compared to a general-purpose computer. The performance of a supercomputer is commonly measured in floating-point operations per second (FLOPS) instead of million instruc ...
s, better solutions can be achieved and are often required to solve the largest and most complex problems.
FEM is a general
numerical method
In numerical analysis, a numerical method is a mathematical tool designed to solve numerical problems. The implementation of a numerical method with an appropriate convergence check in a programming language is called a numerical algorithm.
Mathem ...
for solving
partial differential equations
In mathematics, a partial differential equation (PDE) is an equation which involves a multivariable function and one or more of its partial derivatives.
The function is often thought of as an "unknown" that solves the equation, similar to how ...
in two- or three-space variables (i.e., some
boundary value problem
In the study of differential equations, a boundary-value problem is a differential equation subjected to constraints called boundary conditions. A solution to a boundary value problem is a solution to the differential equation which also satis ...
s). There are also studies about using FEM to solve high-dimensional problems. To solve a problem, FEM subdivides a large system into smaller, simpler parts called finite elements. This is achieved by a particular space
discretization
In applied mathematics, discretization is the process of transferring continuous functions, models, variables, and equations into discrete counterparts. This process is usually carried out as a first step toward making them suitable for numeri ...
in the space dimensions, which is implemented by the construction of a
mesh
Medical Subject Headings (MeSH) is a comprehensive controlled vocabulary for the purpose of indexing journal articles and books in the life sciences. It serves as a thesaurus of index terms that facilitates searching. Created and updated by th ...
of the object: the numerical domain for the solution that has a finite number of points. FEM formulation of a boundary value problem finally results in a system of
algebraic equation
In mathematics, an algebraic equation or polynomial equation is an equation of the form P = 0, where ''P'' is a polynomial with coefficients in some field, often the field of the rational numbers.
For example, x^5-3x+1=0 is an algebraic equati ...
s. The method approximates the unknown function over the domain. The simple equations that model these finite elements are then assembled into a larger system of equations that models the entire problem. FEM then approximates a solution by minimizing an associated error function via the
calculus of variations
The calculus of variations (or variational calculus) is a field of mathematical analysis that uses variations, which are small changes in Function (mathematics), functions
and functional (mathematics), functionals, to find maxima and minima of f ...
.
Studying or
analyzing a phenomenon with FEM is often referred to as finite element analysis (FEA).
Basic concepts
The subdivision of a whole domain into simpler parts has several advantages:
* Accurate representation of complex geometry;
* Inclusion of dissimilar material properties;
* Easy representation of the total solution; and
* Capture of local effects.
A typical approach using the method involves the following steps:
# Dividing the domain of the problem into a collection of subdomains, with each subdomain represented by a set of element equations for the original problem.
# Systematically recombining all sets of element equations into a global system of equations for the final calculation.
The global system of equations uses known solution techniques and can be calculated from the
initial value
In multivariable calculus, an initial value problem (IVP) is an ordinary differential equation together with an initial condition which specifies the value of the unknown function at a given point in the domain. Modeling a system in physics or ...
s of the original problem to obtain a numerical answer.
In the first step above, the element equations are simple equations that locally approximate the original complex equations to be studied, where the original equations are often
partial differential equation
In mathematics, a partial differential equation (PDE) is an equation which involves a multivariable function and one or more of its partial derivatives.
The function is often thought of as an "unknown" that solves the equation, similar to ho ...
s (PDEs). To explain the approximation of this process, FEM is commonly introduced as a special case of the
Galerkin method
In mathematics, in the area of numerical analysis, Galerkin methods are a family of methods for converting a continuous operator problem, such as a differential equation, commonly in a weak formulation, to a discrete problem by applying linear c ...
. The process, in mathematical language, is to construct an integral of the
inner product
In mathematics, an inner product space (or, rarely, a Hausdorff pre-Hilbert space) is a real vector space or a complex vector space with an operation called an inner product. The inner product of two vectors in the space is a scalar, ofte ...
of the residual and the
weight function
A weight function is a mathematical device used when performing a sum, integral, or average to give some elements more "weight" or influence on the result than other elements in the same set. The result of this application of a weight function is ...
s; then, set the integral to zero. In simple terms, it is a procedure that minimizes the approximation error by fitting trial functions into the PDE. The residual is the error caused by the trial functions, and the weight functions are
polynomial
In mathematics, a polynomial is a Expression (mathematics), mathematical expression consisting of indeterminate (variable), indeterminates (also called variable (mathematics), variables) and coefficients, that involves only the operations of addit ...
approximation functions that project the residual. The process eliminates all the spatial derivatives from the PDE, thus approximating the PDE locally using the following:
* a set of
algebraic equations
In mathematics, an algebraic equation or polynomial equation is an equation of the form P = 0, where ''P'' is a polynomial with coefficients in some field, often the field of the rational numbers.
For example, x^5-3x+1=0 is an algebraic equation ...
for
steady-state
In systems theory, a system or a process is in a steady state if the variables (called state variables) which define the behavior of the system or the process are unchanging in time. In continuous time, this means that for those properties ''p'' ...
problems; and
* a set of
ordinary differential equation
In mathematics, an ordinary differential equation (ODE) is a differential equation (DE) dependent on only a single independent variable (mathematics), variable. As with any other DE, its unknown(s) consists of one (or more) Function (mathematic ...
s for
transient problems.
These equation sets are element equations. They are
linear
In mathematics, the term ''linear'' is used in two distinct senses for two different properties:
* linearity of a '' function'' (or '' mapping'');
* linearity of a '' polynomial''.
An example of a linear function is the function defined by f(x) ...
if the underlying PDE is linear and vice versa. Algebraic equation sets that arise in the steady-state problems are solved using
numerical linear algebra
Numerical linear algebra, sometimes called applied linear algebra, is the study of how matrix operations can be used to create computer algorithms which efficiently and accurately provide approximate answers to questions in continuous mathemati ...
ic methods. In contrast,
ordinary differential equation
In mathematics, an ordinary differential equation (ODE) is a differential equation (DE) dependent on only a single independent variable (mathematics), variable. As with any other DE, its unknown(s) consists of one (or more) Function (mathematic ...
sets that occur in the transient problems are solved by numerical integrations using standard techniques such as
Euler's method
In mathematics and computational science, the Euler method (also called the forward Euler method) is a first-order numerical procedure for solving ordinary differential equations (ODEs) with a given initial value. It is the most basic explic ...
or the
Runge –Kutta method.
In the second step above, a global system of equations is generated from the element equations by transforming coordinates from the subdomains' local nodes to the domain's global nodes. This spatial transformation includes appropriate
orientation adjustments as applied in relation to the reference
coordinate system
In geometry, a coordinate system is a system that uses one or more numbers, or coordinates, to uniquely determine and standardize the position of the points or other geometric elements on a manifold such as Euclidean space. The coordinates are ...
. The process is often carried out using FEM software with
coordinate
In geometry, a coordinate system is a system that uses one or more numbers, or coordinates, to uniquely determine and standardize the position of the points or other geometric elements on a manifold such as Euclidean space. The coordinates are ...
data generated from the subdomains.
The practical application of FEM is known as finite element analysis (FEA). FEA, as applied in
engineering
Engineering is the practice of using natural science, mathematics, and the engineering design process to Problem solving#Engineering, solve problems within technology, increase efficiency and productivity, and improve Systems engineering, s ...
, is a computational tool for performing
engineering analysis Engineering analysis involves the application of scientific/mathematical analytic principles and processes to reveal the properties and state of a system, device or mechanism under study.
Engineering analysis is decompositional: it proceeds by se ...
. It includes the use of
mesh generation
Mesh generation is the practice of creating a polygon mesh, mesh, a subdivision of a continuous geometric space into discrete geometric and topological cells.
Often these cells form a simplicial complex.
Usually the cells partition the geometric ...
techniques for dividing a
complex problem into smaller elements, as well as the use of software coded with a FEM algorithm. When applying FEA, the complex problem is usually a physical system with the underlying
physics
Physics is the scientific study of matter, its Elementary particle, fundamental constituents, its motion and behavior through space and time, and the related entities of energy and force. "Physical science is that department of knowledge whi ...
, such as the
Euler–Bernoulli beam equation, the
heat equation
In mathematics and physics (more specifically thermodynamics), the heat equation is a parabolic partial differential equation. The theory of the heat equation was first developed by Joseph Fourier in 1822 for the purpose of modeling how a quanti ...
, or the
Navier
Claude-Louis Navier (born Claude Louis Marie Henri Navier; ; 10 February 1785 – 21 August 1836) was a French civil engineer, affiliated with the French government, and a physicist who specialized in continuum mechanics.
The Navier–Stokes eq ...
–Stokes equations, expressed in either PDEs or
integral equation
In mathematical analysis, integral equations are equations in which an unknown function appears under an integral sign. In mathematical notation, integral equations may thus be expressed as being of the form: f(x_1,x_2,x_3,\ldots,x_n ; u(x_1,x_2 ...
s, while the divided, smaller elements of the complex problem represent different areas in the physical system.
FEA may be used for analyzing problems over complicated domains (e.g., cars and oil pipelines) when the domain changes (e.g., during a solid-state reaction with a moving boundary), when the desired precision varies over the entire domain, or when the solution lacks smoothness. FEA simulations provide a valuable resource, as they remove multiple instances of creating and testing complex prototypes for various high-fidelity situations. For example, in a frontal crash simulation, it is possible to increase prediction accuracy in important areas, like the front of the car, and reduce it in the rear of the car, thus reducing the cost of the simulation. Another example would be in
numerical weather prediction
Numerical weather prediction (NWP) uses mathematical models of the atmosphere and oceans to weather forecasting, predict the weather based on current weather conditions. Though first attempted in the 1920s, it was not until the advent of comput ...
, where it is more important to have accurate predictions over developing highly nonlinear phenomena, such as
tropical cyclone
A tropical cyclone is a rapidly rotating storm system with a low-pressure area, a closed low-level atmospheric circulation, strong winds, and a spiral arrangement of thunderstorms that produce heavy rain and squalls. Depending on its locat ...
s in the atmosphere or
eddies
In fluid dynamics, an eddy is the swirling of a fluid and the reverse current created when the fluid is in a turbulent flow regime. The moving fluid creates a space devoid of downstream-flowing fluid on the downstream side of the object. Fluid ...
in the ocean, rather than relatively calm areas.
A clear, detailed, and practical presentation of this approach can be found in the textbook ''The Finite Element Method for Engineers''.
History
While it is difficult to quote the date of the invention of FEM, the method originated from the need to solve complex
elasticity and
structural analysis
Structural analysis is a branch of solid mechanics which uses simplified models for solids like bars, beams and shells for engineering decision making. Its main objective is to determine the effect of loads on physical structures and their c ...
problems in
civil and
aeronautical engineering
Aerospace engineering is the primary field of engineering concerned with the development of aircraft and spacecraft. It has two major and overlapping branches: aeronautical engineering and astronautical engineering. Avionics engineering is s ...
. Its development can be traced back to work by
Alexander Hrennikoff and
Richard Courant
Richard Courant (January 8, 1888 – January 27, 1972) was a German-American mathematician. He is best known by the general public for the book '' What is Mathematics?'', co-written with Herbert Robbins. His research focused on the areas of real ...
in the early 1940s. Another pioneer was
Ioannis Argyris. In the USSR, the introduction of the practical application of FEM is usually connected with
Leonard Oganesyan. It was also independently rediscovered in China by
Feng Kang in the late 1950s and early 1960s, based on the computations of dam constructions, where it was called the "
finite difference method
In numerical analysis, finite-difference methods (FDM) are a class of numerical techniques for solving differential equations by approximating Derivative, derivatives with Finite difference approximation, finite differences. Both the spatial doma ...
" based on variation principles. Although the approaches used by these pioneers are different, they share one essential characteristic: the
mesh
Medical Subject Headings (MeSH) is a comprehensive controlled vocabulary for the purpose of indexing journal articles and books in the life sciences. It serves as a thesaurus of index terms that facilitates searching. Created and updated by th ...
discretization
In applied mathematics, discretization is the process of transferring continuous functions, models, variables, and equations into discrete counterparts. This process is usually carried out as a first step toward making them suitable for numeri ...
of a continuous domain into a set of discrete sub-domains, usually called elements.
Hrennikoff's work discretizes the domain by using a
lattice analogy, while Courant's approach divides the domain into finite triangular sub-regions to solve
second-order
Second-order may refer to:
Mathematics
* Second order approximation, an approximation that includes quadratic terms
* Second-order arithmetic, an axiomatization allowing quantification of sets of numbers
* Second-order differential equation, a d ...
elliptic partial differential equation
In mathematics, an elliptic partial differential equation is a type of partial differential equation (PDE). In mathematical modeling, elliptic PDEs are frequently used to model steady states, unlike parabolic PDE and hyperbolic PDE which gene ...
s that arise from the problem of the
torsion of a
cylinder
A cylinder () has traditionally been a three-dimensional solid, one of the most basic of curvilinear geometric shapes. In elementary geometry, it is considered a prism with a circle as its base.
A cylinder may also be defined as an infinite ...
. Courant's contribution was evolutionary, drawing on a large body of earlier results for PDEs developed by
Lord Rayleigh
John William Strutt, 3rd Baron Rayleigh ( ; 12 November 1842 – 30 June 1919), was an English physicist who received the Nobel Prize in Physics in 1904 "for his investigations of the densities of the most important gases and for his discovery ...
,
Walther Ritz
Walther Heinrich Wilhelm Ritz (22 February 1878 – 7 July 1909) was a Swiss theoretical physicist. He is most famous for his work with Johannes Rydberg on the Rydberg–Ritz combination principle. Ritz is also known for the variational method n ...
, and
Boris Galerkin
Boris Grigoryevich Galerkin (, surname more accurately romanized as Galyorkin; –12 July 1945) was a Soviet mathematician and an engineer.
Biography
Early life
Galerkin was born on in Polotsk, Vitebsk Governorate, Russian Empire, now part of ...
.
The application of FEM gained momentum in the 1960s and 1970s due to the developments of
J. H. Argyris and his co-workers at the
University of Stuttgart
The University of Stuttgart () is a research university located in Stuttgart, Germany. It was founded in 1829 and is organized into 10 faculties. It is one of the oldest technical universities in Germany with programs in civil, mechanical, ind ...
;
R. W. Clough and his co-workers at
University of California Berkeley
The University of California, Berkeley (UC Berkeley, Berkeley, Cal, or California), is a public land-grant research university in Berkeley, California, United States. Founded in 1868 and named after the Anglo-Irish philosopher George Berkeley ...
;
O. C. Zienkiewicz and his co-workers
Ernest Hinton,
Bruce Irons, and others at
Swansea University
Swansea University () is a public university, public research university located in Swansea, Wales, United Kingdom.
It was chartered as University College of Swansea in 1920, as the fourth college of the University of Wales. In 1996, it chang ...
;
Philippe G. Ciarlet at the University of
Paris 6; and
Richard Gallagher and his co-workers at
Cornell University
Cornell University is a Private university, private Ivy League research university based in Ithaca, New York, United States. The university was co-founded by American philanthropist Ezra Cornell and historian and educator Andrew Dickson W ...
. During this period, additional impetus was provided by the available open-source FEM programs. NASA sponsored the original version of
NASTRAN. University of California Berkeley made the finite element programs SAP IV and, later,
OpenSees widely available. In Norway, the ship classification society Det Norske Veritas (now
DNV GL
Det Norske Veritas (DNV), formerly DNV GL, is an international accredited registrar and classification society headquartered in Høvik, Norway. DNV provides services for several industries, including maritime, oil and gas, renewable energy, ...
) developed
Sesam in 1969 for use in the analysis of ships. A rigorous mathematical basis for FEM was provided in 1973 with a publication by
Gilbert Strang
William Gilbert Strang (born November 27, 1934) is an American mathematician known for his contributions to Finite elements, finite element theory, the calculus of variations, wavelet analysis and linear algebra. He has made many contributions ...
and
George Fix. The method has since been generalized for the
numerical modeling of physical systems in a wide variety of
engineering
Engineering is the practice of using natural science, mathematics, and the engineering design process to Problem solving#Engineering, solve problems within technology, increase efficiency and productivity, and improve Systems engineering, s ...
disciplines, such as
electromagnetism
In physics, electromagnetism is an interaction that occurs between particles with electric charge via electromagnetic fields. The electromagnetic force is one of the four fundamental forces of nature. It is the dominant force in the interacti ...
,
heat transfer
Heat transfer is a discipline of thermal engineering that concerns the generation, use, conversion, and exchange of thermal energy (heat) between physical systems. Heat transfer is classified into various mechanisms, such as thermal conduction, ...
, and
fluid dynamics
In physics, physical chemistry and engineering, fluid dynamics is a subdiscipline of fluid mechanics that describes the flow of fluids – liquids and gases. It has several subdisciplines, including (the study of air and other gases in motion ...
.
Technical discussion
The structure of finite element methods
A finite element method is characterized by a
variational formulation, a discretization strategy, one or more solution algorithms, and post-processing procedures.
Examples of the variational formulation are the
Galerkin method
In mathematics, in the area of numerical analysis, Galerkin methods are a family of methods for converting a continuous operator problem, such as a differential equation, commonly in a weak formulation, to a discrete problem by applying linear c ...
, the discontinuous Galerkin method, mixed methods, etc.
A discretization strategy is understood to mean a clearly defined set of procedures that cover (a) the creation of finite element meshes, (b) the definition of basis function on reference elements (also called shape functions), and (c) the mapping of reference elements onto the elements of the mesh. Examples of discretization strategies are the h-version,
p-version,
hp-version,
x-FEM,
isogeometric analysis, etc. Each discretization strategy has certain advantages and disadvantages. A reasonable criterion in selecting a discretization strategy is to realize nearly optimal performance for the broadest set of mathematical models in a particular model class.
Various numerical solution algorithms can be classified into two broad categories; direct and iterative solvers. These algorithms are designed to exploit the sparsity of matrices that depend on the variational formulation and discretization strategy choices.
Post-processing procedures are designed to extract the data of interest from a finite element solution. To meet the requirements of solution verification, postprocessors need to provide for ''a posteriori'' error estimation in terms of the quantities of interest. When the errors of approximation are larger than what is considered acceptable, then the discretization has to be changed either by an automated adaptive process or by the action of the analyst. Some very efficient postprocessors provide for the realization of
superconvergence.
Illustrative problems P1 and P2
The following two problems demonstrate the finite element method.
P1 is a one-dimensional problem
where
is given,
is an unknown function of
, and
is the second derivative of
with respect to
.
P2 is a two-dimensional problem (
Dirichlet problem
In mathematics, a Dirichlet problem asks for a function which solves a specified partial differential equation (PDE) in the interior of a given region that takes prescribed values on the boundary of the region.
The Dirichlet problem can be solved ...
)
where
is a connected open region in the
plane whose boundary
is nice (e.g., a
smooth manifold
In mathematics, a differentiable manifold (also differential manifold) is a type of manifold that is locally similar enough to a vector space to allow one to apply calculus. Any manifold can be described by a collection of charts (atlas). One may ...
or a
polygon
In geometry, a polygon () is a plane figure made up of line segments connected to form a closed polygonal chain.
The segments of a closed polygonal chain are called its '' edges'' or ''sides''. The points where two edges meet are the polygon ...
), and
and
denote the second derivatives with respect to
and
, respectively.
The problem P1 can be solved directly by computing
antiderivative
In calculus, an antiderivative, inverse derivative, primitive function, primitive integral or indefinite integral of a continuous function is a differentiable function whose derivative is equal to the original function . This can be stated ...
s. However, this method of solving the
boundary value problem
In the study of differential equations, a boundary-value problem is a differential equation subjected to constraints called boundary conditions. A solution to a boundary value problem is a solution to the differential equation which also satis ...
(BVP) works only when there is one spatial dimension. It does not generalize to higher-dimensional problems or problems like
. For this reason, we will develop the finite element method for P1 and outline its generalization to P2.
Our explanation will proceed in two steps, which mirror two essential steps one must take to solve a boundary value problem (BVP) using the FEM.
* In the first step, one rephrases the original BVP in its weak form. Little to no computation is usually required for this step. The transformation is done by hand on paper.
* The second step is discretization, where the weak form is discretized in a finite-dimensional space.
After this second step, we have concrete formulae for a large but finite-dimensional linear problem whose solution will approximately solve the original BVP. This finite-dimensional problem is then implemented on a
computer
A computer is a machine that can be Computer programming, programmed to automatically Execution (computing), carry out sequences of arithmetic or logical operations (''computation''). Modern digital electronic computers can perform generic set ...
.
Weak formulation
The first step is to convert P1 and P2 into their equivalent
weak formulation
Weak formulations are important tools for the analysis of mathematical equations that permit the transfer of concepts of linear algebra to solve problems in other fields such as partial differential equations. In a weak formulation, equations or co ...
s.
The weak form of P1
If
solves P1, then for any smooth function
that satisfies the displacement boundary conditions, i.e.
at
and
, we have
Conversely, if
with
satisfies (1) for every smooth function
then one may show that this
will solve P1. The proof is easier for twice continuously differentiable
(
mean value theorem
In mathematics, the mean value theorem (or Lagrange's mean value theorem) states, roughly, that for a given planar arc (geometry), arc between two endpoints, there is at least one point at which the tangent to the arc is parallel to the secant lin ...
) but may be proved in a
distributional sense as well.
We define a new operator or map
by using
integration by parts
In calculus, and more generally in mathematical analysis, integration by parts or partial integration is a process that finds the integral of a product of functions in terms of the integral of the product of their derivative and antiderivati ...
on the right-hand-side of (1):
where we have used the assumption that
.
The weak form of P2
If we integrate by parts using a form of
Green's identities
In mathematics, Green's identities are a set of three identities in vector calculus relating the bulk with the boundary of a region on which differential operators act. They are named after the mathematician George Green, who discovered Green's ...
, we see that if
solves P2, then we may define
for any
by
where
denotes the
gradient
In vector calculus, the gradient of a scalar-valued differentiable function f of several variables is the vector field (or vector-valued function) \nabla f whose value at a point p gives the direction and the rate of fastest increase. The g ...
and
denotes the
dot product
In mathematics, the dot product or scalar productThe term ''scalar product'' means literally "product with a Scalar (mathematics), scalar as a result". It is also used for other symmetric bilinear forms, for example in a pseudo-Euclidean space. N ...
in the two-dimensional plane. Once more
can be turned into an inner product on a suitable space
of once differentiable functions of
that are zero on
. We have also assumed that
(see
Sobolev space
In mathematics, a Sobolev space is a vector space of functions equipped with a norm that is a combination of ''Lp''-norms of the function together with its derivatives up to a given order. The derivatives are understood in a suitable weak sense ...
s). The existence and uniqueness of the solution can also be shown.
A proof outline of the existence and uniqueness of the solution
We can loosely think of
to be the
absolutely continuous
In calculus and real analysis, absolute continuity is a smoothness property of functions that is stronger than continuity and uniform continuity. The notion of absolute continuity allows one to obtain generalizations of the relationship betwe ...
functions of
that are
at
and
(see
Sobolev spaces
In mathematics, a Sobolev space is a vector space of functions equipped with a normed space, norm that is a combination of Lp norm, ''Lp''-norms of the function together with its derivatives up to a given order. The derivatives are understood in a ...
). Such functions are (weakly) once differentiable, and it turns out that the symmetric
bilinear map
In mathematics, a bilinear map is a function combining elements of two vector spaces to yield an element of a third vector space, and is linear in each of its arguments. Matrix multiplication is an example.
A bilinear map can also be defined for ...
then defines an
inner product
In mathematics, an inner product space (or, rarely, a Hausdorff pre-Hilbert space) is a real vector space or a complex vector space with an operation called an inner product. The inner product of two vectors in the space is a scalar, ofte ...
which turns
into a
Hilbert space
In mathematics, a Hilbert space is a real number, real or complex number, complex inner product space that is also a complete metric space with respect to the metric induced by the inner product. It generalizes the notion of Euclidean space. The ...
(a detailed proof is nontrivial). On the other hand, the left-hand-side
is also an inner product, this time on the
Lp space
In mathematics, the spaces are function spaces defined using a natural generalization of the -norm for finite-dimensional vector spaces. They are sometimes called Lebesgue spaces, named after Henri Lebesgue , although according to the Bourba ...
. An application of the
Riesz representation theorem
The Riesz representation theorem, sometimes called the Riesz–Fréchet representation theorem after Frigyes Riesz and Maurice René Fréchet, establishes an important connection between a Hilbert space and its continuous dual space. If the un ...
for Hilbert spaces shows that there is a unique
solving (2) and, therefore, P1. This solution is a-priori only a member of
, but using
elliptic regularity, will be smooth if
is.
Discretization
P1 and P2 are ready to be discretized, which leads to a common sub-problem (3). The basic idea is to replace the infinite-dimensional linear problem:
:Find
such that
:
with a finite-dimensional version:
where
is a finite-dimensional
subspace of
. There are many possible choices for
(one possibility leads to the
spectral method
Spectral methods are a class of techniques used in applied mathematics and scientific computing to numerically solve certain differential equations. The idea is to write the solution of the differential equation as a sum of certain " basis funct ...
). However, we take
as a space of piecewise polynomial functions for the finite element method.
For problem P1
We take the interval
, choose
values of
with
and we define
by:
where we define
and
. Observe that functions in
are not differentiable according to the elementary definition of calculus. Indeed, if
then the derivative is typically not defined at any
,
. However, the derivative exists at every other value of
, and one can use this derivative for
integration by parts
In calculus, and more generally in mathematical analysis, integration by parts or partial integration is a process that finds the integral of a product of functions in terms of the integral of the product of their derivative and antiderivati ...
.
For problem P2
We need
to be a set of functions of
. In the figure on the right, we have illustrated a
triangulation
In trigonometry and geometry, triangulation is the process of determining the location of a point by forming triangles to the point from known points.
Applications
In surveying
Specifically in surveying, triangulation involves only angle m ...
of a 15-sided
polygon
In geometry, a polygon () is a plane figure made up of line segments connected to form a closed polygonal chain.
The segments of a closed polygonal chain are called its '' edges'' or ''sides''. The points where two edges meet are the polygon ...
al region
in the plane (below), and a
piecewise linear function
In mathematics, a piecewise linear or segmented function is a real-valued function of a real variable, whose graph is composed of straight-line segments.
Definition
A piecewise linear function is a function defined on a (possibly unbounded) ...
(above, in color) of this polygon which is linear on each triangle of the triangulation; the space
would consist of functions that are linear on each triangle of the chosen triangulation.
One hopes that as the underlying triangular mesh becomes finer and finer, the solution of the discrete problem (3) will, in some sense, converge to the solution of the original boundary value problem P2. To measure this mesh fineness, the triangulation is indexed by a real-valued parameter
which one takes to be very small. This parameter will be related to the largest or average triangle size in the triangulation. As we refine the triangulation, the space of piecewise linear functions
must also change with
. For this reason, one often reads
instead of
in the literature. Since we do not perform such an analysis, we will not use this notation.
Choosing a basis
To complete the discretization, we must select a
basis of
. In the one-dimensional case, for each control point
we will choose the piecewise linear function
in
whose value is
at
and zero at every
, i.e.,
for
; this basis is a shifted and scaled
tent function. For the two-dimensional case, we choose again one basis function
per vertex
of the triangulation of the planar region
. The function
is the unique function of
whose value is
at
and zero at every
.
Depending on the author, the word "element" in the "finite element method" refers to the domain's triangles, the piecewise linear basis function, or both. So, for instance, an author interested in curved domains might replace the triangles with curved primitives and so might describe the elements as being curvilinear. On the other hand, some authors replace "piecewise linear" with "piecewise quadratic" or even "piecewise polynomial". The author might then say "higher order element" instead of "higher degree polynomial". The finite element method is not restricted to triangles (tetrahedra in 3-d or higher-order simplexes in multidimensional spaces). Still, it can be defined on quadrilateral subdomains (hexahedra, prisms, or pyramids in 3-d, and so on). Higher-order shapes (curvilinear elements) can be defined with polynomial and even non-polynomial shapes (e.g., ellipse or circle).
Examples of methods that use higher degree piecewise polynomial basis functions are the
hp-FEM
hp-FEM is a generalization of the finite element method (FEM) for solving partial differential equations numerical analysis, numerically based on Piecewise, piecewise-polynomial approximations. hp-FEM originates from the discovery by Barna Szabó, ...
and
spectral FEM.
More advanced implementations (adaptive finite element methods) utilize a method to assess the quality of the results (based on error estimation theory) and modify the mesh during the solution aiming to achieve an approximate solution within some bounds from the exact solution of the continuum problem. Mesh adaptivity may utilize various techniques; the most popular are:
* moving nodes (r-adaptivity)
* refining (and unrefined) elements (h-adaptivity)
* changing order of base functions (p-adaptivity)
* combinations of the above (
hp-adaptivity).
Small support of the basis

The primary advantage of this choice of basis is that the inner products
and
will be zero for almost all
.
(The matrix containing
in the
location is known as the
Gramian matrix
In linear algebra, the Gram matrix (or Gramian matrix, Gramian) of a set of vectors v_1,\dots, v_n in an inner product space is the Hermitian matrix of inner products, whose entries are given by the inner product G_ = \left\langle v_i, v_j \right\ ...
.)
In the one dimensional case, the
support of
is the interval