Finite element method (FEM) is a popular method for numerically solving differential equations arising in engineering and

mathematical modeling A mathematical model is an abstract and concrete, abstract description of a concrete system using mathematics, mathematical concepts and language of mathematics, language. The process of developing a mathematical model is termed ''mathematical m ...

. Typical problem areas of interest include the traditional fields of

structural analysis Structural analysis is a branch of solid mechanics which uses simplified models for solids like bars, beams and shells for engineering decision making. Its main objective is to determine the effect of loads on physical structures and their c ...

heat transfer Heat transfer is a discipline of thermal engineering that concerns the generation, use, conversion, and exchange of thermal energy (heat) between physical systems. Heat transfer is classified into various mechanisms, such as thermal conduction, ...

fluid flow In physics, physical chemistry and engineering, fluid dynamics is a subdiscipline of fluid mechanics that describes the flow of fluids – liquids and gases. It has several subdisciplines, including (the study of air and other gases in motion ...

, mass transport, and electromagnetic potential. Computers are usually used to perform the calculations required. With high-speed

supercomputer A supercomputer is a type of computer with a high level of performance as compared to a general-purpose computer. The performance of a supercomputer is commonly measured in floating-point operations per second (FLOPS) instead of million instruc ...

s, better solutions can be achieved and are often required to solve the largest and most complex problems. FEM is a general

numerical method In numerical analysis, a numerical method is a mathematical tool designed to solve numerical problems. The implementation of a numerical method with an appropriate convergence check in a programming language is called a numerical algorithm. Mathem ...

for solving

partial differential equations In mathematics, a partial differential equation (PDE) is an equation which involves a multivariable function and one or more of its partial derivatives. The function is often thought of as an "unknown" that solves the equation, similar to how ...

in two- or three-space variables (i.e., some

boundary value problem In the study of differential equations, a boundary-value problem is a differential equation subjected to constraints called boundary conditions. A solution to a boundary value problem is a solution to the differential equation which also satis ...

s). There are also studies about using FEM to solve high-dimensional problems. To solve a problem, FEM subdivides a large system into smaller, simpler parts called finite elements. This is achieved by a particular space

discretization In applied mathematics, discretization is the process of transferring continuous functions, models, variables, and equations into discrete counterparts. This process is usually carried out as a first step toward making them suitable for numeri ...

in the space dimensions, which is implemented by the construction of a

mesh Medical Subject Headings (MeSH) is a comprehensive controlled vocabulary for the purpose of indexing journal articles and books in the life sciences. It serves as a thesaurus of index terms that facilitates searching. Created and updated by th ...

of the object: the numerical domain for the solution that has a finite number of points. FEM formulation of a boundary value problem finally results in a system of

algebraic equation In mathematics, an algebraic equation or polynomial equation is an equation of the form P = 0, where ''P'' is a polynomial with coefficients in some field, often the field of the rational numbers. For example, x^5-3x+1=0 is an algebraic equati ...

s. The method approximates the unknown function over the domain. The simple equations that model these finite elements are then assembled into a larger system of equations that models the entire problem. FEM then approximates a solution by minimizing an associated error function via the

calculus of variations The calculus of variations (or variational calculus) is a field of mathematical analysis that uses variations, which are small changes in Function (mathematics), functions and functional (mathematics), functionals, to find maxima and minima of f ...

. Studying or analyzing a phenomenon with FEM is often referred to as finite element analysis (FEA).

Basic concepts

The subdivision of a whole domain into simpler parts has several advantages: * Accurate representation of complex geometry; * Inclusion of dissimilar material properties; * Easy representation of the total solution; and * Capture of local effects. A typical approach using the method involves the following steps: # Dividing the domain of the problem into a collection of subdomains, with each subdomain represented by a set of element equations for the original problem. # Systematically recombining all sets of element equations into a global system of equations for the final calculation. The global system of equations uses known solution techniques and can be calculated from the

initial value In multivariable calculus, an initial value problem (IVP) is an ordinary differential equation together with an initial condition which specifies the value of the unknown function at a given point in the domain. Modeling a system in physics or ...

s of the original problem to obtain a numerical answer. In the first step above, the element equations are simple equations that locally approximate the original complex equations to be studied, where the original equations are often

partial differential equation In mathematics, a partial differential equation (PDE) is an equation which involves a multivariable function and one or more of its partial derivatives. The function is often thought of as an "unknown" that solves the equation, similar to ho ...

s (PDEs). To explain the approximation of this process, FEM is commonly introduced as a special case of the

Galerkin method In mathematics, in the area of numerical analysis, Galerkin methods are a family of methods for converting a continuous operator problem, such as a differential equation, commonly in a weak formulation, to a discrete problem by applying linear c ...

. The process, in mathematical language, is to construct an integral of the

inner product In mathematics, an inner product space (or, rarely, a Hausdorff pre-Hilbert space) is a real vector space or a complex vector space with an operation called an inner product. The inner product of two vectors in the space is a scalar, ofte ...

of the residual and the

weight function A weight function is a mathematical device used when performing a sum, integral, or average to give some elements more "weight" or influence on the result than other elements in the same set. The result of this application of a weight function is ...

s; then, set the integral to zero. In simple terms, it is a procedure that minimizes the approximation error by fitting trial functions into the PDE. The residual is the error caused by the trial functions, and the weight functions are

polynomial In mathematics, a polynomial is a Expression (mathematics), mathematical expression consisting of indeterminate (variable), indeterminates (also called variable (mathematics), variables) and coefficients, that involves only the operations of addit ...

approximation functions that project the residual. The process eliminates all the spatial derivatives from the PDE, thus approximating the PDE locally using the following: * a set of

algebraic equations In mathematics, an algebraic equation or polynomial equation is an equation of the form P = 0, where ''P'' is a polynomial with coefficients in some field, often the field of the rational numbers. For example, x^5-3x+1=0 is an algebraic equation ...

for

steady-state In systems theory, a system or a process is in a steady state if the variables (called state variables) which define the behavior of the system or the process are unchanging in time. In continuous time, this means that for those properties ''p'' ...

problems; and * a set of

ordinary differential equation In mathematics, an ordinary differential equation (ODE) is a differential equation (DE) dependent on only a single independent variable (mathematics), variable. As with any other DE, its unknown(s) consists of one (or more) Function (mathematic ...

s for transient problems. These equation sets are element equations. They are

linear In mathematics, the term ''linear'' is used in two distinct senses for two different properties: * linearity of a '' function'' (or '' mapping''); * linearity of a '' polynomial''. An example of a linear function is the function defined by f(x) ...

if the underlying PDE is linear and vice versa. Algebraic equation sets that arise in the steady-state problems are solved using

numerical linear algebra Numerical linear algebra, sometimes called applied linear algebra, is the study of how matrix operations can be used to create computer algorithms which efficiently and accurately provide approximate answers to questions in continuous mathemati ...

ic methods. In contrast,

sets that occur in the transient problems are solved by numerical integrations using standard techniques such as

Euler's method In mathematics and computational science, the Euler method (also called the forward Euler method) is a first-order numerical procedure for solving ordinary differential equations (ODEs) with a given initial value. It is the most basic explic ...

or the Runge –Kutta method. In the second step above, a global system of equations is generated from the element equations by transforming coordinates from the subdomains' local nodes to the domain's global nodes. This spatial transformation includes appropriate orientation adjustments as applied in relation to the reference

coordinate system In geometry, a coordinate system is a system that uses one or more numbers, or coordinates, to uniquely determine and standardize the position of the points or other geometric elements on a manifold such as Euclidean space. The coordinates are ...

. The process is often carried out using FEM software with

coordinate In geometry, a coordinate system is a system that uses one or more numbers, or coordinates, to uniquely determine and standardize the position of the points or other geometric elements on a manifold such as Euclidean space. The coordinates are ...

data generated from the subdomains. The practical application of FEM is known as finite element analysis (FEA). FEA, as applied in

engineering Engineering is the practice of using natural science, mathematics, and the engineering design process to Problem solving#Engineering, solve problems within technology, increase efficiency and productivity, and improve Systems engineering, s ...

, is a computational tool for performing

engineering analysis Engineering analysis involves the application of scientific/mathematical analytic principles and processes to reveal the properties and state of a system, device or mechanism under study. Engineering analysis is decompositional: it proceeds by se ...

. It includes the use of

mesh generation Mesh generation is the practice of creating a polygon mesh, mesh, a subdivision of a continuous geometric space into discrete geometric and topological cells. Often these cells form a simplicial complex. Usually the cells partition the geometric ...

techniques for dividing a complex problem into smaller elements, as well as the use of software coded with a FEM algorithm. When applying FEA, the complex problem is usually a physical system with the underlying

physics Physics is the scientific study of matter, its Elementary particle, fundamental constituents, its motion and behavior through space and time, and the related entities of energy and force. "Physical science is that department of knowledge whi ...

, such as the Euler–Bernoulli beam equation, the

heat equation In mathematics and physics (more specifically thermodynamics), the heat equation is a parabolic partial differential equation. The theory of the heat equation was first developed by Joseph Fourier in 1822 for the purpose of modeling how a quanti ...

, or the

Navier Claude-Louis Navier (born Claude Louis Marie Henri Navier; ; 10 February 1785 – 21 August 1836) was a French civil engineer, affiliated with the French government, and a physicist who specialized in continuum mechanics. The Navier–Stokes eq ...

–Stokes equations, expressed in either PDEs or

integral equation In mathematical analysis, integral equations are equations in which an unknown function appears under an integral sign. In mathematical notation, integral equations may thus be expressed as being of the form: f(x_1,x_2,x_3,\ldots,x_n ; u(x_1,x_2 ...

s, while the divided, smaller elements of the complex problem represent different areas in the physical system. FEA may be used for analyzing problems over complicated domains (e.g., cars and oil pipelines) when the domain changes (e.g., during a solid-state reaction with a moving boundary), when the desired precision varies over the entire domain, or when the solution lacks smoothness. FEA simulations provide a valuable resource, as they remove multiple instances of creating and testing complex prototypes for various high-fidelity situations. For example, in a frontal crash simulation, it is possible to increase prediction accuracy in important areas, like the front of the car, and reduce it in the rear of the car, thus reducing the cost of the simulation. Another example would be in

numerical weather prediction Numerical weather prediction (NWP) uses mathematical models of the atmosphere and oceans to weather forecasting, predict the weather based on current weather conditions. Though first attempted in the 1920s, it was not until the advent of comput ...

, where it is more important to have accurate predictions over developing highly nonlinear phenomena, such as

tropical cyclone A tropical cyclone is a rapidly rotating storm system with a low-pressure area, a closed low-level atmospheric circulation, strong winds, and a spiral arrangement of thunderstorms that produce heavy rain and squalls. Depending on its locat ...

s in the atmosphere or

eddies In fluid dynamics, an eddy is the swirling of a fluid and the reverse current created when the fluid is in a turbulent flow regime. The moving fluid creates a space devoid of downstream-flowing fluid on the downstream side of the object. Fluid ...

in the ocean, rather than relatively calm areas. A clear, detailed, and practical presentation of this approach can be found in the textbook ''The Finite Element Method for Engineers''.

History

While it is difficult to quote the date of the invention of FEM, the method originated from the need to solve complex elasticity and

problems in civil and

aeronautical engineering Aerospace engineering is the primary field of engineering concerned with the development of aircraft and spacecraft. It has two major and overlapping branches: aeronautical engineering and astronautical engineering. Avionics engineering is s ...

. Its development can be traced back to work by Alexander Hrennikoff and

Richard Courant Richard Courant (January 8, 1888 – January 27, 1972) was a German-American mathematician. He is best known by the general public for the book '' What is Mathematics?'', co-written with Herbert Robbins. His research focused on the areas of real ...

in the early 1940s. Another pioneer was Ioannis Argyris. In the USSR, the introduction of the practical application of FEM is usually connected with Leonard Oganesyan. It was also independently rediscovered in China by Feng Kang in the late 1950s and early 1960s, based on the computations of dam constructions, where it was called the "

finite difference method In numerical analysis, finite-difference methods (FDM) are a class of numerical techniques for solving differential equations by approximating Derivative, derivatives with Finite difference approximation, finite differences. Both the spatial doma ...

" based on variation principles. Although the approaches used by these pioneers are different, they share one essential characteristic: the

of a continuous domain into a set of discrete sub-domains, usually called elements. Hrennikoff's work discretizes the domain by using a lattice analogy, while Courant's approach divides the domain into finite triangular sub-regions to solve

second-order Second-order may refer to: Mathematics * Second order approximation, an approximation that includes quadratic terms * Second-order arithmetic, an axiomatization allowing quantification of sets of numbers * Second-order differential equation, a d ...

elliptic partial differential equation In mathematics, an elliptic partial differential equation is a type of partial differential equation (PDE). In mathematical modeling, elliptic PDEs are frequently used to model steady states, unlike parabolic PDE and hyperbolic PDE which gene ...

s that arise from the problem of the torsion of a

cylinder A cylinder () has traditionally been a three-dimensional solid, one of the most basic of curvilinear geometric shapes. In elementary geometry, it is considered a prism with a circle as its base. A cylinder may also be defined as an infinite ...

. Courant's contribution was evolutionary, drawing on a large body of earlier results for PDEs developed by

Lord Rayleigh John William Strutt, 3rd Baron Rayleigh ( ; 12 November 1842 – 30 June 1919), was an English physicist who received the Nobel Prize in Physics in 1904 "for his investigations of the densities of the most important gases and for his discovery ...

Walther Ritz Walther Heinrich Wilhelm Ritz (22 February 1878 – 7 July 1909) was a Swiss theoretical physicist. He is most famous for his work with Johannes Rydberg on the Rydberg–Ritz combination principle. Ritz is also known for the variational method n ...

, and

Boris Galerkin Boris Grigoryevich Galerkin (, surname more accurately romanized as Galyorkin; –12 July 1945) was a Soviet mathematician and an engineer. Biography Early life Galerkin was born on in Polotsk, Vitebsk Governorate, Russian Empire, now part of ...

. The application of FEM gained momentum in the 1960s and 1970s due to the developments of J. H. Argyris and his co-workers at the

University of Stuttgart The University of Stuttgart () is a research university located in Stuttgart, Germany. It was founded in 1829 and is organized into 10 faculties. It is one of the oldest technical universities in Germany with programs in civil, mechanical, ind ...

; R. W. Clough and his co-workers at

University of California Berkeley The University of California, Berkeley (UC Berkeley, Berkeley, Cal, or California), is a public land-grant research university in Berkeley, California, United States. Founded in 1868 and named after the Anglo-Irish philosopher George Berkeley ...

; O. C. Zienkiewicz and his co-workers Ernest Hinton, Bruce Irons, and others at

Swansea University Swansea University () is a public university, public research university located in Swansea, Wales, United Kingdom. It was chartered as University College of Swansea in 1920, as the fourth college of the University of Wales. In 1996, it chang ...

; Philippe G. Ciarlet at the University of Paris 6; and Richard Gallagher and his co-workers at

Cornell University Cornell University is a Private university, private Ivy League research university based in Ithaca, New York, United States. The university was co-founded by American philanthropist Ezra Cornell and historian and educator Andrew Dickson W ...

. During this period, additional impetus was provided by the available open-source FEM programs. NASA sponsored the original version of NASTRAN. University of California Berkeley made the finite element programs SAP IV and, later, OpenSees widely available. In Norway, the ship classification society Det Norske Veritas (now

DNV GL Det Norske Veritas (DNV), formerly DNV GL, is an international accredited registrar and classification society headquartered in Høvik, Norway. DNV provides services for several industries, including maritime, oil and gas, renewable energy, ...

) developed Sesam in 1969 for use in the analysis of ships. A rigorous mathematical basis for FEM was provided in 1973 with a publication by

Gilbert Strang William Gilbert Strang (born November 27, 1934) is an American mathematician known for his contributions to Finite elements, finite element theory, the calculus of variations, wavelet analysis and linear algebra. He has made many contributions ...

and George Fix. The method has since been generalized for the numerical modeling of physical systems in a wide variety of

disciplines, such as

electromagnetism In physics, electromagnetism is an interaction that occurs between particles with electric charge via electromagnetic fields. The electromagnetic force is one of the four fundamental forces of nature. It is the dominant force in the interacti ...

, and

fluid dynamics In physics, physical chemistry and engineering, fluid dynamics is a subdiscipline of fluid mechanics that describes the flow of fluids – liquids and gases. It has several subdisciplines, including (the study of air and other gases in motion ...

Technical discussion

The structure of finite element methods

A finite element method is characterized by a variational formulation, a discretization strategy, one or more solution algorithms, and post-processing procedures. Examples of the variational formulation are the

, the discontinuous Galerkin method, mixed methods, etc. A discretization strategy is understood to mean a clearly defined set of procedures that cover (a) the creation of finite element meshes, (b) the definition of basis function on reference elements (also called shape functions), and (c) the mapping of reference elements onto the elements of the mesh. Examples of discretization strategies are the h-version, p-version, hp-version, x-FEM, isogeometric analysis, etc. Each discretization strategy has certain advantages and disadvantages. A reasonable criterion in selecting a discretization strategy is to realize nearly optimal performance for the broadest set of mathematical models in a particular model class. Various numerical solution algorithms can be classified into two broad categories; direct and iterative solvers. These algorithms are designed to exploit the sparsity of matrices that depend on the variational formulation and discretization strategy choices. Post-processing procedures are designed to extract the data of interest from a finite element solution. To meet the requirements of solution verification, postprocessors need to provide for ''a posteriori'' error estimation in terms of the quantities of interest. When the errors of approximation are larger than what is considered acceptable, then the discretization has to be changed either by an automated adaptive process or by the action of the analyst. Some very efficient postprocessors provide for the realization of superconvergence.

Illustrative problems P1 and P2

The following two problems demonstrate the finite element method. P1 is a one-dimensional problem

\text : \begin
u''(x) = f(x) \text (0,1), \\
u(0) = u(1) = 0,
\end

where

f

is given,

u

is an unknown function of

x

, and

u''

is the second derivative of

u

with respect to

x

. P2 is a two-dimensional problem (

Dirichlet problem In mathematics, a Dirichlet problem asks for a function which solves a specified partial differential equation (PDE) in the interior of a given region that takes prescribed values on the boundary of the region. The Dirichlet problem can be solved ...

)

\text: \begin
u_(x,y)+u_(x,y)=f(x,y) & \text \Omega, \\
u=0 & \text \partial \Omega,
\end

where

\Omega

is a connected open region in the

(x,y)

plane whose boundary

\partial \Omega

is nice (e.g., a

smooth manifold In mathematics, a differentiable manifold (also differential manifold) is a type of manifold that is locally similar enough to a vector space to allow one to apply calculus. Any manifold can be described by a collection of charts (atlas). One may ...

or a

polygon In geometry, a polygon () is a plane figure made up of line segments connected to form a closed polygonal chain. The segments of a closed polygonal chain are called its '' edges'' or ''sides''. The points where two edges meet are the polygon ...

), and

u_

and

u_

denote the second derivatives with respect to

x

and

y

, respectively. The problem P1 can be solved directly by computing

antiderivative In calculus, an antiderivative, inverse derivative, primitive function, primitive integral or indefinite integral of a continuous function is a differentiable function whose derivative is equal to the original function . This can be stated ...

s. However, this method of solving the

(BVP) works only when there is one spatial dimension. It does not generalize to higher-dimensional problems or problems like

u+V''=f

. For this reason, we will develop the finite element method for P1 and outline its generalization to P2. Our explanation will proceed in two steps, which mirror two essential steps one must take to solve a boundary value problem (BVP) using the FEM. * In the first step, one rephrases the original BVP in its weak form. Little to no computation is usually required for this step. The transformation is done by hand on paper. * The second step is discretization, where the weak form is discretized in a finite-dimensional space. After this second step, we have concrete formulae for a large but finite-dimensional linear problem whose solution will approximately solve the original BVP. This finite-dimensional problem is then implemented on a

computer A computer is a machine that can be Computer programming, programmed to automatically Execution (computing), carry out sequences of arithmetic or logical operations (''computation''). Modern digital electronic computers can perform generic set ...

Weak formulation

The first step is to convert P1 and P2 into their equivalent

weak formulation Weak formulations are important tools for the analysis of mathematical equations that permit the transfer of concepts of linear algebra to solve problems in other fields such as partial differential equations. In a weak formulation, equations or co ...

The weak form of P1

u

solves P1, then for any smooth function

v

that satisfies the displacement boundary conditions, i.e.

v=0

x=0

and

x=1

, we have Conversely, if

u

with

u(0) = u(1) = 0

satisfies (1) for every smooth function

v(x)

then one may show that this

u

will solve P1. The proof is easier for twice continuously differentiable

u

(

mean value theorem In mathematics, the mean value theorem (or Lagrange's mean value theorem) states, roughly, that for a given planar arc (geometry), arc between two endpoints, there is at least one point at which the tangent to the arc is parallel to the secant lin ...

) but may be proved in a distributional sense as well. We define a new operator or map

\phi(u,v)

by using

integration by parts In calculus, and more generally in mathematical analysis, integration by parts or partial integration is a process that finds the integral of a product of functions in terms of the integral of the product of their derivative and antiderivati ...

on the right-hand-side of (1): where we have used the assumption that

v(0) = v(1) = 0

The weak form of P2

If we integrate by parts using a form of

Green's identities In mathematics, Green's identities are a set of three identities in vector calculus relating the bulk with the boundary of a region on which differential operators act. They are named after the mathematician George Green, who discovered Green's ...

, we see that if

u

solves P2, then we may define

\phi(u,v)

for any

v

\int_\Omega fv\,ds = -\int_\Omega \nabla u \cdot \nabla v \, ds \equiv -\phi(u,v),

where

\nabla

denotes the

gradient In vector calculus, the gradient of a scalar-valued differentiable function f of several variables is the vector field (or vector-valued function) \nabla f whose value at a point p gives the direction and the rate of fastest increase. The g ...

and

\cdot

denotes the

dot product In mathematics, the dot product or scalar productThe term ''scalar product'' means literally "product with a Scalar (mathematics), scalar as a result". It is also used for other symmetric bilinear forms, for example in a pseudo-Euclidean space. N ...

in the two-dimensional plane. Once more

\,\!\phi

can be turned into an inner product on a suitable space

H_0^1(\Omega)

of once differentiable functions of

\Omega

that are zero on

\partial \Omega

. We have also assumed that

v \in H_0^1(\Omega)

(see

Sobolev space In mathematics, a Sobolev space is a vector space of functions equipped with a norm that is a combination of ''Lp''-norms of the function together with its derivatives up to a given order. The derivatives are understood in a suitable weak sense ...

s). The existence and uniqueness of the solution can also be shown.

A proof outline of the existence and uniqueness of the solution

We can loosely think of

H_0^1(0,1)

to be the

absolutely continuous In calculus and real analysis, absolute continuity is a smoothness property of functions that is stronger than continuity and uniform continuity. The notion of absolute continuity allows one to obtain generalizations of the relationship betwe ...

functions of

(0,1)

that are

0

x=0

and

x=1

(see

Sobolev spaces In mathematics, a Sobolev space is a vector space of functions equipped with a normed space, norm that is a combination of Lp norm, ''Lp''-norms of the function together with its derivatives up to a given order. The derivatives are understood in a ...

). Such functions are (weakly) once differentiable, and it turns out that the symmetric

bilinear map In mathematics, a bilinear map is a function combining elements of two vector spaces to yield an element of a third vector space, and is linear in each of its arguments. Matrix multiplication is an example. A bilinear map can also be defined for ...

\!\,\phi

then defines an

which turns

H_0^1(0,1)

into a

Hilbert space In mathematics, a Hilbert space is a real number, real or complex number, complex inner product space that is also a complete metric space with respect to the metric induced by the inner product. It generalizes the notion of Euclidean space. The ...

(a detailed proof is nontrivial). On the other hand, the left-hand-side

\int_0^1 f(x)v(x)dx

is also an inner product, this time on the

Lp space In mathematics, the spaces are function spaces defined using a natural generalization of the -norm for finite-dimensional vector spaces. They are sometimes called Lebesgue spaces, named after Henri Lebesgue , although according to the Bourba ...

L^2(0,1)

. An application of the

Riesz representation theorem The Riesz representation theorem, sometimes called the Riesz–Fréchet representation theorem after Frigyes Riesz and Maurice René Fréchet, establishes an important connection between a Hilbert space and its continuous dual space. If the un ...

for Hilbert spaces shows that there is a unique

u

solving (2) and, therefore, P1. This solution is a-priori only a member of

H_0^1(0,1)

, but using elliptic regularity, will be smooth if

f

is.

Discretization

P1 and P2 are ready to be discretized, which leads to a common sub-problem (3). The basic idea is to replace the infinite-dimensional linear problem: :Find

u \in  H_0^1

such that :

\forall v \in H_0^1, \; -\phi(u,v)=\int fv

with a finite-dimensional version: where

V

is a finite-dimensional subspace of

H_0^1

. There are many possible choices for

V

(one possibility leads to the

spectral method Spectral methods are a class of techniques used in applied mathematics and scientific computing to numerically solve certain differential equations. The idea is to write the solution of the differential equation as a sum of certain " basis funct ...

). However, we take

V

as a space of piecewise polynomial functions for the finite element method.

For problem P1

We take the interval

(0,1)

, choose

n

values of

x

with

0=x_0 < x_1 < \cdots < x_n < x_=1

and we define

V

by:

V = \

where we define

x_0=0

and

x_=1

. Observe that functions in

V

are not differentiable according to the elementary definition of calculus. Indeed, if

v \in V

then the derivative is typically not defined at any

x = x_k

k = 1,\ldots,n

. However, the derivative exists at every other value of

x

, and one can use this derivative for

For problem P2

We need

V

to be a set of functions of

\Omega

. In the figure on the right, we have illustrated a

triangulation In trigonometry and geometry, triangulation is the process of determining the location of a point by forming triangles to the point from known points. Applications In surveying Specifically in surveying, triangulation involves only angle m ...

of a 15-sided

al region

\Omega

in the plane (below), and a

piecewise linear function In mathematics, a piecewise linear or segmented function is a real-valued function of a real variable, whose graph is composed of straight-line segments. Definition A piecewise linear function is a function defined on a (possibly unbounded) ...

(above, in color) of this polygon which is linear on each triangle of the triangulation; the space

V

would consist of functions that are linear on each triangle of the chosen triangulation. One hopes that as the underlying triangular mesh becomes finer and finer, the solution of the discrete problem (3) will, in some sense, converge to the solution of the original boundary value problem P2. To measure this mesh fineness, the triangulation is indexed by a real-valued parameter

h > 0

which one takes to be very small. This parameter will be related to the largest or average triangle size in the triangulation. As we refine the triangulation, the space of piecewise linear functions

V

must also change with

h

. For this reason, one often reads

V_h

instead of

V

in the literature. Since we do not perform such an analysis, we will not use this notation.

Choosing a basis

To complete the discretization, we must select a basis of

V

. In the one-dimensional case, for each control point

x_k

we will choose the piecewise linear function

v_k

V

whose value is

1

x_k

and zero at every

x_j,\;j \neq k

, i.e.,

\\ 0 & \text, \end

for

k = 1,\dots,n

; this basis is a shifted and scaled tent function. For the two-dimensional case, we choose again one basis function

v_k

per vertex

x_k

of the triangulation of the planar region

\Omega

. The function

v_k

is the unique function of

V

whose value is

1

x_k

and zero at every

x_j,\;j \neq k

. Depending on the author, the word "element" in the "finite element method" refers to the domain's triangles, the piecewise linear basis function, or both. So, for instance, an author interested in curved domains might replace the triangles with curved primitives and so might describe the elements as being curvilinear. On the other hand, some authors replace "piecewise linear" with "piecewise quadratic" or even "piecewise polynomial". The author might then say "higher order element" instead of "higher degree polynomial". The finite element method is not restricted to triangles (tetrahedra in 3-d or higher-order simplexes in multidimensional spaces). Still, it can be defined on quadrilateral subdomains (hexahedra, prisms, or pyramids in 3-d, and so on). Higher-order shapes (curvilinear elements) can be defined with polynomial and even non-polynomial shapes (e.g., ellipse or circle). Examples of methods that use higher degree piecewise polynomial basis functions are the

hp-FEM hp-FEM is a generalization of the finite element method (FEM) for solving partial differential equations numerical analysis, numerically based on Piecewise, piecewise-polynomial approximations. hp-FEM originates from the discovery by Barna Szabó, ...

and spectral FEM. More advanced implementations (adaptive finite element methods) utilize a method to assess the quality of the results (based on error estimation theory) and modify the mesh during the solution aiming to achieve an approximate solution within some bounds from the exact solution of the continuum problem. Mesh adaptivity may utilize various techniques; the most popular are: * moving nodes (r-adaptivity) * refining (and unrefined) elements (h-adaptivity) * changing order of base functions (p-adaptivity) * combinations of the above ( hp-adaptivity).

Small support of the basis

The primary advantage of this choice of basis is that the inner products

\langle v_j,v_k \rangle = \int_0^1 v_j v_k\,dx

and

\phi(v_j,v_k)=\int_0^1 v_j' v_k'\,dx

will be zero for almost all

j,k

. (The matrix containing

\langle v_j, v_k \rangle

in the

(j,k)

location is known as the

Gramian matrix In linear algebra, the Gram matrix (or Gramian matrix, Gramian) of a set of vectors v_1,\dots, v_n in an inner product space is the Hermitian matrix of inner products, whose entries are given by the inner product G_ = \left\langle v_i, v_j \right\ ...

.) In the one dimensional case, the support of

v_k

is the interval

_,x_/math>. Hence, the integrands of \langle v_j, v_k \rangle and \phi(v_j,v_k) are identically zero whenever, j-k, >1 .

Similarly, in the planar case, if x_j and x_k do not share an edge of the triangulation, then the integrals \int_ v_j v_k\,ds and \int_ \nabla v_j \cdot \nabla v_k\,ds are both zero.

Matrix form of the problem

If we write

u(x) = \sum_^n u_k v_k(x)

and

f(x) = \sum_^n f_k v_k(x)

then problem (3), taking

v(x) = v_j(x)

for

j = 1, \dots, n

, becomes If we denote by

\mathbf

and

\mathbf

the column vectors

(u_1,\dots,u_n)^t

and

(f_1,\dots,f_n)^t

, and if we let

L = (L_)

and

M = (M_)

be matrices whose entries are

L_ = \phi (v_i,v_j)

and

M_ = \int v_i v_j dx

then we may rephrase (4) as It is not necessary to assume

f(x) = \sum_^n f_k v_k(x)

. For a general function

f(x)

, problem (3) with

v(x) = v_j(x)

for

j = 1, \dots, n

becomes actually simpler, since no matrix

M

is used, where

\mathbf = (b_1, \dots, b_n)^t

and

b_j = \int f v_j dx

for

j = 1, \dots, n

. As we have discussed before, most of the entries of

L

and

M

are zero because the basis functions

v_k

have small support. So we now have to solve a linear system in the unknown

\mathbf

where most of the entries of the matrix

L

, which we need to invert, are zero. Such matrices are known as

sparse matrices In numerical analysis and scientific computing, a sparse matrix or sparse array is a matrix (mathematics), matrix in which most of the elements are zero. There is no strict definition regarding the proportion of zero-value elements for a matrix ...

, and there are efficient solvers for such problems (much more efficient than actually inverting the matrix.) In addition,

L

is symmetric and positive definite, so a technique such as the

conjugate gradient method In mathematics, the conjugate gradient method is an algorithm for the numerical solution of particular systems of linear equations, namely those whose matrix is positive-semidefinite. The conjugate gradient method is often implemented as an it ...

is favored. For problems that are not too large, sparse

LU decomposition In numerical analysis and linear algebra, lower–upper (LU) decomposition or factorization factors a matrix as the product of a lower triangular matrix and an upper triangular matrix (see matrix multiplication and matrix decomposition). The produ ...

s and

Cholesky decomposition In linear algebra, the Cholesky decomposition or Cholesky factorization (pronounced ) is a decomposition of a Hermitian, positive-definite matrix into the product of a lower triangular matrix and its conjugate transpose, which is useful for eff ...

s still work well. For instance,

MATLAB MATLAB (an abbreviation of "MATrix LABoratory") is a proprietary multi-paradigm programming language and numeric computing environment developed by MathWorks. MATLAB allows matrix manipulations, plotting of functions and data, implementat ...

's backslash operator (which uses sparse LU, sparse Cholesky, and other factorization methods) can be sufficient for meshes with a hundred thousand vertices. The matrix

L

is usually referred to as the stiffness matrix, while the matrix

M

is dubbed the mass matrix.

General form of the finite element method

In general, the finite element method is characterized by the following process. *One chooses a grid for

\Omega

. In the preceding treatment, the grid consisted of triangles, but one can also use squares or curvilinear polygons. *Then, one chooses basis functions. We used piecewise linear basis functions in our discussion, but it is common to use piecewise polynomial basis functions. Separate consideration is the smoothness of the basis functions. For second-order

elliptic boundary value problem In mathematics, an ellipse is a plane curve surrounding two focal points, such that for all points on the curve, the sum of the two distances to the focal points is a constant. It generalizes a circle, which is the special type of ellipse in ...

s, piecewise polynomial basis function that is merely continuous suffice (i.e., the derivatives are discontinuous.) For higher-order partial differential equations, one must use smoother basis functions. For instance, for a fourth-order problem such as

u_ + u_ = f

, one may use piecewise quadratic basis functions that are

C^1

. Another consideration is the relation of the finite-dimensional space

V

to its infinite-dimensional counterpart in the examples above

H_0^1

. A conforming element method is one in which space

V

is a subspace of the element space for the continuous problem. The example above is such a method. If this condition is not satisfied, we obtain a nonconforming element method, an example of which is the space of piecewise linear functions over the mesh, which are continuous at each edge midpoint. Since these functions are generally discontinuous along the edges, this finite-dimensional space is not a subspace of the original

H_0^1

. Typically, one has an algorithm for subdividing a given mesh. If the primary method for increasing precision is to subdivide the mesh, one has an ''h''-method (''h'' is customarily the diameter of the largest element in the mesh.) In this manner, if one shows that the error with a grid

h

is bounded above by

Ch^p

, for some

C < \infty

and

p > 0

, then one has an order ''p'' method. Under specific hypotheses (for instance, if the domain is convex), a piecewise polynomial of order

d

method will have an error of order

p = d+1

. If instead of making ''h'' smaller, one increases the degree of the polynomials used in the basis function, one has a ''p''-method. If one combines these two refinement types, one obtains an ''hp''-method (

). In the hp-FEM, the polynomial degrees can vary from element to element. High-order methods with large uniform ''p'' are called spectral finite element methods ( SFEM). These are not to be confused with

s. For vector partial differential equations, the basis functions may take values in

\mathbb^n

Various types of finite element methods

AEM

The Applied Element Method or AEM combines features of both FEM and Discrete element method or (DEM).

A-FEM

Yang and Lui introduced the Augmented-Finite Element Method, whose goal was to model the weak and strong discontinuities without needing extra DoFs, as PuM stated.

CutFEM

The Cut Finite Element Approach was developed in 2014. The approach is "to make the discretization as independent as possible of the geometric description and minimize the complexity of mesh generation, while retaining the accuracy and robustness of a standard finite element method."

Generalized finite element method

The generalized finite element method (GFEM) uses local spaces consisting of functions, not necessarily polynomials, that reflect the available information on the unknown solution and thus ensure good local approximation. Then a

partition of unity In mathematics, a partition of unity on a topological space is a Set (mathematics), set of continuous function (topology), continuous functions from to the unit interval ,1such that for every point x\in X: * there is a neighbourhood (mathem ...

is used to “bond” these spaces together to form the approximating subspace. The effectiveness of GFEM has been shown when applied to problems with domains having complicated boundaries, problems with micro-scales, and problems with boundary layers.

Mixed finite element method

The mixed finite element method is a type of finite element method in which extra independent variables are introduced as nodal variables during the discretization of a partial differential equation problem.

Variable – polynomial

The

combines adaptively elements with variable size ''h'' and polynomial degree ''p'' to achieve exceptionally fast, exponential convergence rates.

hpk-FEM

The hpk-FEM combines adaptively elements with variable size ''h'', polynomial degree of the local approximations ''p'', and global differentiability of the local approximations (''k''-1) to achieve the best convergence rates.

XFEM

The extended finite element method (XFEM) is a numerical technique based on the generalized finite element method (GFEM) and the partition of unity method (PUM). It extends the classical finite element method by enriching the solution space for solutions to differential equations with discontinuous functions. Extended finite element methods enrich the approximation space to naturally reproduce the challenging feature associated with the problem of interest: the discontinuity, singularity, boundary layer, etc. It was shown that for some problems, such an embedding of the problem's feature into the approximation space can significantly improve convergence rates and accuracy. Moreover, treating problems with discontinuities with XFEMs suppresses the need to mesh and re-mesh the discontinuity surfaces, thus alleviating the computational costs and projection errors associated with conventional finite element methods at the cost of restricting the discontinuities to mesh edges. Several research codes implement this technique to various degrees: # GetFEM++ # xfem++ # openxfem++ XFEM has also been implemented in codes like Altair Radios, ASTER, Morfeo, and Abaqus. It is increasingly being adopted by other commercial finite element software, with a few plugins and actual core implementations available (ANSYS, SAMCEF, OOFELIE, etc.).

Scaled boundary finite element method (SBFEM)

The introduction of the scaled boundary finite element method (SBFEM) came from Song and Wolf (1997). The SBFEM has been one of the most profitable contributions in the area of numerical analysis of fracture mechanics problems. It is a semi-analytical fundamental-solutionless method combining the advantages of finite element formulations and procedures and boundary element discretization. However, unlike the boundary element method, no fundamental differential solution is required.

S-FEM

The S-FEM, Smoothed Finite Element Methods, is a particular class of numerical simulation algorithms for the simulation of physical phenomena. It was developed by combining mesh-free methods with the finite element method.

Spectral element method

Spectral element methods combine the geometric flexibility of finite elements and the acute accuracy of spectral methods. Spectral methods are the approximate solution of weak-form partial equations based on high-order Lagrangian interpolants and used only with certain quadrature rules.

Meshfree methods

Discontinuous Galerkin methods

Finite element limit analysis

Stretched grid method

Loubignac iteration

Loubignac iteration is an iterative method in finite element methods.

Crystal plasticity finite element method (CPFEM)

The crystal plasticity finite element method (CPFEM) is an advanced numerical tool developed by Franz Roters. Metals can be regarded as crystal aggregates, which behave anisotropy under deformation, such as abnormal stress and strain localization. CPFEM, based on the slip (shear strain rate), can calculate dislocation, crystal orientation, and other texture information to consider crystal anisotropy during the routine. It has been applied in the numerical study of material deformation, surface roughness, fractures, etc.

Virtual element method (VEM)

The virtual element method (VEM), introduced by Beirão da Veiga et al. (2013) as an extension of

mimetic Mimesis (; , ''mīmēsis'') is a term used in literary criticism and philosophy that carries a wide range of meanings, including ''imitatio'', imitation, Similarity (philosophy), similarity, receptivity, representation (arts), representation, m ...

finite difference A finite difference is a mathematical expression of the form . Finite differences (or the associated difference quotients) are often used as approximations of derivatives, such as in numerical differentiation. The difference operator, commonly d ...

(MFD) methods, is a generalization of the standard finite element method for arbitrary element geometries. This allows admission of general polygons (or

polyhedra In geometry, a polyhedron (: polyhedra or polyhedrons; ) is a three-dimensional figure with flat polygonal faces, straight edges and sharp corners or vertices. The term "polyhedron" may refer either to a solid figure or to its boundary su ...

in 3D) that are highly irregular and non-convex in shape. The name ''virtual'' derives from the fact that knowledge of the local shape function basis is not required and is, in fact, never explicitly calculated.

Link with the gradient discretization method

Some types of finite element methods (conforming, nonconforming, mixed finite element methods) are particular cases of the gradient discretization method (GDM). Hence the convergence properties of the GDM, which are established for a series of problems (linear and nonlinear elliptic problems, linear, nonlinear, and degenerate parabolic problems), hold as well for these particular FEMs.

Comparison to the finite difference method

The

(FDM) is an alternative way of approximating solutions of PDEs. The differences between FEM and FDM are: * The most attractive feature of the FEM is its ability to handle complicated geometries (and boundaries) with relative ease. While FDM in its basic form is restricted to handle rectangular shapes and simple alterations thereof, the handling of geometries in FEM is theoretically straightforward. * FDM is not usually used for irregular CAD geometries but more often for rectangular or block-shaped models. * FEM generally allows for more flexible mesh adaptivity than FDM. * The most attractive feature of finite differences is that it is straightforward to implement. * One could consider the FDM a particular case of the FEM approach in several ways. E.g., first-order FEM is identical to FDM for

Poisson's equation Poisson's equation is an elliptic partial differential equation of broad utility in theoretical physics. For example, the solution to Poisson's equation is the potential field caused by a given electric charge or mass density distribution; with t ...

if the problem is discretized by a regular rectangular mesh with each rectangle divided into two triangles. * There are reasons to consider the mathematical foundation of the finite element approximation more sound, for instance, because the quality of the approximation between grid points is poor in FDM. * The quality of a FEM approximation is often higher than in the corresponding FDM approach, but this is highly problem-dependent, and several examples to the contrary can be provided. Generally, FEM is the method of choice in all types of analysis in structural mechanics (i.e., solving for deformation and stresses in solid bodies or dynamics of structures). In contrast,

computational fluid dynamics Computational fluid dynamics (CFD) is a branch of fluid mechanics that uses numerical analysis and data structures to analyze and solve problems that involve fluid dynamics, fluid flows. Computers are used to perform the calculations required ...

(CFD) tend to use FDM or other methods like

finite volume method The finite volume method (FVM) is a method for representing and evaluating partial differential equations in the form of algebraic equations. In the finite volume method, volume integrals in a partial differential equation that contain a divergen ...

(FVM). CFD problems usually require discretization of the problem into a large number of cells/gridpoints (millions and more). Therefore the cost of the solution favors simpler, lower-order approximation within each cell. This is especially true for 'external flow' problems, like airflow around the car, airplane, or weather simulation.

Finite element and fast fourier transform (FFT) methods

Another method used for approximating solutions to a partial differential equation is the

Fast Fourier Transform A fast Fourier transform (FFT) is an algorithm that computes the discrete Fourier transform (DFT) of a sequence, or its inverse (IDFT). A Fourier transform converts a signal from its original domain (often time or space) to a representation in ...

(FFT), where the solution is approximated by a fourier series computed using the FFT. For approximating the mechanical response of materials under stress, FFT is often much faster, but FEM may be more accurate. One example of the respective advantages of the two methods is in simulation of

rolling Rolling is a Motion (physics)#Types of motion, type of motion that combines rotation (commonly, of an Axial symmetry, axially symmetric object) and Translation (geometry), translation of that object with respect to a surface (either one or the ot ...

a sheet of

aluminum Aluminium (or aluminum in North American English) is a chemical element; it has chemical symbol, symbol Al and atomic number 13. It has a density lower than that of other common metals, about one-third that of steel. Aluminium has ...

(an FCC metal), and

drawing Drawing is a Visual arts, visual art that uses an instrument to mark paper or another two-dimensional surface, or a digital representation of such. Traditionally, the instruments used to make a drawing include pencils, crayons, and ink pens, some ...

a wire of

tungsten Tungsten (also called wolfram) is a chemical element; it has symbol W and atomic number 74. It is a metal found naturally on Earth almost exclusively in compounds with other elements. It was identified as a distinct element in 1781 and first ...

(a BCC metal). This simulation did not have a sophisticated shape update algorithm for the FFT method. In both cases, the FFT method was more than 10 times as fast as FEM, but in the wire drawing simulation, where there were large deformations in

grains A grain is a small, hard, dry fruit ( caryopsis) – with or without an attached hull layer – harvested for human or animal consumption. A grain crop is a grain-producing plant. The two main types of commercial grain crops are cereals and le ...

, the FEM method was much more accurate. In the sheet rolling simulation, the results of the two methods were similar. FFT has a larger speed advantage in cases where the boundary conditions are given in the materials strain, and loses some of its efficiency in cases where the stress is used to apply the boundary conditions, as more iterations of the method are needed. The FE and FFT methods can also be combined in a

voxel In computing, a voxel is a representation of a value on a three-dimensional regular grid, akin to the two-dimensional pixel. Voxels are frequently used in the Data visualization, visualization and analysis of medical imaging, medical and scient ...

based method (2) to simulate deformation in materials, where the FE method is used for the macroscale stress and deformation, and the FFT method is used on the microscale to deal with the effects of microscale on the mechanical response. Unlike FEM, FFT methods’ similarities to image processing methods means that an actual image of the microstructure from a microscope can be input to the solver to get a more accurate stress response. Using a real image with FFT avoids meshing the microstructure, which would be required if using FEM simulation of the microstructure, and might be difficult. Because fourier approximations are inherently periodic, FFT can only be used in cases of periodic microstructure, but this is common in real materials. FFT can also be combined with FEM methods by using fourier components as the variational basis for approximating the fields inside an element, which can take advantage of the speed of FFT based solvers.

Application

Various specializations under the umbrella of the mechanical engineering discipline (such as aeronautical, biomechanical, and automotive industries) commonly use integrated FEM in the design and development of their products. Several modern FEM packages include specific components such as thermal, electromagnetic, fluid, and structural working environments. In a structural simulation, FEM helps tremendously in producing stiffness and strength visualizations and minimizing weight, materials, and costs. Human knee joint FE model

This powerful design tool has significantly improved both the standard of engineering designs and the design process methodology in many industrial applications.Hastings, J. K., Juds, M. A., Brauer, J. R., ''Accuracy and Economy of Finite Element Magnetic Analysis'', 33rd Annual National Relay Conference, April 1985. The introduction of FEM has substantially decreased the time to take products from concept to the production line. Testing and development have been accelerated primarily through improved initial prototype designs using FEM. In summary, benefits of FEM include increased accuracy, enhanced design and better insight into critical design parameters, virtual prototyping, fewer hardware prototypes, a faster and less expensive design cycle, increased productivity, and increased revenue. In the 1990s FEM was proposed for use in stochastic modeling for numerically solving probability models and later for reliability assessment. FEM is widely applied for approximating differential equations that describe physical systems. This method is very popular in the community of

Computational fluid dynamics Computational fluid dynamics (CFD) is a branch of fluid mechanics that uses numerical analysis and data structures to analyze and solve problems that involve fluid dynamics, fluid flows. Computers are used to perform the calculations required ...

, and there are many applications for solving

Navier–Stokes equations The Navier–Stokes equations ( ) are partial differential equations which describe the motion of viscous fluid substances. They were named after French engineer and physicist Claude-Louis Navier and the Irish physicist and mathematician Georg ...

with FEM. Recently, the application of FEM has been increasing in the researches of computational plasma. Promising numerical results using FEM for

Magnetohydrodynamics In physics and engineering, magnetohydrodynamics (MHD; also called magneto-fluid dynamics or hydromagnetics) is a model of electrically conducting fluids that treats all interpenetrating particle species together as a single Continuum ...

Vlasov equation In plasma physics, the Vlasov equation is a differential equation describing time evolution of the distribution function of collisionless plasma consisting of charged particles with long-range interaction, such as the Coulomb interaction. The e ...

, and

Schrödinger equation The Schrödinger equation is a partial differential equation that governs the wave function of a non-relativistic quantum-mechanical system. Its discovery was a significant landmark in the development of quantum mechanics. It is named after E ...

have been proposed.

Basic concepts

History

Technical discussion

The structure of finite element methods

Illustrative problems P1 and P2

Weak formulation

The weak form of P1

The weak form of P2

A proof outline of the existence and uniqueness of the solution

Discretization

For problem P1

For problem P2

Choosing a basis

Small support of the basis

Matrix form of the problem

General form of the finite element method

Various types of finite element methods

AEM

A-FEM

CutFEM

Generalized finite element method

Mixed finite element method

Variable – polynomial

hpk-FEM

XFEM

Scaled boundary finite element method (SBFEM)

S-FEM

Spectral element method

Meshfree methods

Discontinuous Galerkin methods

Finite element limit analysis

Stretched grid method

Loubignac iteration

Crystal plasticity finite element method (CPFEM)

Virtual element method (VEM)

Link with the gradient discretization method

Comparison to the finite difference method

Finite element and fast fourier transform (FFT) methods

Application

See also

References

Further reading