Ritz Method
   HOME

TheInfoList



OR:

The Ritz method is a direct method to find an approximate solution for
boundary value problem In mathematics, in the field of differential equations, a boundary value problem is a differential equation together with a set of additional constraints, called the boundary conditions. A solution to a boundary value problem is a solution to t ...
s. The method is named after
Walther Ritz Walther Heinrich Wilhelm Ritz (22 February 1878 – 7 July 1909) was a Swiss theoretical physicist. He is most famous for his work with Johannes Rydberg on the Rydberg–Ritz combination principle. Ritz is also known for the variational method na ...
, and is also commonly called the
Rayleigh–Ritz method The Rayleigh–Ritz method is a direct numerical method of approximating eigenvalues, originated in the context of solving physical boundary value problems and named after Lord Rayleigh and Walther Ritz. The name Rayleigh–Ritz is being debated v ...
and the Ritz-Galerkin method. In
quantum mechanics Quantum mechanics is a fundamental theory in physics that provides a description of the physical properties of nature at the scale of atoms and subatomic particles. It is the foundation of all quantum physics including quantum chemistry, ...
, a system of particles can be described in terms of an "energy functional" or
Hamiltonian Hamiltonian may refer to: * Hamiltonian mechanics, a function that represents the total energy of a system * Hamiltonian (quantum mechanics), an operator corresponding to the total energy of that system ** Dyall Hamiltonian, a modified Hamiltonian ...
, which will measure the energy of any proposed configuration of said particles. It turns out that certain privileged configurations are more likely than other configurations, and this has to do with the
eigenanalysis In linear algebra, an eigenvector () or characteristic vector of a linear transformation is a nonzero vector that changes at most by a scalar factor when that linear transformation is applied to it. The corresponding eigenvalue, often denoted ...
("analysis of characteristics") of this
Hamiltonian system A Hamiltonian system is a dynamical system governed by Hamilton's equations. In physics, this dynamical system describes the evolution of a physical system such as a planetary system or an electron in an electromagnetic field. These systems can b ...
. Because it is often impossible to analyze all of the infinite configurations of particles to find the one with the least amount of energy, it becomes essential to be able to approximate this Hamiltonian in some way for the purpose of numerical computations. The Ritz method can be used to achieve this goal. In the language of mathematics, it is exactly the
finite element method The finite element method (FEM) is a popular method for numerically solving differential equations arising in engineering and mathematical modeling. Typical problem areas of interest include the traditional fields of structural analysis, heat ...
used to compute the
eigenvector In linear algebra, an eigenvector () or characteristic vector of a linear transformation is a nonzero vector that changes at most by a scalar factor when that linear transformation is applied to it. The corresponding eigenvalue, often denoted b ...
s and
eigenvalue In linear algebra, an eigenvector () or characteristic vector of a linear transformation is a nonzero vector that changes at most by a scalar factor when that linear transformation is applied to it. The corresponding eigenvalue, often denoted b ...
s of a Hamiltonian system.


Definitions

As with other
variational method The calculus of variations (or Variational Calculus) is a field of mathematical analysis that uses variations, which are small changes in functions and functionals, to find maxima and minima of functionals: mappings from a set of functions t ...
s, a trial wave function, \Psi, is tested on the system. This trial function is selected to meet boundary conditions (and any other physical constraints). The exact function is not known; the trial function contains one or more adjustable parameters, which are varied to find a lowest energy configuration. It can be shown that the ground state energy, E_0, satisfies an inequality: E_0 \le \frac. That is, the ground-state energy is less than this value. The trial wave-function will always give an expectation value larger than or equal to the ground-energy. If the trial wave function is known to be
orthogonal In mathematics, orthogonality is the generalization of the geometric notion of ''perpendicularity''. By extension, orthogonality is also used to refer to the separation of specific features of a system. The term also has specialized meanings in ...
to the ground state, then it will provide a boundary for the energy of some excited state. The Ritz ansatz function is a linear combination of ''N'' known basis functions \left\lbrace\Psi_i\right\rbrace, parametrized by unknown coefficients: \Psi = \sum_^N c_i \Psi_i. With a known Hamiltonian, we can write its expected value as \varepsilon = \frac = \frac \equiv \frac. The basis functions are usually not orthogonal, so that the
overlap matrix In chemical bonds, an orbital overlap is the concentration of orbitals on adjacent atoms in the same regions of space. Orbital overlap can lead to bond formation. Linus Pauling explained the importance of orbital overlap in the molecular bond an ...
''S'' has nonzero nondiagonal elements. Either \left\lbrace c_i \right\rbrace or \left\lbrace c_i^* \right\rbrace (the conjugation of the first) can be used to minimize the expectation value. For instance, by making the partial derivatives of \varepsilon over \left\lbrace c_i^* \right\rbrace zero, the following equality is obtained for every ''k'' = 1, 2, ..., ''N'': \frac = \frac = 0, which leads to a set of ''N'' secular equations: \sum_^N c_j \left( H_ - \varepsilon S_ \right) = 0 \quad \text \quad k = 1,2,\dots,N. In the above equations, energy \varepsilon and the coefficients \left\lbrace c_j \right\rbrace are unknown. With respect to ''c'', this is a homogeneous set of linear equations, which has a solution when the
determinant In mathematics, the determinant is a scalar value that is a function of the entries of a square matrix. It characterizes some properties of the matrix and the linear map represented by the matrix. In particular, the determinant is nonzero if and ...
of the coefficients to these unknowns is zero: \det \left( H - \varepsilon S \right) = 0, which in turn is true only for ''N'' values of \varepsilon. Furthermore, since the Hamiltonian is a
hermitian operator In mathematics, a self-adjoint operator on an infinite-dimensional complex vector space ''V'' with inner product \langle\cdot,\cdot\rangle (equivalently, a Hermitian operator in the finite-dimensional case) is a linear map ''A'' (from ''V'' to itse ...
, the ''H'' matrix is also
hermitian {{Short description, none Numerous things are named after the French mathematician Charles Hermite (1822–1901): Hermite * Cubic Hermite spline, a type of third-degree spline * Gauss–Hermite quadrature, an extension of Gaussian quadrature meth ...
and the values of \varepsilon_i will be real. The lowest value among \varepsilon_i (i=1,2,..,N), \varepsilon_0, will be the best approximation to the ground state for the basis functions used. The remaining ''N-1'' energies are estimates of excited state energies. An approximation for the wave function of state ''i'' can be obtained by finding the coefficients \left\lbrace c_j \right\rbrace from the corresponding secular equation.


Applications in mechanical engineering

The Rayleigh–Ritz method is often used in
mechanical engineering Mechanical engineering is the study of physical machines that may involve force and movement. It is an engineering branch that combines engineering physics and mathematics principles with materials science, to design, analyze, manufacture, and ...
for finding the approximate real
resonant frequencies Resonance describes the phenomenon of increased amplitude that occurs when the frequency of an applied periodic force (or a Fourier component of it) is equal or close to a natural frequency of the system on which it acts. When an oscillati ...
of multi
degree of freedom Degrees of freedom (often abbreviated df or DOF) refers to the number of independent variables or parameters of a thermodynamic system. In various scientific fields, the word "freedom" is used to describe the limits to which physical movement or ...
systems, such as
spring mass system In classical mechanics, a harmonic oscillator is a system that, when displaced from its equilibrium position, experiences a restoring force ''F'' proportional to the displacement ''x'': \vec F = -k \vec x, where ''k'' is a positive constan ...
s or
flywheel A flywheel is a mechanical device which uses the conservation of angular momentum to store rotational energy; a form of kinetic energy proportional to the product of its moment of inertia and the square of its rotational speed. In particular, ass ...
s on a shaft with varying
cross section Cross section may refer to: * Cross section (geometry) ** Cross-sectional views in architecture & engineering 3D *Cross section (geology) * Cross section (electronics) * Radar cross section, measure of detectability * Cross section (physics) **Abs ...
. It is an extension of Rayleigh's method. It can also be used for finding buckling loads and post-buckling behaviour for columns. Consider the case whereby we want to find the resonant frequency of oscillation of a system. First, write the oscillation in the form, y(x,t) = Y(x) \cos\omega t with an unknown mode shape Y(x). Next, find the total energy of the system, consisting of a kinetic energy term and a potential energy term. The kinetic energy term involves the square of the time derivative of y(x,t) and thus gains a factor of \omega ^2. Thus, we can calculate the total energy of the system and express it in the following form: E = T + V \equiv A
(x) An emoticon (, , rarely , ), short for "emotion icon", also known simply as an emote, is a pictorial representation of a facial expression using characters—usually punctuation marks, numbers, and letters—to express a person's feelings, ...
\omega^2\sin^2 \omega t + B
(x) An emoticon (, , rarely , ), short for "emotion icon", also known simply as an emote, is a pictorial representation of a facial expression using characters—usually punctuation marks, numbers, and letters—to express a person's feelings, ...
\cos^2 \omega t By conservation of energy, the average kinetic energy must be equal to the average potential energy. Thus, \omega^2 = \frac = R
(x) An emoticon (, , rarely , ), short for "emotion icon", also known simply as an emote, is a pictorial representation of a facial expression using characters—usually punctuation marks, numbers, and letters—to express a person's feelings, ...
/math> which is also known as the
Rayleigh quotient In mathematics, the Rayleigh quotient () for a given complex Hermitian matrix ''M'' and nonzero vector ''x'' is defined as: R(M,x) = . For real matrices and vectors, the condition of being Hermitian reduces to that of being symmetric, and the con ...
. Thus, if we knew the mode shape Y(x), we would be able to calculate A
(x) An emoticon (, , rarely , ), short for "emotion icon", also known simply as an emote, is a pictorial representation of a facial expression using characters—usually punctuation marks, numbers, and letters—to express a person's feelings, ...
/math> and B
(x) An emoticon (, , rarely , ), short for "emotion icon", also known simply as an emote, is a pictorial representation of a facial expression using characters—usually punctuation marks, numbers, and letters—to express a person's feelings, ...
/math>, and in turn get the eigenfrequency. However, we do not yet know the mode shape. In order to find this, we can approximate Y(x) as a combination of a few approximating functions Y_i(x) Y(x) = \sum_^N c_i Y_i(x) where c_1,c_2,\cdots,c_N are constants to be determined. In general, if we choose a random set of c_1,c_2,\cdots,c_N, it will describe a superposition of the actual eigenmodes of the system. However, if we seek c_1,c_2,\cdots,c_N such that the eigenfrequency \omega^2 is minimised, then the mode described by this set of c_1,c_2,\cdots,c_N will be close to the lowest possible actual eigenmode of the system. Thus, this finds the lowest eigenfrequency. If we find eigenmodes orthogonal to this approximated lowest eigenmode, we can approximately find the next few eigenfrequencies as well. In general, we can express A
(x) An emoticon (, , rarely , ), short for "emotion icon", also known simply as an emote, is a pictorial representation of a facial expression using characters—usually punctuation marks, numbers, and letters—to express a person's feelings, ...
/math> and B
(x) An emoticon (, , rarely , ), short for "emotion icon", also known simply as an emote, is a pictorial representation of a facial expression using characters—usually punctuation marks, numbers, and letters—to express a person's feelings, ...
/math> as a collection of terms quadratic in the coefficients c_i: B
(x) An emoticon (, , rarely , ), short for "emotion icon", also known simply as an emote, is a pictorial representation of a facial expression using characters—usually punctuation marks, numbers, and letters—to express a person's feelings, ...
= \sum_i \sum_j c_i c_j K_ = \mathbf^\mathsf K \mathbf A
(x) An emoticon (, , rarely , ), short for "emotion icon", also known simply as an emote, is a pictorial representation of a facial expression using characters—usually punctuation marks, numbers, and letters—to express a person's feelings, ...
= \sum_i \sum_j c_i c_j M_ = \mathbf^\mathsf M \mathbf where K and M are the stiffness matrix and mass matrix of a discrete system respectively. The minimization of \omega^2 becomes: \frac = \frac \frac = 0 Solving this, \mathbf^\mathsf M \mathbf\frac - \mathbf^\mathsf K \mathbf \frac = 0 K \mathbf c - \fracM\mathbf = \mathbf K \mathbf - \omega^2 M \mathbf = \mathbf For a non-trivial solution of c, we require determinant of the matrix coefficient of c to be zero. \det(K - \omega^2 M)=0 This gives a solution for the first ''N'' eigenfrequencies and eigenmodes of the system, with N being the number of approximating functions.


Simple case of double spring-mass system

The following discussion uses the simplest case, where the system has two lumped springs and two lumped masses, and only two mode shapes are assumed. Hence and . A
mode shape A normal mode of a dynamical system is a pattern of motion in which all parts of the system move sinusoidally with the same frequency and with a fixed phase relation. The free motion described by the normal modes takes place at fixed frequencies. ...
is assumed for the system, with two terms, one of which is weighted by a factor ''B'', e.g. ''Y'' =  , 1nbsp;+ ''B'' , −1
Simple harmonic motion In mechanics and physics, simple harmonic motion (sometimes abbreviated ) is a special type of periodic motion of a body resulting from a dynamic equilibrium between an inertial force, proportional to the acceleration of the body away from the ...
theory says that the
velocity Velocity is the directional speed of an object in motion as an indication of its rate of change in position as observed from a particular frame of reference and as measured by a particular standard of time (e.g. northbound). Velocity is a ...
at the time when deflection is zero, is the
angular frequency In physics, angular frequency "''ω''" (also referred to by the terms angular speed, circular frequency, orbital frequency, radian frequency, and pulsatance) is a scalar measure of rotation rate. It refers to the angular displacement per unit tim ...
\omega times the deflection (y) at time of maximum deflection. In this example the
kinetic energy In physics, the kinetic energy of an object is the energy that it possesses due to its motion. It is defined as the work needed to accelerate a body of a given mass from rest to its stated velocity. Having gained this energy during its accele ...
(KE) for each mass is \frac\omega^2 Y_1^2 m_1 etc., and the
potential energy In physics, potential energy is the energy held by an object because of its position relative to other objects, stresses within itself, its electric charge, or other factors. Common types of potential energy include the gravitational potentia ...
(PE) for each
spring Spring(s) may refer to: Common uses * Spring (season), a season of the year * Spring (device), a mechanical device that stores energy * Spring (hydrology), a natural source of water * Spring (mathematics), a geometric surface in the shape of a ...
is \frac k_1 Y_1^2 etc. We also know that without damping, the maximal KE equals the maximal PE. Thus, \sum_^2 \left(\frac \omega^2 Y_i^2 M_i\right)=\sum_^2 \left(\frac K_i Y_i^2\right) The overall amplitude of the mode shape cancels out from each side, always. That is, the actual size of the assumed deflection does not matter, just the mode ''shape''. Mathematical manipulations then obtain an expression for \omega, in terms of B, which can be differentiated with respect to B, to find the minimum, i.e. when d\omega/dB=0. This gives the value of B for which \omega is lowest. This is an upper bound solution for \omega if \omega is hoped to be the predicted fundamental frequency of the system because the mode shape is ''assumed'', but we have found the lowest value of that upper bound, given our assumptions, because B is used to find the optimal 'mix' of the two assumed mode shape functions. There are many tricks with this method, the most important is to try and choose realistic assumed mode shapes. For example, in the case of beam deflection problems it is wise to use a deformed shape that is analytically similar to the expected solution. A quartic may fit most of the easy problems of simply linked beams even if the order of the deformed solution may be lower. The springs and masses do not have to be discrete, they can be continuous (or a mixture), and this method can be easily used in a
spreadsheet A spreadsheet is a computer application for computation, organization, analysis and storage of data in tabular form. Spreadsheets were developed as computerized analogs of paper accounting worksheets. The program operates on data entered in cel ...
to find the natural frequencies of quite complex distributed systems, if you can describe the distributed KE and PE terms easily, or else break the continuous elements up into discrete parts. This method could be used iteratively, adding additional mode shapes to the previous best solution, or you can build up a long expression with many Bs and many mode shapes, and then differentiate them partially.


The relationship with the finite element method

In the language of the finite element method, the matrix H_ is precisely the ''stiffness matrix'' of the Hamiltonian in the piecewise linear element space, and the matrix S_ is the ''mass matrix''. In the language of linear algebra, the value \epsilon is an eigenvalue of the discretized Hamiltonian, and the vector c is a discretized eigenvector.


See also

*
Rayleigh–Ritz method The Rayleigh–Ritz method is a direct numerical method of approximating eigenvalues, originated in the context of solving physical boundary value problems and named after Lord Rayleigh and Walther Ritz. It is used in all applications that involve ...
*
Sturm–Liouville theory In mathematics and its applications, a Sturm–Liouville problem is a second-order linear ordinary differential equation of the form: \frac\!\!\left ,p(x)\frac\right+ q(x)y = -\lambda\, w(x)y, for given functions p(x), q(x) and w(x), together wit ...
*
Hilbert space In mathematics, Hilbert spaces (named after David Hilbert) allow generalizing the methods of linear algebra and calculus from (finite-dimensional) Euclidean vector spaces to spaces that may be infinite-dimensional. Hilbert spaces arise natural ...
*
Galerkin method In mathematics, in the area of numerical analysis, Galerkin methods, named after the Russian mathematician Boris Galerkin, convert a continuous operator problem, such as a differential equation, commonly in a weak formulation, to a discrete proble ...


Sources


Papers

*Walter Ritz (1909) "Über eine neue Methode zur Lösung gewisser Variationsprobleme der mathematischen Physik" ''Journal für die Reine und Angewandte Mathematik'', vol. 135, pages 1–61. Available online at: http://gdz.sub.uni-goettingen.de/no_cache/dms/load/img/?IDDOC=261182 . *J.K. MacDonald, "Successive Approximations by the Rayleigh–Ritz Variation Method", ''Phys. Rev.'' 43 (1933) 830 Available online at: http://journals.aps.org/pr/abstract/10.1103/PhysRev.43.830


External links


Ritz method
in the ''
Encyclopedia of Mathematics The ''Encyclopedia of Mathematics'' (also ''EOM'' and formerly ''Encyclopaedia of Mathematics'') is a large reference work in mathematics. Overview The 2002 version contains more than 8,000 entries covering most areas of mathematics at a graduat ...
'' *{{cite journal , title=From Euler, Ritz, and Galerkin to Modern Computing , last1=Gander , first1=Martin J., last2=Wanner , first2=Gerhard , journal=SIAM Review , year=2012 , volume=54 , issue=4 , pages=627–666 , doi=10.1137/100804036 Perturbation theory Quantum chemistry