Virial theorem

From Wikipedia, the free encyclopedia
Revision as of 15:54, 12 June 2025 by 137.248.121.94 (talk)
Jump to navigation Jump to search

Template:Short description

In mechanics, the virial theorem provides a general equation that relates the average over time of the total kinetic energy of a stable system of discrete particles, bound by a conservative force (where the work done is independent of path), with that of the total potential energy of the system. Mathematically, the theorem states that

T=12k=1N𝐅k𝐫k,

where T is the total kinetic energy of the N particles, Fk represents the force on the kth particle, which is located at position Template:Math, and angle brackets represent the average over time of the enclosed quantity. The word virial for the right-hand side of the equation derives from Script error: No such module "Lang"., the Latin word for "force" or "energy", and was given its technical definition by Rudolf Clausius in 1870.[1]

The significance of the virial theorem is that it allows the average total kinetic energy to be calculated even for very complicated systems that defy an exact solution, such as those considered in statistical mechanics; this average total kinetic energy is related to the temperature of the system by the equipartition theorem. However, the virial theorem does not depend on the notion of temperature and holds even for systems that are not in thermal equilibrium. The virial theorem has been generalized in various ways, most notably to a tensor form.

If the force between any two particles of the system results from a potential energy V(r)=αrn that is proportional to some power n of the interparticle distance r, the virial theorem takes the simple form

2T=nVTOT.

Thus, twice the average total kinetic energy T equals n times the average total potential energy VTOT. Whereas V(r) represents the potential energy between two particles of distance r, VTOT represents the total potential energy of the system, i.e., the sum of the potential energy V(r) over all pairs of particles in the system. A common example of such a system is a star held together by its own gravity, where n=1.

History

In 1870, Rudolf Clausius delivered the lecture "On a Mechanical Theorem Applicable to Heat" to the Association for Natural and Medical Sciences of the Lower Rhine, following a 20-year study of thermodynamics. The lecture stated that the mean vis viva of the system is equal to its virial, or that the average kinetic energy is one half of the average potential energy. The virial theorem can be obtained directly from Lagrange's identityTemplate:Moved resource as applied in classical gravitational dynamics, the original form of which was included in Lagrange's "Essay on the Problem of Three Bodies" published in 1772. Carl Jacobi's generalization of the identity to N bodies and to the present form of Laplace's identity closely resembles the classical virial theorem. However, the interpretations leading to the development of the equations were very different, since at the time of development, statistical dynamics had not yet unified the separate studies of thermodynamics and classical dynamics.[2] The theorem was later utilized, popularized, generalized and further developed by James Clerk Maxwell, Lord Rayleigh, Henri Poincaré, Subrahmanyan Chandrasekhar, Enrico Fermi, Paul Ledoux, Richard Bader and Eugene Parker. Fritz Zwicky was the first to use the virial theorem to deduce the existence of unseen matter, which is now called dark matter. Richard Bader showed that the charge distribution of a total system can be partitioned into its kinetic and potential energies that obey the virial theorem.[3] As another example of its many applications, the virial theorem has been used to derive the Chandrasekhar limit for the stability of white dwarf stars.

Illustrative special case

Consider N=2 particles with equal mass m, acted upon by mutually attractive forces. Suppose the particles are at diametrically opposite points of a circular orbit with radius r. The velocities are 𝐯1(t) and 𝐯2(t)=𝐯1(t), which are normal to forces 𝐅1(t) and 𝐅2(t)=𝐅1(t). The respective magnitudes are fixed at v and F. The average kinetic energy of the system in an interval of time from t1 to t2 is

T=1t2t1t1t2k=1N12mk|𝐯k(t)|2dt=1t2t1t1t2(12m|𝐯1(t)|2+12m|𝐯2(t)|2)dt=mv2.

Taking center of mass as the origin, the particles have positions 𝐫1(t) and 𝐫2(t)=𝐫1(t) with fixed magnitude r. The attractive forces act in opposite directions as positions, so 𝐅1(t)𝐫1(t)=𝐅2(t)𝐫2(t)=Fr. Applying the centripetal force formula F=mv2/r results in

12k=1N𝐅k𝐫k=12(FrFr)=Fr=mv2rr=mv2=T,

as required. Note: If the origin is displaced, then we'd obtain the same result. This is because the dot product of the displacement with equal and opposite forces 𝐅1(t), 𝐅2(t) results in net cancellation.

Statement and derivation

Although the virial theorem depends on averaging the total kinetic and potential energies, the presentation here postpones the averaging to the last step.

For a collection of N point particles, the scalar moment of inertia I about the origin is

I=k=1Nmk|𝐫k|2=k=1Nmkrk2,

where mk and 𝐫k represent the mass and position of the kth particle. rk=|𝐫|k is the position vector magnitude. Consider the scalar

G=k=1N𝐩k𝐫k,

where 𝐩k is the momentum vector of the kth particle.[4] Assuming that the masses are constant, G is one-half the time derivative of this moment of inertia:

12dIdt=12ddtk=1Nmk𝐫k𝐫k=k=1Nmkd𝐫kdt𝐫k=k=1N𝐩k𝐫k=G.

In turn, the time derivative of G is

dGdt=k=1N𝐩kd𝐫kdt+k=1Nd𝐩kdt𝐫k=k=1Nmkd𝐫kdtd𝐫kdt+k=1N𝐅k𝐫k=2T+k=1N𝐅k𝐫k,

where mk is the mass of the kth particle, 𝐅k=d𝐩kdt is the net force on that particle, and T is the total kinetic energy of the system according to the 𝐯k=d𝐫kdt velocity of each particle,

T=12k=1Nmkvk2=12k=1Nmkd𝐫kdtd𝐫kdt.

Connection with the potential energy between particles

The total force 𝐅k on particle k is the sum of all the forces from the other particles j in the system:

𝐅k=j=1N𝐅jk,

where 𝐅jk is the force applied by particle j on particle k. Hence, the virial can be written as

12k=1N𝐅k𝐫k=12k=1Nj=1N𝐅jk𝐫k.

Since no particle acts on itself (i.e., 𝐅jj=0 for 1jN), we split the sum in terms below and above this diagonal and add them together in pairs:

k=1N𝐅k𝐫k=k=1Nj=1N𝐅jk𝐫k=k=2Nj=1k1𝐅jk𝐫k+k=1N1j=k+1N𝐅jk𝐫k=k=2Nj=1k1𝐅jk𝐫k+j=2Nk=1j1𝐅jk𝐫k=k=2Nj=1k1(𝐅jk𝐫k+𝐅kj𝐫j)=k=2Nj=1k1(𝐅jk𝐫k𝐅jk𝐫j)=k=2Nj=1k1𝐅jk(𝐫k𝐫j),

where we have used Newton's third law of motion, i.e., 𝐅jk=𝐅kj (equal and opposite reaction).

It often happens that the forces can be derived from a potential energy Vjk that is a function only of the distance rjk between the point particles j and k. Since the force is the negative gradient of the potential energy, we have in this case

𝐅jk=𝐫kVjk=dVjkdrjk(𝐫k𝐫jrjk),

which is equal and opposite to 𝐅kj=𝐫jVkj=𝐫jVjk, the force applied by particle k on particle j, as may be confirmed by explicit calculation. Hence,

k=1N𝐅k𝐫k=k=2Nj=1k1𝐅jk(𝐫k𝐫j)=k=2Nj=1k1dVjkdrjk|𝐫k𝐫j|2rjk=k=2Nj=1k1dVjkdrjkrjk.

Thus

dGdt=2T+k=1N𝐅k𝐫k=2Tk=2Nj=1k1dVjkdrjkrjk.

Special case of power-law forces

In a common special case, the potential energy V between two particles is proportional to a power n of their distance rij:

Vjk=αrjkn,

where the coefficient α and the exponent n are constants. In such cases, the virial is

12k=1N𝐅k𝐫k=12k=1Nj<kdVjkdrjkrjk=12k=1Nj<knαrjkn1rjk=12k=1Nj<knVjk=n2VTOT,

where

VTOT=k=1Nj<kVjk

is the total potential energy of the system.

Thus

dGdt=2T+k=1N𝐅k𝐫k=2TnVTOT.

For gravitating systems the exponent n=1, giving Lagrange's identity

dGdt=12d2Idt2=2T+VTOT,

which was derived by Joseph-Louis Lagrange and extended by Carl Jacobi.

Time averaging

The average of this derivative over a duration τ is defined as

dGdtτ=1τ0τdGdtdt=1τG(0)G(τ)dG=G(τ)G(0)τ,

from which we obtain the exact equation

dGdtτ=2Tτ+k=1N𝐅k𝐫kτ.

The virial theorem states that if dG/dtτ=0, then

2Tτ=k=1N𝐅k𝐫kτ.

There are many reasons why the average of the time derivative might vanish. One often-cited reason applies to stably bound systems, that is, to systems that hang together forever and whose parameters are finite. In this case, velocities and coordinates of the particles of the system have upper and lower limits, so that Gbound is bounded between two extremes, Gmin and Gmax, and the average goes to zero in the limit of infinite τ:

limτ|dGbounddtτ|=limτ|G(τ)G(0)τ|limτGmaxGminτ=0.

Even if the average of the time derivative of G is only approximately zero, the virial theorem holds to the same degree of approximation.

For power-law forces with an exponent n, the general equation holds:

Tτ=12k=1N𝐅k𝐫kτ=n2VTOTτ.

For gravitational attraction, n=1, and the average kinetic energy equals half of the average negative potential energy:

Tτ=12VTOTτ.

This general result is useful for complex gravitating systems such as planetary systems or galaxies.

A simple application of the virial theorem concerns galaxy clusters. If a region of space is unusually full of galaxies, it is safe to assume that they have been together for a long time, and the virial theorem can be applied. Doppler effect measurements give lower bounds for their relative velocities, and the virial theorem gives a lower bound for the total mass of the cluster, including any dark matter.

If the ergodic hypothesis holds for the system under consideration, the averaging need not be taken over time; an ensemble average can also be taken, with equivalent results.

In quantum mechanics

Although originally derived for classical mechanics, the virial theorem also holds for quantum mechanics, as first shown by Vladimir Fock[5] using the Ehrenfest theorem.

Evaluate the commutator of the Hamiltonian

H=V({Xi})+nPn22mn

with the position operator Xn and the momentum operator

Pn=iddXn

of particle n,

[H,XnPn]=Xn[H,Pn]+[H,Xn]Pn=iXndVdXniPn2mn.

Summing over all particles, one finds that for

Q=nXnPn

the commutator is

i[H,Q]=2TnXndVdXn,

where T=nPn2/2mn is the kinetic energy. The left-hand side of this equation is just dQ/dt, according to the Heisenberg equation of motion. The expectation value >math>\langle dQ/dt\rangle</math> of this time derivative vanishes in a stationary state, leading to the quantum virial theorem:

2T=nXndVdXn.

Pokhozhaev's identity

Script error: No such module "Unsubst". In the field of quantum mechanics, there exists another form of the virial theorem, applicable to localized solutions to the stationary nonlinear Schrödinger equation or Klein–Gordon equation, is Pokhozhaev's identity,[6] also known as Derrick's theorem. Let g(s) be continuous and real-valued, with g(0)=0.

Denote G(s)=0sg(t)dt. Let

uLloc(n),uL2(n),G(u())L1(n),n

be a solution to the equation

2u=g(u),

in the sense of distributions. Then u satisfies the relation

(n22)n|u(x)|2dx=nnG(u(x))dx.

In special relativity

Script error: No such module "Unsubst".For a single particle in special relativity, it is not the case that T=12𝐩𝐯. Instead, it is true that T=(γ1)mc2, where γ is the Lorentz factor

γ=11v2c2,

and β=𝐯c. We have

12𝐩𝐯=12βγmcβc=12γβ2mc2=(γβ22(γ1))T.

The last expression can be simplified to

(1+1β22)T=(γ+12γ)T.

Thus, under the conditions described in earlier sections (including Newton's third law of motion, 𝐅jk=𝐅kj, despite relativity), the time average for N particles with a power law potential is

n2VTOTτ=k=1N(1+1βk22)Tkτ=k=1N(γk+12γk)Tkτ.

In particular, the ratio of kinetic energy to potential energy is no longer fixed, but necessarily falls into an interval:

2TTOTnVTOT[1,2],

where the more relativistic systems exhibit the larger ratios.

Examples

The virial theorem has a particularly simple form for periodic motion. It can be used to perform perturbative calculation for nonlinear oscillators.[7]

It can also be used to study motion in a central potential.[4] If the central potential is of the form Urn, the virial theorem simplifies to T=n2U.Script error: No such module "Unsubst". In particular, for gravitational or electrostatic (Coulomb) attraction, T=12U.

Driven damped harmonic oscillator

Analysis based on Sivardiere, 1986.[7] For a one-dimensional oscillator with mass m, position x, driving force Fcos(ωt), spring constant k, and damping coefficient γ, the equation of motion is

md2xdt2acceleration=kxddspring  γdxdtfriction + Fcos(ωt)ddexternal driving.

When the oscillator has reached a steady state, it performs a stable oscillation x=Xcos(ωt+φ), where X is the amplitude, and φ is the phase angle.

Applying the virial theorem, we have mx˙x˙=kxx+γxx˙Fcos(ωt)x, which simplifies to Fcos(φ)=m(ω02ω2)X, where ω0=k/m is the natural frequency of the oscillator.

To solve the two unknowns, we need another equation. In steady state, the power lost per cycle is equal to the power gained per cycle:

x˙γx˙power dissipated=x˙Fcosωtpower input,

which simplifies to sinφ=γXωF.

Now we have two equations that yield the solution

{X=F2γ2ω2+m2(ω02ω2)2,tanφ=γωm(ω02ω2).

Ideal-gas law

Consider a container filled with an ideal gas consisting of point masses. The only forces applied to the point masses are due to the container walls. In this case, the expression in the virial theorem equals

i𝐅i𝐫i=P𝐧^𝐫dA,

since, by definition, the pressure P is the average force per area exerted by the gas upon the walls, which is normal to the wall. There is a minus sign because 𝐧^ is the unit normal vector pointing outwards, and the force to be used is the one upon the particles by the wall.


Then the virial theorem states that

T=P2𝐧^𝐫dA.

By the divergence theorem, 𝐧^𝐫dA=𝐫dV=3dV=3V.

From equipartition, the average total kinetic energy T=N12mv2=N32kT. Hence, PV=NkT, the ideal gas law.[8]

Dark matter

In 1933, Fritz Zwicky applied the virial theorem to estimate the mass of Coma Cluster, and discovered a discrepancy of mass of about 450, which he explained as due to "dark matter".[9] He refined the analysis in 1937, finding a discrepancy of about 500.[10][11]

Theoretical analysis

He approximated the Coma cluster as a spherical "gas" of N stars of roughly equal mass m, which gives T=12Nmv2. The total gravitational potential energy of the cluster is U=i<jGm2ri,j, giving U=Gm2i<j1/ri,j. Assuming the motion of the stars are all the same over a long enough time (ergodicity), U=12N2Gm21/r.

Zwicky estimated U as the gravitational potential of a uniform ball of constant density, giving U=35GN2m2R.

So by the virial theorem, the total mass of the cluster is

Nm=5v23G1r

Data

Zwicky1933[9] estimated that there are N=800 galaxies in the cluster, each having observed stellar mass m=109M (suggested by Hubble), and the cluster has radius R=106ly. He also measured the radial velocities of the galaxies by doppler shifts in galactic spectra to be vr2=(1000km/s)2. Assuming equipartition of kinetic energy, v2=3vr2.

By the virial theorem, the total mass of the cluster should be 5Rvr2G3.6×1014M. However, the observed mass is Nm=8×1011M, meaning the total mass is 450 times that of observed mass.

Generalizations

Lord Rayleigh published a generalization of the virial theorem in 1900,[12] which was partially reprinted in 1903.[13] Henri Poincaré proved and applied a form of the virial theorem in 1911 to the problem of formation of the Solar System from a proto-stellar cloud (then known as cosmogony).[14] A variational form of the virial theorem was developed in 1945 by Ledoux.[15] A tensor form of the virial theorem was developed by Parker,[16] Chandrasekhar[17] and Fermi.[18] The following generalization of the virial theorem has been established by Pollard in 1964 for the case of the inverse square law:[19][20]Script error: No such module "Unsubst". 2limτ+Tτ=limτ+Uτif and only iflimτ+τ2I(τ)=0. A boundary term otherwise must be added.[21]

Inclusion of electromagnetic fields

The virial theorem can be extended to include electric and magnetic fields. The result is[22]

12d2Idt2+VxkGktd3r=2(T+U)+WE+WMxk(pik+Tik)dSi,

where I is the moment of inertia, G is the momentum density of the electromagnetic field, T is the kinetic energy of the "fluid", U is the random "thermal" energy of the particles, WE and WM are the electric and magnetic energy content of the volume considered. Finally, pik is the fluid-pressure tensor expressed in the local moving coordinate system

pik=ΣnσmσvivkσViVkΣmσnσ,

and Tik is the electromagnetic stress tensor,

Tik=(ε0E22+B22μ0)δik(ε0EiEk+BiBkμ0).

A plasmoid is a finite configuration of magnetic fields and plasma. With the virial theorem it is easy to see that any such configuration will expand if not contained by external forces. In a finite configuration without pressure-bearing walls or magnetic coils, the surface integral will vanish. Since all the other terms on the right hand side are positive, the acceleration of the moment of inertia will also be positive. It is also easy to estimate the expansion time τ. If a total mass M is confined within a radius R, then the moment of inertia is roughly MR2, and the left hand side of the virial theorem is MR2τ2. The terms on the right hand side add up to about pR3, where p is the larger of the plasma pressure or the magnetic pressure. Equating these two terms and solving for τ, we find

τRcs,

where cs is the speed of the ion acoustic wave (or the Alfvén wave, if the magnetic pressure is higher than the plasma pressure). Thus the lifetime of a plasmoid is expected to be on the order of the acoustic (or Alfvén) transit time.

Relativistic uniform system

In case when in the physical system the pressure field, the electromagnetic and gravitational fields are taken into account, as well as the field of particles’ acceleration, the virial theorem is written in the relativistic form as follows:[23]

Wk0.6k=1N𝐅k𝐫k,

where the value Wk=γcT exceeds the kinetic energy of the particles T by a factor equal to the Lorentz factor γc of the particles at the center of the system. Under normal conditions we can assume that γc1, then we can see that in the virial theorem the kinetic energy is related to the potential energy not by the coefficient 12, but rather by the coefficient close to 0.6. The difference from the classical case arises due to considering the pressure field and the field of particles’ acceleration inside the system, while the derivative of the scalar G is not equal to zero and should be considered as the material derivative.

An analysis of the integral theorem of generalized virial makes it possible to find, on the basis of field theory, a formula for the root-mean-square speed of typical particles of a system without using the notion of temperature:[24]

vrms=c14πηρ0r2c2γc2sin2(rc4πηρ0),

where c is the speed of light, η is the acceleration field constant, ρ0 is the mass density of particles, r is the current radius.

Unlike the virial theorem for particles, for the electromagnetic field the virial theorem is written as follows:[25]

Ekf+2Wf=0,

where the energy Ekf=Aαjαgdx1dx2dx3 considered as the kinetic field energy associated with four-current jα, and

Wf=14μ0FαβFαβgdx1dx2dx3

sets the potential field energy found through the components of the electromagnetic tensor.

In astrophysics

The virial theorem is frequently applied in astrophysics, especially relating the gravitational potential energy of a system to its kinetic or thermal energy. Some common virial relations are Script error: No such module "Unsubst". 35GMR=32kBTmp=12v2 for a mass M, radius R, velocity v, and temperature T. The constants are Newton's constant G, the Boltzmann constant kB, and proton mass mp. Note that these relations are only approximate, and often the leading numerical factors (e.g. 35 or 12) are neglected entirely.

Galaxies and cosmology (virial mass and radius)

Script error: No such module "Labelled list hatnote". In astronomy, the mass and size of a galaxy (or general overdensity) is often defined in terms of the "virial mass" and "virial radius" respectively. Because galaxies and overdensities in continuous fluids can be highly extended (even to infinity in some models, such as an isothermal sphere), it can be hard to define specific, finite measures of their mass and size. The virial theorem, and related concepts, provide an often convenient means by which to quantify these properties.

In galaxy dynamics, the mass of a galaxy is often inferred by measuring the rotation velocity of its gas and stars, assuming circular Keplerian orbits. Using the virial theorem, the velocity dispersion σ can be used in a similar way. Taking the kinetic energy (per particle) of the system as T=12v232σ2, and the potential energy (per particle) as U35GMR we can write

GMRσ2.

Here R is the radius at which the velocity dispersion is being measured, and M is the mass within that radius. The virial mass and radius are generally defined for the radius at which the velocity dispersion is a maximum, i.e.

GMvirRvirσmax2.

As numerous approximations have been made, in addition to the approximate nature of these definitions, order-unity proportionality constants are often omitted (as in the above equations). These relations are thus only accurate in an order of magnitude sense, or when used self-consistently.

An alternate definition of the virial mass and radius is often used in cosmology where it is used to refer to the radius of a sphere, centered on a galaxy or a galaxy cluster, within which virial equilibrium holds. Since this radius is difficult to determine observationally, it is often approximated as the radius within which the average density is greater, by a specified factor, than the critical density ρcrit=3H28πG where H is the Hubble parameter and G is the gravitational constant. A common choice for the factor is 200, which corresponds roughly to the typical over-density in spherical top-hat collapse (see Virial mass), in which case the virial radius is approximated as

rvirr200=r,ρ=200ρcrit.

The virial mass is then defined relative to this radius as

MvirM200=43πr2003200ρcrit.

Stars

The virial theorem is applicable to the cores of stars, by establishing a relation between gravitational potential energy and thermal kinetic energy (i.e. temperature). As stars on the main sequence convert hydrogen into helium in their cores, the mean molecular weight of the core increases and it must contract to maintain enough pressure to support its own weight. This contraction decreases its potential energy and, the virial theorem states, increases its thermal energy. The core temperature increases even as energy is lost, effectively a negative specific heat.[26] This continues beyond the main sequence, unless the core becomes degenerate since that causes the pressure to become independent of temperature and the virial relation with n=1 no longer holds.[27]

See also

References

Template:Reflist

Further reading

  • Script error: No such module "citation/CS1".
  • Script error: No such module "citation/CS1".
  • Script error: No such module "Citation/CS1".

External links

  1. Script error: No such module "Citation/CS1".
  2. Script error: No such module "citation/CS1".
  3. Script error: No such module "Citation/CS1".
  4. a b Script error: No such module "citation/CS1".
  5. Script error: No such module "Citation/CS1".
  6. Script error: No such module "Citation/CS1".
  7. a b Script error: No such module "Citation/CS1".
  8. Script error: No such module "citation/CS1".
  9. a b Script error: No such module "Citation/CS1".
  10. Script error: No such module "Citation/CS1".
  11. Script error: No such module "Citation/CS1".
  12. Script error: No such module "Citation/CS1".
  13. Script error: No such module "citation/CS1".
  14. Script error: No such module "citation/CS1".
  15. Script error: No such module "Citation/CS1".
  16. Script error: No such module "Citation/CS1".
  17. Script error: No such module "Citation/CS1".
  18. Script error: No such module "Citation/CS1".
  19. Script error: No such module "Citation/CS1".
  20. Script error: No such module "citation/CS1".
  21. Script error: No such module "Citation/CS1".
  22. Script error: No such module "citation/CS1".
  23. Script error: No such module "Citation/CS1".
  24. Script error: No such module "Citation/CS1".
  25. Script error: No such module "Citation/CS1".
  26. Script error: No such module "citation/CS1".
  27. Script error: No such module "citation/CS1".