- Source: Virial theorem
In statistical mechanics, the virial theorem provides a general equation that relates the average over time of the total kinetic energy of a stable system of discrete particles, bound by a conservative force (where the work done is independent of path) with that of the total potential energy of the system. Mathematically, the theorem states
⟨
T
⟩
=
−
1
2
∑
k
=
1
N
⟨
F
k
⋅
r
k
⟩
{\displaystyle \left\langle T\right\rangle =-{\frac {1}{2}}\,\sum _{k=1}^{N}{\bigl \langle }\mathbf {F} _{k}\cdot \mathbf {r} _{k}{\bigr \rangle }}
where T is the total kinetic energy of the N particles, Fk represents the force on the kth particle, which is located at position rk, and angle brackets represent the average over time of the enclosed quantity. The word virial for the right-hand side of the equation derives from vis, the Latin word for "force" or "energy", and was given its technical definition by Rudolf Clausius in 1870.
The significance of the virial theorem is that it allows the average total kinetic energy to be calculated even for very complicated systems that defy an exact solution, such as those considered in statistical mechanics; this average total kinetic energy is related to the temperature of the system by the equipartition theorem. However, the virial theorem does not depend on the notion of temperature and holds even for systems that are not in thermal equilibrium. The virial theorem has been generalized in various ways, most notably to a tensor form.
If the force between any two particles of the system results from a potential energy V(r) = αrn that is proportional to some power n of the interparticle distance r, the virial theorem takes the simple form
2
⟨
T
⟩
=
n
⟨
V
TOT
⟩
.
{\displaystyle 2\langle T\rangle =n\langle V_{\text{TOT}}\rangle .}
Thus, twice the average total kinetic energy ⟨T⟩ equals n times the average total potential energy ⟨VTOT⟩. Whereas V(r) represents the potential energy between two particles of distance r, VTOT represents the total potential energy of the system, i.e., the sum of the potential energy V(r) over all pairs of particles in the system. A common example of such a system is a star held together by its own gravity, where n equals −1.
History
In 1870, Rudolf Clausius delivered the lecture "On a Mechanical Theorem Applicable to Heat" to the Association for Natural and Medical Sciences of the Lower Rhine, following a 20-year study of thermodynamics. The lecture stated that the mean vis viva of the system is equal to its virial, or that the average kinetic energy is equal to 1/2 the average potential energy. The virial theorem can be obtained directly from Lagrange's identity as applied in classical gravitational dynamics, the original form of which was included in Lagrange's "Essay on the Problem of Three Bodies" published in 1772. Karl Jacobi's generalization of the identity to N bodies and to the present form of Laplace's identity closely resembles the classical virial theorem. However, the interpretations leading to the development of the equations were very different, since at the time of development, statistical dynamics had not yet unified the separate studies of thermodynamics and classical dynamics. The theorem was later utilized, popularized, generalized and further developed by James Clerk Maxwell, Lord Rayleigh, Henri Poincaré, Subrahmanyan Chandrasekhar, Enrico Fermi, Paul Ledoux, Richard Bader and Eugene Parker. Fritz Zwicky was the first to use the virial theorem to deduce the existence of unseen matter, which is now called dark matter. Richard Bader showed the charge distribution of a total system can be partitioned into its kinetic and potential energies that obey the virial theorem. As another example of its many applications, the virial theorem has been used to derive the Chandrasekhar limit for the stability of white dwarf stars.
Illustrative special case
Consider N = 2 particles with equal mass m, acted upon by mutually attractive forces. Suppose the particles are at diametrically opposite points of a circular orbit with radius r. The velocities are v1(t) and v2(t) = −v1(t), which are normal to forces F1(t) and F2(t) = −F1(t). The respective magnitudes are fixed at v and F. The average kinetic energy of the system in an interval of time from t1 to t2 is
⟨
T
⟩
=
1
t
2
−
t
1
∫
t
1
t
2
∑
k
=
1
N
1
2
m
k
|
v
k
(
t
)
|
2
d
t
=
1
t
2
−
t
1
∫
t
1
t
2
(
1
2
m
|
v
1
(
t
)
|
2
+
1
2
m
|
v
2
(
t
)
|
2
)
d
t
=
m
v
2
.
{\displaystyle \langle T\rangle ={\frac {1}{t_{2}-t_{1}}}\int _{t_{1}}^{t_{2}}\sum _{k=1}^{N}{\frac {1}{2}}m_{k}\left|\mathbf {v} _{k}(t)\right|^{2}dt={\frac {1}{t_{2}-t_{1}}}\int _{t_{1}}^{t_{2}}\left({\frac {1}{2}}m|\mathbf {v} _{1}(t)|^{2}+{\frac {1}{2}}m|\mathbf {v} _{2}(t)|^{2}\right)dt=mv^{2}.}
Taking center of mass as the origin, the particles have positions r1(t) and r2(t) = −r1(t) with fixed magnitude r. The attractive forces act in opposite directions as positions, so F1(t) ⋅ r1(t) = F2(t) ⋅ r2(t) = −Fr. Applying the centripetal force formula F = mv2/r results in:
−
1
2
∑
k
=
1
N
⟨
F
k
⋅
r
k
⟩
=
−
1
2
(
−
F
r
−
F
r
)
=
F
r
=
m
v
2
r
⋅
r
=
m
v
2
=
⟨
T
⟩
,
{\displaystyle -{\frac {1}{2}}\sum _{k=1}^{N}{\bigl \langle }\mathbf {F} _{k}\cdot \mathbf {r} _{k}{\bigr \rangle }=-{\frac {1}{2}}(-Fr-Fr)=Fr={\frac {mv^{2}}{r}}\cdot r=mv^{2}=\langle T\rangle ,}
as required. Note: If the origin is displaced then we'd obtain the same result. This is because the dot product of the displacement with equal and opposite forces F1(t), F2(t) results in net cancellation.
Statement and derivation
Although the virial theorem depends on averaging the total kinetic and potential energies, the presentation here postpones the averaging to the last step.
For a collection of N point particles, the scalar moment of inertia I about the origin is defined by the equation
I
=
∑
k
=
1
N
m
k
|
r
k
|
2
=
∑
k
=
1
N
m
k
r
k
2
{\displaystyle I=\sum _{k=1}^{N}m_{k}\left|\mathbf {r} _{k}\right|^{2}=\sum _{k=1}^{N}m_{k}r_{k}^{2}}
where mk and rk represent the mass and position of the kth particle. rk = |rk| is the position vector magnitude. The scalar G is defined by the equation
G
=
∑
k
=
1
N
p
k
⋅
r
k
{\displaystyle G=\sum _{k=1}^{N}\mathbf {p} _{k}\cdot \mathbf {r} _{k}}
where pk is the momentum vector of the kth particle. Assuming that the masses are constant, G is one-half the time derivative of this moment of inertia
1
2
d
I
d
t
=
1
2
d
d
t
∑
k
=
1
N
m
k
r
k
⋅
r
k
=
∑
k
=
1
N
m
k
d
r
k
d
t
⋅
r
k
=
∑
k
=
1
N
p
k
⋅
r
k
=
G
.
{\displaystyle {\begin{aligned}{\frac {1}{2}}{\frac {dI}{dt}}&={\frac {1}{2}}{\frac {d}{dt}}\sum _{k=1}^{N}m_{k}\mathbf {r} _{k}\cdot \mathbf {r} _{k}\\&=\sum _{k=1}^{N}m_{k}\,{\frac {d\mathbf {r} _{k}}{dt}}\cdot \mathbf {r} _{k}\\&=\sum _{k=1}^{N}\mathbf {p} _{k}\cdot \mathbf {r} _{k}=G\,.\end{aligned}}}
In turn, the time derivative of G can be written
d
G
d
t
=
∑
k
=
1
N
p
k
⋅
d
r
k
d
t
+
∑
k
=
1
N
d
p
k
d
t
⋅
r
k
=
∑
k
=
1
N
m
k
d
r
k
d
t
⋅
d
r
k
d
t
+
∑
k
=
1
N
F
k
⋅
r
k
=
2
T
+
∑
k
=
1
N
F
k
⋅
r
k
{\displaystyle {\begin{aligned}{\frac {dG}{dt}}&=\sum _{k=1}^{N}\mathbf {p} _{k}\cdot {\frac {d\mathbf {r} _{k}}{dt}}+\sum _{k=1}^{N}{\frac {d\mathbf {p} _{k}}{dt}}\cdot \mathbf {r} _{k}\\&=\sum _{k=1}^{N}m_{k}{\frac {d\mathbf {r} _{k}}{dt}}\cdot {\frac {d\mathbf {r} _{k}}{dt}}+\sum _{k=1}^{N}\mathbf {F} _{k}\cdot \mathbf {r} _{k}\\&=2T+\sum _{k=1}^{N}\mathbf {F} _{k}\cdot \mathbf {r} _{k}\end{aligned}}}
where mk is the mass of the kth particle, Fk = dpk/dt is the net force on that particle, and T is the total kinetic energy of the system according to the vk = drk/dt velocity of each particle
T
=
1
2
∑
k
=
1
N
m
k
v
k
2
=
1
2
∑
k
=
1
N
m
k
d
r
k
d
t
⋅
d
r
k
d
t
.
{\displaystyle T={\frac {1}{2}}\sum _{k=1}^{N}m_{k}v_{k}^{2}={\frac {1}{2}}\sum _{k=1}^{N}m_{k}{\frac {d\mathbf {r} _{k}}{dt}}\cdot {\frac {d\mathbf {r} _{k}}{dt}}.}
= Connection with the potential energy between particles
=The total force Fk on particle k is the sum of all the forces from the other particles j in the system
F
k
=
∑
j
=
1
N
F
j
k
{\displaystyle \mathbf {F} _{k}=\sum _{j=1}^{N}\mathbf {F} _{jk}}
where Fjk is the force applied by particle j on particle k. Hence, the virial can be written
−
1
2
∑
k
=
1
N
F
k
⋅
r
k
=
−
1
2
∑
k
=
1
N
∑
j
=
1
N
F
j
k
⋅
r
k
.
{\displaystyle -{\frac {1}{2}}\,\sum _{k=1}^{N}\mathbf {F} _{k}\cdot \mathbf {r} _{k}=-{\frac {1}{2}}\,\sum _{k=1}^{N}\sum _{j=1}^{N}\mathbf {F} _{jk}\cdot \mathbf {r} _{k}\,.}
Since no particle acts on itself (i.e., Fjj = 0 for 1 ≤ j ≤ N), we split the sum in terms below and above this diagonal and we add them together in pairs:
∑
k
=
1
N
F
k
⋅
r
k
=
∑
k
=
1
N
∑
j
=
1
N
F
j
k
⋅
r
k
=
∑
k
=
2
N
∑
j
=
1
k
−
1
(
F
j
k
⋅
r
k
+
F
k
j
⋅
r
j
)
=
∑
k
=
2
N
∑
j
=
1
k
−
1
(
F
j
k
⋅
r
k
−
F
j
k
⋅
r
j
)
=
∑
k
=
2
N
∑
j
=
1
k
−
1
F
j
k
⋅
(
r
k
−
r
j
)
{\displaystyle {\begin{aligned}\sum _{k=1}^{N}\mathbf {F} _{k}\cdot \mathbf {r} _{k}&=\sum _{k=1}^{N}\sum _{j=1}^{N}\mathbf {F} _{jk}\cdot \mathbf {r} _{k}=\sum _{k=2}^{N}\sum _{j=1}^{k-1}\left(\mathbf {F} _{jk}\cdot \mathbf {r} _{k}+\mathbf {F} _{kj}\cdot \mathbf {r} _{j}\right)\\&=\sum _{k=2}^{N}\sum _{j=1}^{k-1}\left(\mathbf {F} _{jk}\cdot \mathbf {r} _{k}-\mathbf {F} _{jk}\cdot \mathbf {r} _{j}\right)=\sum _{k=2}^{N}\sum _{j=1}^{k-1}\mathbf {F} _{jk}\cdot \left(\mathbf {r} _{k}-\mathbf {r} _{j}\right)\end{aligned}}}
where we have assumed that Newton's third law of motion holds, i.e., Fjk = −Fkj (equal and opposite reaction).
It often happens that the forces can be derived from a potential energy Vjk that is a function only of the distance rjk between the point particles j and k. Since the force is the negative gradient of the potential energy, we have in this case
F
j
k
=
−
∇
r
k
V
j
k
=
−
d
V
j
k
d
r
j
k
(
r
k
−
r
j
r
j
k
)
,
{\displaystyle \mathbf {F} _{jk}=-\nabla _{\mathbf {r} _{k}}V_{jk}=-{\frac {dV_{jk}}{dr_{jk}}}\left({\frac {\mathbf {r} _{k}-\mathbf {r} _{j}}{r_{jk}}}\right),}
which is equal and opposite to Fkj = −∇rjVkj = −∇rjVjk, the force applied by particle k on particle j, as may be confirmed by explicit calculation. Hence,
∑
k
=
1
N
F
k
⋅
r
k
=
∑
k
=
2
N
∑
j
=
1
k
−
1
F
j
k
⋅
(
r
k
−
r
j
)
=
−
∑
k
=
2
N
∑
j
=
1
k
−
1
d
V
j
k
d
r
j
k
|
r
k
−
r
j
|
2
r
j
k
=
−
∑
k
=
2
N
∑
j
=
1
k
−
1
d
V
j
k
d
r
j
k
r
j
k
.
{\displaystyle {\begin{aligned}\sum _{k=1}^{N}\mathbf {F} _{k}\cdot \mathbf {r} _{k}&=\sum _{k=2}^{N}\sum _{j=1}^{k-1}\mathbf {F} _{jk}\cdot \left(\mathbf {r} _{k}-\mathbf {r} _{j}\right)\\&=-\sum _{k=2}^{N}\sum _{j=1}^{k-1}{\frac {dV_{jk}}{dr_{jk}}}{\frac {|\mathbf {r} _{k}-\mathbf {r} _{j}|^{2}}{r_{jk}}}\\&=-\sum _{k=2}^{N}\sum _{j=1}^{k-1}{\frac {dV_{jk}}{dr_{jk}}}r_{jk}.\end{aligned}}}
Thus, we have
d
G
d
t
=
2
T
+
∑
k
=
1
N
F
k
⋅
r
k
=
2
T
−
∑
k
=
2
N
∑
j
=
1
k
−
1
d
V
j
k
d
r
j
k
r
j
k
.
{\displaystyle {\frac {dG}{dt}}=2T+\sum _{k=1}^{N}\mathbf {F} _{k}\cdot \mathbf {r} _{k}=2T-\sum _{k=2}^{N}\sum _{j=1}^{k-1}{\frac {dV_{jk}}{dr_{jk}}}r_{jk}.}
= Special case of power-law forces
=In a common special case, the potential energy V between two particles is proportional to a power n of their distance rij
V
j
k
=
α
r
j
k
n
,
{\displaystyle V_{jk}=\alpha r_{jk}^{n},}
where the coefficient α and the exponent n are constants. In such cases, the virial is given by the equation
−
1
2
∑
k
=
1
N
F
k
⋅
r
k
=
1
2
∑
k
=
1
N
∑
j
<
k
d
V
j
k
d
r
j
k
r
j
k
=
1
2
∑
k
=
1
N
∑
j
<
k
n
α
r
j
k
n
−
1
r
j
k
=
1
2
∑
k
=
1
N
∑
j
<
k
n
V
j
k
=
n
2
V
TOT
{\displaystyle {\begin{aligned}-{\frac {1}{2}}\,\sum _{k=1}^{N}\mathbf {F} _{k}\cdot \mathbf {r} _{k}&={\frac {1}{2}}\,\sum _{k=1}^{N}\sum _{j
where VTOT is the total potential energy of the system
V
TOT
=
∑
k
=
1
N
∑
j
<
k
V
j
k
.
{\displaystyle V_{\text{TOT}}=\sum _{k=1}^{N}\sum _{j
Thus, we have
d
G
d
t
=
2
T
+
∑
k
=
1
N
F
k
⋅
r
k
=
2
T
−
n
V
TOT
.
{\displaystyle {\frac {dG}{dt}}=2T+\sum _{k=1}^{N}\mathbf {F} _{k}\cdot \mathbf {r} _{k}=2T-nV_{\text{TOT}}\,.}
For gravitating systems the exponent n equals −1, giving Lagrange's identity
d
G
d
t
=
1
2
d
2
I
d
t
2
=
2
T
+
V
TOT
{\displaystyle {\frac {dG}{dt}}={\frac {1}{2}}{\frac {d^{2}I}{dt^{2}}}=2T+V_{\text{TOT}}}
which was derived by Joseph-Louis Lagrange and extended by Carl Jacobi.
= Time averaging
=The average of this derivative over a duration of time, τ, is defined as
⟨
d
G
d
t
⟩
τ
=
1
τ
∫
0
τ
d
G
d
t
d
t
=
1
τ
∫
G
(
0
)
G
(
τ
)
d
G
=
G
(
τ
)
−
G
(
0
)
τ
,
{\displaystyle \left\langle {\frac {dG}{dt}}\right\rangle _{\tau }={\frac {1}{\tau }}\int _{0}^{\tau }{\frac {dG}{dt}}\,dt={\frac {1}{\tau }}\int _{G(0)}^{G(\tau )}\,dG={\frac {G(\tau )-G(0)}{\tau }},}
from which we obtain the exact equation
⟨
d
G
d
t
⟩
τ
=
2
⟨
T
⟩
τ
+
∑
k
=
1
N
⟨
F
k
⋅
r
k
⟩
τ
.
{\displaystyle \left\langle {\frac {dG}{dt}}\right\rangle _{\tau }=2\left\langle T\right\rangle _{\tau }+\sum _{k=1}^{N}\left\langle \mathbf {F} _{k}\cdot \mathbf {r} _{k}\right\rangle _{\tau }.}
The virial theorem states that if ⟨dG/dt⟩τ = 0, then
2
⟨
T
⟩
τ
=
−
∑
k
=
1
N
⟨
F
k
⋅
r
k
⟩
τ
.
{\displaystyle 2\left\langle T\right\rangle _{\tau }=-\sum _{k=1}^{N}\left\langle \mathbf {F} _{k}\cdot \mathbf {r} _{k}\right\rangle _{\tau }.}
There are many reasons why the average of the time derivative might vanish, ⟨dG/dt⟩τ = 0. One often-cited reason applies to stably-bound systems, that is to say systems that hang together forever and whose parameters are finite. In that case, velocities and coordinates of the particles of the system have upper and lower limits so that Gbound, is bounded between two extremes, Gmin and Gmax, and the average goes to zero in the limit of infinite τ:
lim
τ
→
∞
|
⟨
d
G
b
o
u
n
d
d
t
⟩
τ
|
=
lim
τ
→
∞
|
G
(
τ
)
−
G
(
0
)
τ
|
≤
lim
τ
→
∞
G
max
−
G
min
τ
=
0.
{\displaystyle \lim _{\tau \to \infty }\left|\left\langle {\frac {dG^{\mathrm {bound} }}{dt}}\right\rangle _{\tau }\right|=\lim _{\tau \to \infty }\left|{\frac {G(\tau )-G(0)}{\tau }}\right|\leq \lim _{\tau \to \infty }{\frac {G_{\max }-G_{\min }}{\tau }}=0.}
Even if the average of the time derivative of G is only approximately zero, the virial theorem holds to the same degree of approximation.
For power-law forces with an exponent n, the general equation holds:
⟨
T
⟩
τ
=
−
1
2
∑
k
=
1
N
⟨
F
k
⋅
r
k
⟩
τ
=
n
2
⟨
V
TOT
⟩
τ
.
{\displaystyle \langle T\rangle _{\tau }=-{\frac {1}{2}}\sum _{k=1}^{N}\langle \mathbf {F} _{k}\cdot \mathbf {r} _{k}\rangle _{\tau }={\frac {n}{2}}\langle V_{\text{TOT}}\rangle _{\tau }.}
For gravitational attraction, n equals −1 and the average kinetic energy equals half of the average negative potential energy
⟨
T
⟩
τ
=
−
1
2
⟨
V
TOT
⟩
τ
.
{\displaystyle \langle T\rangle _{\tau }=-{\frac {1}{2}}\langle V_{\text{TOT}}\rangle _{\tau }.}
This general result is useful for complex gravitating systems such as solar systems or galaxies.
A simple application of the virial theorem concerns galaxy clusters. If a region of space is unusually full of galaxies, it is safe to assume that they have been together for a long time, and the virial theorem can be applied. Doppler effect measurements give lower bounds for their relative velocities, and the virial theorem gives a lower bound for the total mass of the cluster, including any dark matter.
If the ergodic hypothesis holds for the system under consideration, the averaging need not be taken over time; an ensemble average can also be taken, with equivalent results.
In quantum mechanics
Although originally derived for classical mechanics, the virial theorem also holds for quantum mechanics, as first shown by Fock using the Ehrenfest theorem.
Evaluate the commutator of the Hamiltonian
H
=
V
(
{
X
i
}
)
+
∑
n
P
n
2
2
m
{\displaystyle H=V{\bigl (}\{X_{i}\}{\bigr )}+\sum _{n}{\frac {P_{n}^{2}}{2m}}}
with the position operator Xn and the momentum operator
P
n
=
−
i
ℏ
d
d
X
n
{\displaystyle P_{n}=-i\hbar {\frac {d}{dX_{n}}}}
of particle n,
[
H
,
X
n
P
n
]
=
X
n
[
H
,
P
n
]
+
[
H
,
X
n
]
P
n
=
i
ℏ
X
n
d
V
d
X
n
−
i
ℏ
P
n
2
m
.
{\displaystyle [H,X_{n}P_{n}]=X_{n}[H,P_{n}]+[H,X_{n}]P_{n}=i\hbar X_{n}{\frac {dV}{dX_{n}}}-i\hbar {\frac {P_{n}^{2}}{m}}~.}
Summing over all particles, one finds for
Q
=
∑
n
X
n
P
n
{\displaystyle Q=\sum _{n}X_{n}P_{n}}
the commutator amounts to
i
ℏ
[
H
,
Q
]
=
2
T
−
∑
n
X
n
d
V
d
X
n
{\displaystyle {\frac {i}{\hbar }}[H,Q]=2T-\sum _{n}X_{n}{\frac {dV}{dX_{n}}}}
where
T
=
∑
n
P
n
2
2
m
{\textstyle T=\sum _{n}{\frac {P_{n}^{2}}{2m}}}
is the kinetic energy. The left-hand side of this equation is just dQ/dt, according to the Heisenberg equation of motion. The expectation value ⟨dQ/dt⟩ of this time derivative vanishes in a stationary state, leading to the quantum virial theorem,
2
⟨
T
⟩
=
∑
n
⟨
X
n
d
V
d
X
n
⟩
.
{\displaystyle 2\langle T\rangle =\sum _{n}\left\langle X_{n}{\frac {dV}{dX_{n}}}\right\rangle ~.}
= Pokhozhaev's identity
=In the field of quantum mechanics, there exists another form of the virial theorem, applicable to localized solutions to the stationary nonlinear Schrödinger equation or Klein–Gordon equation, is Pokhozhaev's identity, also known as Derrick's theorem.
Let
g
(
s
)
{\displaystyle g(s)}
be continuous and real-valued, with
g
(
0
)
=
0
{\displaystyle g(0)=0}
.
Denote
G
(
s
)
=
∫
0
s
g
(
t
)
d
t
{\textstyle G(s)=\int _{0}^{s}g(t)\,dt}
.
Let
u
∈
L
l
o
c
∞
(
R
n
)
,
∇
u
∈
L
2
(
R
n
)
,
G
(
u
(
⋅
)
)
∈
L
1
(
R
n
)
,
n
∈
N
,
{\displaystyle u\in L_{\mathrm {loc} }^{\infty }(\mathbb {R} ^{n}),\qquad \nabla u\in L^{2}(\mathbb {R} ^{n}),\qquad G(u(\cdot ))\in L^{1}(\mathbb {R} ^{n}),\qquad n\in \mathbb {N} ,}
be a solution to the equation
−
∇
2
u
=
g
(
u
)
,
{\displaystyle -\nabla ^{2}u=g(u),}
in the sense of distributions.
Then
u
{\displaystyle u}
satisfies the relation
(
n
−
2
2
)
∫
R
n
|
∇
u
(
x
)
|
2
d
x
=
n
∫
R
n
G
(
u
(
x
)
)
d
x
.
{\displaystyle \left({\frac {n-2}{2}}\right)\int _{\mathbb {R} ^{n}}|\nabla u(x)|^{2}\,dx=n\int _{\mathbb {R} ^{n}}G(u(x))\,dx.}
In special relativity
For a single particle in special relativity, it is not the case that T = 1/2p · v. Instead, it is true that T = (γ − 1) mc2, where γ is the Lorentz factor
γ
=
1
1
−
v
2
c
2
{\displaystyle \gamma ={\frac {1}{\sqrt {1-{\frac {v^{2}}{c^{2}}}}}}}
and β = v/c. We have,
1
2
p
⋅
v
=
1
2
β
γ
m
c
⋅
β
c
=
1
2
γ
β
2
m
c
2
=
(
γ
β
2
2
(
γ
−
1
)
)
T
.
{\displaystyle {\begin{aligned}{\frac {1}{2}}\mathbf {p} \cdot \mathbf {v} &={\frac {1}{2}}{\boldsymbol {\beta }}\gamma mc\cdot {\boldsymbol {\beta }}c\\[5pt]&={\frac {1}{2}}\gamma \beta ^{2}mc^{2}\\[5pt]&=\left({\frac {\gamma \beta ^{2}}{2(\gamma -1)}}\right)T\,.\end{aligned}}}
The last expression can be simplified to
(
1
+
1
−
β
2
2
)
T
or
(
γ
+
1
2
γ
)
T
{\displaystyle \left({\frac {1+{\sqrt {1-\beta ^{2}}}}{2}}\right)T\qquad {\text{or}}\qquad \left({\frac {\gamma +1}{2\gamma }}\right)T}
.
Thus, under the conditions described in earlier sections (including Newton's third law of motion, Fjk = −Fkj, despite relativity), the time average for N particles with a power law potential is
n
2
⟨
V
T
O
T
⟩
τ
=
⟨
∑
k
=
1
N
(
1
+
1
−
β
k
2
2
)
T
k
⟩
τ
=
⟨
∑
k
=
1
N
(
γ
k
+
1
2
γ
k
)
T
k
⟩
τ
.
{\displaystyle {\frac {n}{2}}\left\langle V_{\mathrm {TOT} }\right\rangle _{\tau }=\left\langle \sum _{k=1}^{N}\left({\frac {1+{\sqrt {1-\beta _{k}^{2}}}}{2}}\right)T_{k}\right\rangle _{\tau }=\left\langle \sum _{k=1}^{N}\left({\frac {\gamma _{k}+1}{2\gamma _{k}}}\right)T_{k}\right\rangle _{\tau }\,.}
In particular, the ratio of kinetic energy to potential energy is no longer fixed, but necessarily falls into an interval:
2
⟨
T
T
O
T
⟩
n
⟨
V
T
O
T
⟩
∈
[
1
,
2
]
,
{\displaystyle {\frac {2\langle T_{\mathrm {TOT} }\rangle }{n\langle V_{\mathrm {TOT} }\rangle }}\in \left[1,2\right]\,,}
where the more relativistic systems exhibit the larger ratios.
Examples
The virial theorem has a particularly simple form for periodic motion. It can be used to perform perturbative calculation for nonlinear oscillators.
It can also be used to study motion in a central potential. If the central potential is of the form
U
∝
r
n
{\displaystyle U\propto r^{n}}
, the virial theorem simplifies to
⟨
T
⟩
=
n
2
⟨
U
⟩
{\displaystyle \langle T\rangle ={\frac {n}{2}}\langle U\rangle }
. In particular, for gravitational or electrostatic (Coulomb) attraction,
⟨
T
⟩
=
−
1
2
⟨
U
⟩
{\displaystyle \langle T\rangle =-{\frac {1}{2}}\langle U\rangle }
.
= Driven damped harmonic oscillator
=Analysis based on. For a one-dimensional oscillator with mass
m
{\displaystyle m}
, position
x
{\displaystyle x}
, driving force
F
cos
(
ω
t
)
{\displaystyle F\cos(\omega t)}
, spring constant
k
{\displaystyle k}
, and damping coefficient
γ
{\displaystyle \gamma }
, the equation of motion is
m
d
2
x
d
t
2
⏟
acceleration
=
−
k
x
⏟
spring
−
γ
d
x
d
t
⏟
friction
+
F
cos
(
ω
t
)
⏟
external driving
{\displaystyle m\underbrace {\frac {d^{2}x}{dt^{2}}} _{\text{acceleration}}=\underbrace {-kx} _{\text{spring}}\underbrace {-\gamma {\frac {dx}{dt}}} _{\text{friction}}\underbrace {+F\cos(\omega t)} _{\text{external driving}}}
When the oscillator has reached a steady state, it performs a stable oscillation
x
=
X
cos
(
ω
t
+
φ
)
{\displaystyle x=X\cos(\omega t+\varphi )}
, where
X
{\displaystyle X}
is the amplitude and
φ
{\displaystyle \varphi }
is the phase angle.
Applying the virial theorem, we have
m
⟨
x
˙
x
˙
⟩
=
k
⟨
x
x
⟩
+
γ
⟨
x
x
˙
⟩
−
F
⟨
cos
(
ω
t
)
x
⟩
{\displaystyle m\langle {\dot {x}}{\dot {x}}\rangle =k\langle xx\rangle +\gamma \langle x{\dot {x}}\rangle -F\langle \cos(\omega t)x\rangle }
, which simplifies to
F
cos
(
φ
)
=
m
(
ω
0
2
−
ω
2
)
X
{\displaystyle F\cos(\varphi )=m(\omega _{0}^{2}-\omega ^{2})X}
, where
ω
0
=
k
/
m
{\displaystyle \omega _{0}={\sqrt {k/m}}}
is the natural frequency of the oscillator.
To solve the two unknowns, we need another equation. In steady state, the power lost per cycle is equal to the power gained per cycle:
⟨
x
˙
γ
x
˙
⟩
⏟
power dissipated
=
⟨
x
˙
F
cos
ω
t
⟩
⏟
power input
{\displaystyle \underbrace {\langle {\dot {x}}\;\gamma {\dot {x}}\rangle } _{\text{power dissipated}}=\underbrace {\langle {\dot {x}}\;F\cos \omega t\rangle } _{\text{power input}}}
, which simplifies to
sin
φ
=
−
γ
X
ω
F
{\displaystyle \sin \varphi =-{\frac {\gamma X\omega }{F}}}
.
Now we have two equations that yield the solution
{
X
=
F
2
γ
2
ω
2
+
m
2
(
ω
0
2
−
ω
2
)
2
tan
φ
=
−
γ
ω
m
(
ω
0
2
−
ω
2
)
{\displaystyle {\begin{cases}X&={\sqrt {\frac {F^{2}}{\gamma ^{2}\omega ^{2}+m^{2}(\omega _{0}^{2}-\omega ^{2})^{2}}}}\\\tan \varphi &=-{\frac {\gamma \omega }{m(\omega _{0}^{2}-\omega ^{2})}}\end{cases}}}
.
= Ideal gas law
=Consider a container filled with an ideal gas consisting of point masses. The force applied to the point masses is the negative of the forces applied to the wall of the container, which is of the form
d
F
=
−
n
^
P
d
A
{\displaystyle d\mathbf {F} =-\mathbf {\hat {n}} PdA}
, where
n
^
{\displaystyle \mathbf {\hat {n}} }
is the unit normal vector pointing outwards. Then the virial theorem states
⟨
T
⟩
=
−
1
2
⟨
∑
i
F
i
⋅
r
i
⟩
=
P
2
∫
n
^
⋅
r
d
A
{\displaystyle \langle T\rangle =-{\frac {1}{2}}{\Bigg \langle }\sum _{i}\mathbf {F} _{i}\cdot \mathbf {r} _{i}{\Bigg \rangle }={\frac {P}{2}}\int \mathbf {\hat {n}} \cdot \mathbf {r} dA}
By the divergence theorem,
∫
n
^
⋅
r
d
A
=
∫
∇
⋅
r
d
V
=
3
∫
d
V
=
3
V
{\textstyle \int \mathbf {\hat {n}} \cdot \mathbf {r} dA=\int \nabla \cdot \mathbf {r} dV=3\int dV=3V}
. And since the average total kinetic energy
⟨
T
⟩
=
N
⟨
1
2
m
v
2
⟩
=
N
⋅
3
2
k
T
{\textstyle \langle T\rangle =N\langle {\frac {1}{2}}mv^{2}\rangle =N\cdot {\frac {3}{2}}kT}
, we have
P
V
=
N
k
T
{\displaystyle PV=NkT}
.
= Dark matter
=In 1933, Fritz Zwicky applied the virial theorem to estimate the mass of Coma Cluster, and discovered a discrepancy of mass of about 450, which he explained as due to "dark matter". He refined the analysis in 1937, finding a discrepancy of about 500.
Theoretical analysis
He approximated the Coma cluster as a spherical "gas" of
N
{\displaystyle N}
stars of roughly equal mass
m
{\displaystyle m}
, which gives
⟨
T
⟩
=
1
2
N
m
⟨
v
2
⟩
{\textstyle \langle T\rangle ={\frac {1}{2}}Nm\langle v^{2}\rangle }
. The total gravitational potential energy of the cluster is
U
=
−
∑
i
<
j
G
m
2
r
i
,
j
{\displaystyle U=-\sum _{i
, giving
⟨
U
⟩
=
−
G
m
2
∑
i
<
j
⟨
1
/
r
i
,
j
⟩
{\textstyle \langle U\rangle =-Gm^{2}\sum _{i
. Assuming the motion of the stars are all the same over a long enough time (ergodicity),
⟨
U
⟩
=
−
1
2
N
2
G
m
2
⟨
1
/
r
⟩
{\textstyle \langle U\rangle =-{\frac {1}{2}}N^{2}Gm^{2}\langle {1}/{r}\rangle }
.
Zwicky estimated
⟨
U
⟩
{\displaystyle \langle U\rangle }
as the gravitational potential of a uniform ball of constant density, giving
⟨
U
⟩
=
−
3
5
G
N
2
m
2
R
{\textstyle \langle U\rangle =-{\frac {3}{5}}{\frac {GN^{2}m^{2}}{R}}}
.
So by the virial theorem, the total mass of the cluster is
N
m
=
5
⟨
v
2
⟩
3
G
⟨
1
r
⟩
{\displaystyle Nm={\frac {5\langle v^{2}\rangle }{3G\langle {\frac {1}{r}}\rangle }}}
Data
Zwicky
1933
{\displaystyle _{1933}}
estimated that there are
N
=
800
{\displaystyle N=800}
galaxies in the cluster, each having observed stellar mass
m
=
10
9
M
⊙
{\displaystyle m=10^{9}M_{\odot }}
(suggested by Hubble), and the cluster has radius
R
=
10
6
ly
{\displaystyle R=10^{6}{\text{ly}}}
. He also measured the radial velocities of the galaxies by doppler shifts in galactic spectra to be
⟨
v
r
2
⟩
=
(
1000
km/s
)
2
{\displaystyle \langle v_{r}^{2}\rangle =(1000{\text{km/s}})^{2}}
. Assuming equipartition of kinetic energy,
⟨
v
2
⟩
=
3
⟨
v
r
2
⟩
{\displaystyle \langle v^{2}\rangle =3\langle v_{r}^{2}\rangle }
.
By the virial theorem, the total mass of the cluster should be
5
R
⟨
v
r
2
⟩
G
≈
3.6
×
10
14
M
⊙
{\displaystyle {\frac {5R\langle v_{r}^{2}\rangle }{G}}\approx 3.6\times 10^{14}M_{\odot }}
. However, the observed mass is
N
m
=
8
×
10
11
M
⊙
{\displaystyle Nm=8\times 10^{11}M_{\odot }}
, meaning the total mass is 450 times that of observed mass.
Generalizations
Lord Rayleigh published a generalization of the virial theorem in 1900 which was partially reprinted in 1903. Henri Poincaré proved and applied a form of the virial theorem in 1911 to the problem of formation of the Solar System from a proto-stellar cloud (then known as cosmogony). A variational form of the virial theorem was developed in 1945 by Ledoux. A tensor form of the virial theorem was developed by Parker, Chandrasekhar and Fermi. The following generalization of the virial theorem has been established by Pollard in 1964 for the case of the inverse square law:
2
lim
τ
→
+
∞
⟨
T
⟩
τ
=
lim
τ
→
+
∞
⟨
U
⟩
τ
if and only if
lim
τ
→
+
∞
τ
−
2
I
(
τ
)
=
0
.
{\displaystyle 2\lim _{\tau \to +\infty }\langle T\rangle _{\tau }=\lim _{\tau \to +\infty }\langle U\rangle _{\tau }\qquad {\text{if and only if}}\quad \lim _{\tau \to +\infty }{\tau }^{-2}I(\tau )=0\,.}
A boundary term otherwise must be added.
Inclusion of electromagnetic fields
The virial theorem can be extended to include electric and magnetic fields. The result is
1
2
d
2
I
d
t
2
+
∫
V
x
k
∂
G
k
∂
t
d
3
r
=
2
(
T
+
U
)
+
W
E
+
W
M
−
∫
x
k
(
p
i
k
+
T
i
k
)
d
S
i
,
{\displaystyle {\frac {1}{2}}{\frac {d^{2}I}{dt^{2}}}+\int _{V}x_{k}{\frac {\partial G_{k}}{\partial t}}\,d^{3}r=2(T+U)+W^{\mathrm {E} }+W^{\mathrm {M} }-\int x_{k}(p_{ik}+T_{ik})\,dS_{i},}
where I is the moment of inertia, G is the momentum density of the electromagnetic field, T is the kinetic energy of the "fluid", U is the random "thermal" energy of the particles, WE and WM are the electric and magnetic energy content of the volume considered. Finally, pik is the fluid-pressure tensor expressed in the local moving coordinate system
p
i
k
=
Σ
n
σ
m
σ
⟨
v
i
v
k
⟩
σ
−
V
i
V
k
Σ
m
σ
n
σ
,
{\displaystyle p_{ik}=\Sigma n^{\sigma }m^{\sigma }\langle v_{i}v_{k}\rangle ^{\sigma }-V_{i}V_{k}\Sigma m^{\sigma }n^{\sigma },}
and Tik is the electromagnetic stress tensor,
T
i
k
=
(
ε
0
E
2
2
+
B
2
2
μ
0
)
δ
i
k
−
(
ε
0
E
i
E
k
+
B
i
B
k
μ
0
)
.
{\displaystyle T_{ik}=\left({\frac {\varepsilon _{0}E^{2}}{2}}+{\frac {B^{2}}{2\mu _{0}}}\right)\delta _{ik}-\left(\varepsilon _{0}E_{i}E_{k}+{\frac {B_{i}B_{k}}{\mu _{0}}}\right).}
A plasmoid is a finite configuration of magnetic fields and plasma. With the virial theorem it is easy to see that any such configuration will expand if not contained by external forces. In a finite configuration without pressure-bearing walls or magnetic coils, the surface integral will vanish. Since all the other terms on the right hand side are positive, the acceleration of the moment of inertia will also be positive. It is also easy to estimate the expansion time τ. If a total mass M is confined within a radius R, then the moment of inertia is roughly MR2, and the left hand side of the virial theorem is MR2/τ2. The terms on the right hand side add up to about pR3, where p is the larger of the plasma pressure or the magnetic pressure. Equating these two terms and solving for τ, we find
τ
∼
R
c
s
,
{\displaystyle \tau \,\sim {\frac {R}{c_{\mathrm {s} }}},}
where cs is the speed of the ion acoustic wave (or the Alfvén wave, if the magnetic pressure is higher than the plasma pressure). Thus the lifetime of a plasmoid is expected to be on the order of the acoustic (or Alfvén) transit time.
Relativistic uniform system
In case when in the physical system the pressure field, the electromagnetic and gravitational fields are taken into account, as well as the field of particles’ acceleration, the virial theorem is written in the relativistic form as follows:
⟨
W
k
⟩
≈
−
0.6
∑
k
=
1
N
⟨
F
k
⋅
r
k
⟩
,
{\displaystyle \left\langle W_{k}\right\rangle \approx -0.6\sum _{k=1}^{N}\langle \mathbf {F} _{k}\cdot \mathbf {r} _{k}\rangle ,}
where the value Wk ≈ γcT exceeds the kinetic energy of the particles T by a factor equal to the Lorentz factor γc of the particles at the center of the system. Under normal conditions we can assume that γc ≈ 1, then we can see that in the virial theorem the kinetic energy is related to the potential energy not by the coefficient 1/2, but rather by the coefficient close to 0.6. The difference from the classical case arises due to considering the pressure field and the field of particles’ acceleration inside the system, while the derivative of the scalar G is not equal to zero and should be considered as the material derivative.
An analysis of the integral theorem of generalized virial makes it possible to find, on the basis of field theory, a formula for the root-mean-square speed of typical particles of a system without using the notion of temperature:
v
r
m
s
=
c
1
−
4
π
η
ρ
0
r
2
c
2
γ
c
2
sin
2
(
r
c
4
π
η
ρ
0
)
,
{\displaystyle v_{\mathrm {rms} }=c{\sqrt {1-{\frac {4\pi \eta \rho _{0}r^{2}}{c^{2}\gamma _{c}^{2}\sin ^{2}\left({\frac {r}{c}}{\sqrt {4\pi \eta \rho _{0}}}\right)}}}},}
where
c
{\displaystyle ~c}
is the speed of light,
η
{\displaystyle ~\eta }
is the acceleration field constant,
ρ
0
{\displaystyle ~\rho _{0}}
is the mass density of particles,
r
{\displaystyle ~r}
is the current radius.
Unlike the virial theorem for particles, for the electromagnetic field the virial theorem is written as follows:
E
k
f
+
2
W
f
=
0
,
{\displaystyle ~E_{kf}+2W_{f}=0,}
where the energy
E
k
f
=
∫
A
α
j
α
−
g
d
x
1
d
x
2
d
x
3
{\textstyle ~E_{kf}=\int A_{\alpha }j^{\alpha }{\sqrt {-g}}\,dx^{1}\,dx^{2}\,dx^{3}}
considered as the kinetic field energy associated with four-current
j
α
{\displaystyle j^{\alpha }}
, and
W
f
=
1
4
μ
0
∫
F
α
β
F
α
β
−
g
d
x
1
d
x
2
d
x
3
{\displaystyle ~W_{f}={\frac {1}{4\mu _{0}}}\int F_{\alpha \beta }F^{\alpha \beta }{\sqrt {-g}}\,dx^{1}\,dx^{2}\,dx^{3}}
sets the potential field energy found through the components of the electromagnetic tensor.
In astrophysics
The virial theorem is frequently applied in astrophysics, especially relating the gravitational potential energy of a system to its kinetic or thermal energy. Some common virial relations are
3
5
G
M
R
=
3
2
k
B
T
m
p
=
1
2
v
2
{\displaystyle {\frac {3}{5}}{\frac {GM}{R}}={\frac {3}{2}}{\frac {k_{\mathrm {B} }T}{m_{\mathrm {p} }}}={\frac {1}{2}}v^{2}}
for a mass M, radius R, velocity v, and temperature T. The constants are Newton's constant G, the Boltzmann constant kB, and proton mass mp. Note that these relations are only approximate, and often the leading numerical factors (e.g. 3/5 or 1/2) are neglected entirely.
= Galaxies and cosmology (virial mass and radius)
=In astronomy, the mass and size of a galaxy (or general overdensity) is often defined in terms of the "virial mass" and "virial radius" respectively. Because galaxies and overdensities in continuous fluids can be highly extended (even to infinity in some models, such as an isothermal sphere), it can be hard to define specific, finite measures of their mass and size. The virial theorem, and related concepts, provide an often convenient means by which to quantify these properties.
In galaxy dynamics, the mass of a galaxy is often inferred by measuring the rotation velocity of its gas and stars, assuming circular Keplerian orbits. Using the virial theorem, the velocity dispersion σ can be used in a similar way. Taking the kinetic energy (per particle) of the system as T = 1/2v2 ~ 3/2σ2, and the potential energy (per particle) as U ~ 3/5 GM/R we can write
G
M
R
≈
σ
2
.
{\displaystyle {\frac {GM}{R}}\approx \sigma ^{2}.}
Here
R
{\displaystyle R}
is the radius at which the velocity dispersion is being measured, and M is the mass within that radius. The virial mass and radius are generally defined for the radius at which the velocity dispersion is a maximum, i.e.
G
M
vir
R
vir
≈
σ
max
2
.
{\displaystyle {\frac {GM_{\text{vir}}}{R_{\text{vir}}}}\approx \sigma _{\max }^{2}.}
As numerous approximations have been made, in addition to the approximate nature of these definitions, order-unity proportionality constants are often omitted (as in the above equations). These relations are thus only accurate in an order of magnitude sense, or when used self-consistently.
An alternate definition of the virial mass and radius is often used in cosmology where it is used to refer to the radius of a sphere, centered on a galaxy or a galaxy cluster, within which virial equilibrium holds. Since this radius is difficult to determine observationally, it is often approximated as the radius within which the average density is greater, by a specified factor, than the critical density
ρ
crit
=
3
H
2
8
π
G
{\displaystyle \rho _{\text{crit}}={\frac {3H^{2}}{8\pi G}}}
where H is the Hubble parameter and G is the gravitational constant. A common choice for the factor is 200, which corresponds roughly to the typical over-density in spherical top-hat collapse (see Virial mass), in which case the virial radius is approximated as
r
vir
≈
r
200
=
r
,
ρ
=
200
⋅
ρ
crit
.
{\displaystyle r_{\text{vir}}\approx r_{200}=r,\qquad \rho =200\cdot \rho _{\text{crit}}.}
The virial mass is then defined relative to this radius as
M
vir
≈
M
200
=
4
3
π
r
200
3
⋅
200
ρ
crit
.
{\displaystyle M_{\text{vir}}\approx M_{200}={\frac {4}{3}}\pi r_{200}^{3}\cdot 200\rho _{\text{crit}}.}
= Stars
=The virial theorem is applicable to the cores of stars, by establishing a relation between gravitational potential energy and thermal kinetic energy (i.e. temperature). As stars on the main sequence convert hydrogen into helium in their cores, the mean molecular weight of the core increases and it must contract to maintain enough pressure to support its own weight. This contraction decreases its potential energy and, the virial theorem states, increases its thermal energy. The core temperature increases even as energy is lost, effectively a negative specific heat. This continues beyond the main sequence, unless the core becomes degenerate since that causes the pressure to become independent of temperature and the virial relation with n equals −1 no longer holds.
See also
Virial coefficient
Virial stress
Virial mass
Chandrasekhar tensor
Chandrasekhar virial equations
Derrick's theorem
Equipartition theorem
Ehrenfest theorem
Pokhozhaev's identity
References
Further reading
Goldstein, H. (1980). Classical Mechanics (2nd ed.). Addison–Wesley. ISBN 978-0-201-02918-5.
Collins, G. W. (1978). The Virial Theorem in Stellar Astrophysics. Pachart Press. Bibcode:1978vtsa.book.....C. ISBN 978-0-912918-13-6.
i̇Pekoğlu, Y.; Turgut, S. (2016). "An elementary derivation of the quantum virial theorem from Hellmann–Feynman theorem". European Journal of Physics. 37 (4): 045405. Bibcode:2016EJPh...37d5405I. doi:10.1088/0143-0807/37/4/045405. S2CID 125030620.
External links
The Virial Theorem at MathPages
Gravitational Contraction and Star Formation, Georgia State University
Kata Kunci Pencarian:
- Rudolf Clausius
- Subrahmanyan Chandrasekhar
- Teorema ekuipartisi
- Virial theorem
- Virial mass
- Equipartition theorem
- Virial expansion
- Density functional theory
- Stellar dynamics
- Virial stress
- Dark matter
- Rudolf Clausius
- Hartree