Laplace-Runge-Lenz vector

From Wikipedia, the free encyclopedia

Throughout this article, vectors and their magnitudes are indicated by boldface and italic type, respectively, e.g., \left| \mathbf{A} \right| = A.
Figure 1: The Laplace-Runge-Lenz vector A (shown in red) at four points (labeled 1, 2, 3 and 4) on the elliptical orbit of a bound point particle moving under an inverse-square central force.  The center of attraction is shown as a small black circle from which the position vectors (likewise black) emanate.  The angular momentum vector L is perpendicular to the orbit.  The coplanar vectors p×L and (mk/r)r are shown in blue and green, respectively; these variables are defined below.  The vector A is constant in direction and magnitude.
Enlarge
Figure 1: The Laplace-Runge-Lenz vector A (shown in red) at four points (labeled 1, 2, 3 and 4) on the elliptical orbit of a bound point particle moving under an inverse-square central force. The center of attraction is shown as a small black circle from which the position vectors (likewise black) emanate. The angular momentum vector L is perpendicular to the orbit. The coplanar vectors p×L and (mk/r)r are shown in blue and green, respectively; these variables are defined below. The vector A is constant in direction and magnitude.

The Laplace-Runge-Lenz vector is a constant of motion in a key problem of classical mechanics: Kepler's problem,[1] in which two bodies interact by a central force that varies as the inverse square of the distance between them (such as Newtonian gravity or Coulomb's law of electrostatics). Their relative motion can be deduced by simple geometry from this vector and the total energy,[2] which is useful in calculating astronomical orbits.

The conservation of this vector also reveals a subtle symmetry of the Kepler problem. The Kepler problem has the property that the momentum vector p always traces out a circle.[3] Due to the arrangement of these circles for a given total energy E, the Kepler problem is mathematically equivalent to a particle moving freely on a four-dimensional sphere.[4] In this mathematical analogy, the conserved Laplace-Runge-Lenz vector corresponds to extra components of the angular momentum in the four-dimensional space.[5]

By the correspondence principle, the Laplace-Runge-Lenz vector has a quantum mechanical analogue, which was critical in the first derivation of the spectrum of the hydrogen atom,[6] before the invention of the Schrödinger equation.

The Laplace-Runge-Lenz vector is also known as the Laplace vector, the Runge-Lenz vector and the Lenz vector although, ironically, none of those scientists invented it. The Laplace-Runge-Lenz vector has been re-discovered several times[7] and is also equivalent to the dimensionless eccentricity vector of celestial mechanics.[8] Various generalizations of the Laplace-Runge-Lenz vector have been defined, which incorporate the effects of special relativity, electromagnetic fields and even different types of central forces.

Contents

[edit] Context

A single particle moving under any conservative central force has at least four constants of motion, the total energy E and the three Cartesian components of the angular momentum vector L. The particle's orbit is confined to a plane defined by the particle's initial momentum p (or, equivalently, its velocity v) and the vector r between the particle and the center of force (Figure 1).

As defined below, the Laplace-Runge-Lenz vector A always lies in the plane of motion for any central force. However, A is constant only for an inverse-square central force.[2] For most central forces, however, this vector A is not constant, but changes in both length and direction; if the central force is approximately an inverse-square law, the vector A is approximately constant in length, but slowly rotates its direction. A generalized conserved Laplace-Runge-Lenz vector \mathcal{A} can be defined for all central forces, but this generalized vector is a complicated function of position, and usually not expressible in closed form.[9][10]

The plane of motion is perpendicular to the angular momentum vector L, which is constant; this may be expressed mathematically by the vector dot product equation r⋅L = 0; likewise, since A lies in that plane, A⋅L = 0.

[edit] History of rediscovery

The Laplace-Runge-Lenz vector A is a constant of motion of the important Kepler problem, and is useful in describing astronomical orbits, such as the motion of the planets. Nevertheless, it has never been well known among physicists, possibly because it is less intuitive than momentum and angular momentum. Consequently, it has been rediscovered independently several times over the last three centuries.[7] Jakob Hermann was the first to show that A is conserved for a special case of the inverse-square central force,[11] and worked out its connection to the eccentricity of the orbital ellipse. Hermann's work was generalized to its modern form by Johann Bernoulli in 1710.[12] At the end of the century, Pierre-Simon de Laplace rediscovered the conservation of A, deriving it analytically, rather than geometrically.[13] In the middle of the nineteenth century, William Rowan Hamilton derived the equivalent eccentricity vector defined below,[8] using it to show that the momentum vector p moves on a circle for motion under an inverse-square central force (Figure 3).[3] At the beginning of the twentieth century, Josiah Willard Gibbs derived the same vector by vector analysis.[14] Gibbs' derivation was used as an example by Carle Runge in a popular German textbook on vectors,[15] which was referenced by Wilhelm Lenz in his paper on the (old) quantum mechanical treatment of the hydrogen atom.[16] In 1926, the vector was used by Wolfgang Pauli to derive the spectrum of hydrogen using modern quantum mechanics, but not the Schrödinger equation;[6] after Pauli's publication, it became known mainly as the Runge-Lenz vector.

[edit] Mathematical definition

For a single particle acted on by an inverse-square central force described by the equation \mathbf{F}(r)=\frac{-k}{r^{2}}\mathbf{\hat{r}}, the Laplace-Runge-Lenz vector A is defined mathematically by the formula[2]

\mathbf{A} = \mathbf{p} \times \mathbf{L} - m k \mathbf{\hat{r}}

where

  • m\!\, is the mass of the point particle moving under the central force,
  • \mathbf{p}\!\, is its momentum vector,
  • \mathbf{L} = \mathbf{r} \times \mathbf{p}\!\, is its angular momentum vector,
  • k\!\, is a parameter that describes strength of the central force,
  • \mathbf{r}\!\, is the position vector of the particle (Figure 1), and
  • \mathbf{\hat{r}}\!\, is the corresponding unit vector, i.e., \mathbf{\hat{r}} = \frac{\mathbf{r}}{r} where r is the magnitude of r.

Since the assumed force is conservative, the total energy E is a constant of motion

E = \frac{p^{2}}{2m} - \frac{k}{r} = \frac{1}{2} mv^{2} - \frac{k}{r}

Furthermore, the assumed force is a central force, and thus the angular momentum vector L is also conserved and defines the plane in which the particle travels. The Laplace-Runge-Lenz vector A is perpendicular to the angular momentum vector L because both p × L and r are perpendicular to L. It follows that A lies in the plane of the orbit.

This definition of the Laplace-Runge-Lenz vector A pertains to a single point particle of mass m moving under the action of a fixed force. However, the same definition may be extended to two-body problems such as Kepler's problem, by taking m as the reduced mass of the two bodies and r as the vector between the two bodies.

[edit] Derivation of the Kepler orbits

Figure 2: Simplified version of Figure 1, defining the angle θ between A and r at one point of the orbit.
Enlarge
Figure 2: Simplified version of Figure 1, defining the angle θ between A and r at one point of the orbit.

The shape and orientation of the Kepler problem orbits can be determined from the Laplace-Runge-Lenz vector as follows. Taking the dot product of A with the position vector r gives the equation

\mathbf{A} \cdot \mathbf{r} = Ar \cos\theta =  \mathbf{r} \cdot \left( \mathbf{p} \times \mathbf{L} \right) - mkr

where θ is the angle between r and A (Figure 2). Permuting the scalar triple product r ⋅ (p×L) = L ⋅ (r×p) = LL = L2, and rearranging yields the defining formula for a conic section

\frac{1}{r} = \frac{mk}{L^{2}} \left( 1 + \frac{A}{mk} \cos\theta \right)

of eccentricity e\!\, given by

e = \frac{A}{mk} = \frac{\left|\mathbf{A}\right|}{m k}

Taking the dot product of A with itself yields the equation

A^2= m^2 k^2 + 2 m E L^2 \,

which may be re-written in terms of the eccentricity

e^{2}  - 1= \frac{2L^{2}}{mk^{2}}E

Thus, if the energy is negative (bound orbits), the eccentricity is less than one and the orbit is an ellipse. Conversely, if the energy is positive (unbound orbits, also called "scattered orbits"), the eccentricity is greater than one and the orbit is a hyperbola. Finally, if the energy is exactly zero, the eccentricity is one and the orbit is a parabola. In all cases, the direction of A lies along the symmetry axis of the conic section and points from the center of force toward the periapsis, the point of closest approach.

[edit] Circular momentum hodographs

Figure 3: The momentum vector p (shown in blue) moves on a circle as the particle moves on an ellipse. The four labeled points correspond to those in Figure 1.  The circle is centered on the y-axis at position A/L (shown in magenta), with radius mk/L (shown in green).  The angle η determines the eccentricity e of the elliptical orbit (cos η = e).  By the inscribed angle theorem for circles, η is also the angle between any point on the circle and the two points of intersection with the px axis, px=±p0.
Enlarge
Figure 3: The momentum vector p (shown in blue) moves on a circle as the particle moves on an ellipse. The four labeled points correspond to those in Figure 1. The circle is centered on the y-axis at position A/L (shown in magenta), with radius mk/L (shown in green). The angle η determines the eccentricity e of the elliptical orbit (cos η = e). By the inscribed angle theorem for circles, η is also the angle between any point on the circle and the two points of intersection with the px axis, pxp0.

The conservation of the Laplace-Runge-Lenz vector A and angular momentum vector L is useful in showing that the momentum vector p moves on a circle under an inverse-square central force. Taking the cross product of A and L yields an equation for p

L^{2} \mathbf{p} = \mathbf{L} \times \mathbf{A} - mk \hat{\mathbf{r}} \times \mathbf{L}

Taking L along the z-axis and the major semiaxis as the x-axis yields the equation

p_{x}^{2} + \left(p_{y} - A/L \right)^{2} = \left( mk/L \right)^{2}

In other words, the momentum vector p is confined to a circle of radius mk/L centered on (0, A/L). The eccentricity e corresponds to the cosine of the angle η shown in Figure 3. For brevity, it is also useful to introduce the variable p_{0} = \sqrt{2m\left| E \right|}. This circular hodograph is useful in illustrating the symmetry of the Kepler problem.

[edit] Constants of motion and superintegrability

The seven scalar quantities E, A and L (being vectors, the latter two contribute three conserved quantities each) are related by two equations, AL = 0 and A2 = m2k2 + 2 m E L2, giving five independent constants of motion. This is consistent with the six initial conditions (the particle's initial position and velocity vectors, each with three components) that specify the orbit of the particle, since the initial time is not determined by a constant of motion. Since the magnitude of A (and the eccentricity e of the orbit) can be determined from the total angular momentum L and the energy E, only the direction of A is conserved independently; moreover, since A must be perpendicular to L, it contributes only one additional conserved quantity.

A mechanical system with d degrees of freedom can have at most 2d − 1 constants of motion, since there are 2d initial conditions and the initial time cannot be determined by a constant of motion. A system with more than d constants of motion is called superintegrable and a system with 2d − 1 constants is called maximally superintegrable.[17] Since the solution of the Hamilton-Jacobi equation in one coordinate system can yield only d constants of motion, superintegrable systems must be separable in more than one coordinate system.[18] The Kepler problem is maximally superintegrable, since it has three degrees of freedom (d=3) and five independent constant of motion; its Hamilton-Jacobi equation is separable in both spherical coordinates and parabolic coordinates,[19] as described below. Maximally superintegrable systems follow closed, one-dimensional orbits in phase space, since the orbit is the intersection of the phase-space isosurfaces of their constants of motion. Maximally superintegrable systems can be quantized using only commutation relations, as illustrated below.[20]

[edit] Alternative scalings, symbols and formulations

Contrary to the momentum and angular momentum vectors p and L, there is no universally accepted definition of the Laplace-Runge-Lenz vector; several different scaling factors and symbols are used in the scientific literature. The most common definition is given above, but another common alternative is to divide by the constant mk to obtain a dimensionless conserved eccentricity vector

\mathbf{e} =  \frac{1}{mk} \left(\mathbf{p} \times \mathbf{L} \right) - \mathbf{\hat{r}} =  \frac{m}{k} \left(\mathbf{v} \times \mathbf{r} \times \mathbf{v}\right) - \mathbf{\hat{r}}

where v is the velocity vector. This scaled vector e has the same direction as A and its magnitude equals the eccentricity of the orbit. Other scaled versions are also possible, e.g., by dividing A by m alone

\mathbf{M} = \mathbf{v} \times \mathbf{L} - k\mathbf{\hat{r}}

or by p0

\mathbf{D} = \frac{\mathbf{A}}{p_{0}} =  \frac{1}{\sqrt{2m\left| E \right|}}  \left\{ \mathbf{p} \times \mathbf{L} - m k \mathbf{\hat{r}} \right\}

which has the same units as the angular momentum vector L. In rare cases, the sign of the Laplace-Runge-Lenz vector may be reversed, i.e., scaled by −1. Other common symbols for the Laplace-Runge-Lenz vector include a, R, F, J and V. However, the choice of scaling and symbol for the Laplace-Runge-Lenz vector do not affect its conservation.

Figure 4: The angular momentum vector L, the Laplace-Runge-Lenz vector A and Hamilton's vector, the binormal  B, are mutually perpendicular; A and B point along the major and minor axes, respectively, of an elliptical orbit of the Kepler problem.
Enlarge
Figure 4: The angular momentum vector L, the Laplace-Runge-Lenz vector A and Hamilton's vector, the binormal B, are mutually perpendicular; A and B point along the major and minor axes, respectively, of an elliptical orbit of the Kepler problem.

An alternative conserved vector is the binormal vector B studied by William Rowan Hamilton[8]

\mathbf{B} = \mathbf{p} - \left(\frac{mk}{L^{2}r} \right) \  \left( \mathbf{L} \times \mathbf{r} \right)

which is conserved and points along the minor semiaxis of the ellipse; the Laplace-Runge-Lenz vector A = B × L is the cross product of B and L (Figure 4). The vector B is denoted as "binormal" since it is perpendicular to both A and L. Similar to the Laplace-Runge-Lenz vector itself, the binormal vector can be defined with different scalings and symbols.

The two conserved vectors, A and B can be combined to form a conserved dyadic tensor W[9]

\mathbf{W} = \alpha \mathbf{A} \otimes \mathbf{A} + \beta \, \mathbf{B} \otimes \mathbf{B}

where α and β are arbitrary scaling constants and \otimes represents the tensor product (which is not related to the vector cross product, despite ther similar symbol). Written in explicit components, this equation reads

W_{ij} = \alpha A_{i} A_{j} + \beta B_{i} B_{j} \,

Being perpendicular to each another, the vectors A and B can be viewed as the principal axes of the conserved tensor W, i.e., its scaled eigenvectors. W is perpendicular to L

\mathbf{L} \cdot \mathbf{W} =   \alpha \left( \mathbf{L} \cdot \mathbf{A} \right) \mathbf{A} + \beta \left( \mathbf{L} \cdot \mathbf{B} \right) \mathbf{B} = 0

since A and B are both perpendicular to L as well, LA = LB = 0. For clarification, this equation reads in explicit components

\left( \mathbf{L} \cdot \mathbf{W} \right)_{j} =   \alpha \left( \sum_{i=1}^{3} L_{i} A_{i} \right) A_{j} + \beta \left( \sum_{i=1}^{3} L_{i} B_{i} \right) B_{j} = 0

[edit] Conservation under inverse-square forces

The force \mathbf{F} acting on the particle is assumed to be a central force

\mathbf{F} = \frac{d\mathbf{p}}{dt} = f(r) \frac{\mathbf{r}}{r} = f(r) \mathbf{\hat{r}}

for some function f(r) of the radius r. Since the angular momentum \mathbf{L} = \mathbf{r} \times \mathbf{p} is conserved under central forces, \frac{d}{dt}\mathbf{L} = 0 and

\frac{d}{dt} \left( \mathbf{p} \times \mathbf{L} \right) = \frac{d\mathbf{p}}{dt} \times \mathbf{L}  = f(r) \mathbf{\hat{r}} \times \mathbf{r} \times m \frac{d\mathbf{r}}{dt} = f(r) \frac{m}{r} \left[ \mathbf{r} \left(\mathbf{r} \cdot \frac{d\mathbf{r}}{dt} \right) - r^{2} \frac{d\mathbf{r}}{dt} \right]

where the momentum \mathbf{p} = m \frac{d\mathbf{r}}{dt} and where the triple cross product has been simplified using Lagrange's formula

\mathbf{r} \times \mathbf{r} \times \frac{d\mathbf{r}}{dt} = \mathbf{r} \left(\mathbf{r} \cdot \frac{d\mathbf{r}}{dt} \right) - r^{2} \frac{d\mathbf{r}}{dt}

The identity

\frac{d}{dt} \left( \mathbf{r} \cdot \mathbf{r} \right) = 2 \mathbf{r} \cdot \frac{d\mathbf{r}}{dt} = \frac{d}{dt} \left( r^{2} \right) = 2r\frac{dr}{dt}

yields the equation

\frac{d}{dt} \left( \mathbf{p} \times \mathbf{L} \right) =  -m f(r) r^{2} \left[ \frac{1}{r} \frac{d\mathbf{r}}{dt} -  \frac{\mathbf{r}}{r^{2}} \frac{dr}{dt}\right] =  -m f(r) r^{2} \frac{d}{dt} \left( \frac{\mathbf{r}}{r}\right)

For the special case of an inverse-square central force f(r)=\frac{-k}{r^{2}}, this equals

\frac{d}{dt} \left( \mathbf{p} \times \mathbf{L} \right) =  m k \frac{d}{dt} \left( \frac{\mathbf{r}}{r}\right) =  \frac{d}{dt} \left( mk\mathbf{\hat{r}} \right)

Therefore, A is conserved for inverse-square central forces

\frac{d}{dt} \mathbf{A} = \frac{d}{dt} \left( \mathbf{p} \times \mathbf{L} \right) - \frac{d}{dt} \left( mk\mathbf{\hat{r}} \right) = 0

As described below, this Laplace-Runge Lenz vector A is a special case of a general conserved vector \mathcal{A} that can be defined for all central forces.[9][10] However, since most central forces do not produce closed orbits (see Bertrand's theorem), the analogous vector \mathcal{A} rarely has a simple definition and is generally a multivalued function of the angle θ between r and \mathcal{A}.

[edit] Evolution under perturbed potentials

Figure 5: Gradually precessing elliptical orbit, with an eccentricity e=0.9.  Such precession arises in the Kepler problem if the attractive central force deviates slightly from an inverse-square law.  The rate of precession can be calculated using the formulae in the text.
Enlarge
Figure 5: Gradually precessing elliptical orbit, with an eccentricity e=0.9. Such precession arises in the Kepler problem if the attractive central force deviates slightly from an inverse-square law. The rate of precession can be calculated using the formulae in the text.

In many practical problems such as planetary motion, the interaction between two bodies is approximately an inverse-square central force, but not exactly. In such cases, the Laplace-Runge-Lenz vector A is not perfectly constant. However, if the perturbing potential h(r) is a conservative central force, the total energy E and angular momentum vector L are conserved. Therefore, the motion still lies in a plane perpendicular to L and the magnitude A is conserved, by the equation A^2= m^2 k^2 +2 m E L^2 \!\,. Consequently, the direction of A slowly rotates in the orbit of the plane; using canonical perturbation theory and action-angle coordinates, it is straightforward to show[2] that A rotates at a rate of

\frac{\partial}{\partial L} \langle h(r) \rangle =  \frac{\partial}{\partial L} \left\{ \frac{1}{T} \int_{0}^{T} h(r) \ dt \right\} =  \frac{\partial}{\partial L} \left\{ \frac{m}{L^{2}} \int_{0}^{2\pi} r^{2} h(r) d\theta \right\}

where T is the orbital period and the identity L dt = m r2 was used to convert the time integral into an angular integral (Figure 5). As an example, the theory of general relativity adds a small inverse-cubic perturbation to the normal Newtonian inverse-square gravitational force;[21] specifically,

h(r) = \frac{kL^{2}}{m^{2}c^{2}} \left( \frac{1}{r^{3}} \right)

Inserting this function into the integral and using the equation

\frac{1}{r} = \frac{mk}{L^{2}} \left( 1 + \frac{A}{mk} \cos\theta \right)

to express r in terms of θ, the precession rate of the periapsis caused by this non-Newtonian perturbation is calculated to be[21]

\frac{6\pi k^{2}}{TL^{2}c^{2}}

which closely matches the observed orbital precession of Mercury[22] and binary pulsars.[23] This agreement with experiment is considered to be strong evidence for general relativity.[24][25]

[edit] Poisson brackets

The three components Li of the angular momentum vector L have the Poisson brackets

\left[ L_{i}, L_{j}\right] = \sum_{s=1}^{3} \epsilon_{ijs} L_{s}

where i=1,2,3 and εijs is the fully antisymmetric tensor, i.e., the Levi-Civita symbol; the summation index s is used here to avoid confusion with the force parameter k defined above. The Poisson brackets are represented here as square brackets (not curly braces), both for consistency with the references and because they will be interpreted as quantum mechanical commutation relations in the next section and as Lie brackets in a following section.

As noted above, a scaled Laplace-Runge-Lenz vector D may be defined with the same units as angular momentum by dividing A by p0. The Poisson brackets of D with the angular momentum vector L can be written in a similar form

\left[ D_{i}, L_{j}\right] = \sum_{s=1}^{3} \epsilon_{ijs} D_{s}

The Poisson brackets of D with itself depend on the sign of E, i.e., on whether the total energy E is negative (producing closed, elliptical orbits under an inverse-square central force) or positive (producing open, hyperbolic orbits under an inverse-square central force). For negative energies — i.e., for bound systems — the Poisson brackets are

\left[ D_{i}, D_{j}\right] = \sum_{s=1}^{3} \epsilon_{ijs} L_{s}

whereas, for positive energy, the Poisson brackets have the opposite sign

\left[ D_{i}, D_{j}\right] = -\sum_{s=1}^{3} \epsilon_{ijs} L_{s}

The Casimir invariants for negative energies are defined by

C_{1} = \mathbf{D} \cdot \mathbf{D} + \mathbf{L} \cdot \mathbf{L} = \frac{mk^{2}}{2\left|E\right|}
C_{2} = \mathbf{D} \cdot \mathbf{L} = 0

and have zero Poisson brackets with all components of D and L

\left[ C_{1}, L_{i} \right] = \left[ C_{1}, D_{i} \right] =  \left[ C_{2}, L_{i} \right] = \left[ C_{2}, D_{i} \right] = 0

C2 is trivially zero, since the two vectors are always perpendicular. However, the other invariant C1 is non-trivial and depends only on m, k and E. This invariant allows the energy levels of hydrogen-like atoms to be derived using only quantum mechanical canonical commutation relations, instead of the more customary Schrödinger equation.

[edit] Quantum mechanics of the hydrogen atom

Figure 6: Energy levels of the hydrogen atom as predicted from the commutation relations of angular momentum and Laplace-Runge-Lenz vector operators; these energy levels have been verified experimentally.
Enlarge
Figure 6: Energy levels of the hydrogen atom as predicted from the commutation relations of angular momentum and Laplace-Runge-Lenz vector operators; these energy levels have been verified experimentally.

Poisson brackets provide a simple method for quantizing a classical system; the commutation relation of two quantum mechanical operators equals the Poisson bracket of the corresponding classical variables, multiplied by i\hbar.[26] By carrying out this quantization and calculating the eigenvalues of the C1 Casimir operator for the Kepler problem, Wolfgang Pauli was able to derive the energy levels of hydrogen-like atoms (Figure 6) and, thus, their atomic emission spectrum.[6] This elegant derivation was obtained prior to the development of the Schrödinger equation.[27]

A subtlety of the quantum mechanical operator for the Laplace-Runge-Lenz vector A is that the momentum and angular momentum operators do not commute; hence, the cross product of p and L must be defined carefully.[28] Typically, the operators for the Cartesian components As are defined using a symmetric product

A_{s} = - m k \hat{r}_{s} + \frac{1}{2} \sum_{i=1}^{3} \sum_{j=1}^{3} \epsilon_{sij} \left( p_{i} l_{j} + l_{j} p_{i} \right)

from which the corresponding ladder operators can be defined

J_{0} = A_{3} \,
J_{\pm 1} = \mp \frac{1}{\sqrt{2}} \left( A_{1} \pm i A_{2} \right)

A normalized first Casimir invariant operator can likewise be defined

C_{1} = - \frac{m k^{2}}{2 \hbar^{2}} H^{-1} - I

where H−1 is the inverse of the Hamiltonian energy operator and I is the identity operator. Applying these ladder operators to the eigenstates \left| l m n \right.\rangle of the total angular momentum, azimuthal angular momentum and energy operators, the eigenvalues of the first Casimir operator C1 are n2 − 1; importantly, they are independent of the l and m quantum numbers, making the energy levels degenerate.[28] Hence, the energy levels are given by

E_{n} = - \frac{m k^{2}}{2\hbar^{2} n^{2}}

which equals the Rydberg formula for hydrogen-like atoms (Figure 6).

[edit] Hamilton-Jacobi equation in parabolic coordinates

The constancy of the Laplace-Runge-Lenz vector can also be derived from the Hamilton-Jacobi equation in parabolic coordinates (ξ, η), which are defined by the equations

\xi = r + x \,
\eta = r - x \,

where r represents the radius in the plane of the orbit

r = \sqrt{x^{2} + y^{2}}

The inversion of these coordinates is

x = \frac{1}{2} \left( \xi - \eta \right)
y = \sqrt{\xi\eta}

Separation of the Hamilton-Jacobi equation in these coordinates yields the two equivalent equations[19][29]

2\xi p_{\xi}^{2} - mk - mE\xi = -\Gamma
2\eta p_{\eta}^{2} - mk - mE\eta = \Gamma

where Γ is a constant of motion. Subtraction and re-expression in terms of the Cartesian momenta px and py shows that Γ is equivalent to the Laplace-Runge-Lenz vector

\Gamma = p_{y} \left( x p_{y} - y p_{x} \right) - mk\frac{x}{r} = A_{x}

This Hamilton-Jacobi approach can be used to derive a conserved generalized Laplace-Runge-Lenz vector \mathcal{A} in the presence of an electric field E [19][30]

\mathcal{A} = \mathbf{A} + \frac{mq}{2} \left[ \left( \mathbf{r} \times \mathbf{E} \right) \times \mathbf{r} \right]

where q is the charge on the orbiting particle.

[edit] Conservation and symmetry

The conservation of the Laplace-Runge-Lenz vector corresponds to a subtle symmetry of the system. In classical mechanics, symmetries are continuous operations that map one orbit onto another without changing the energy of the system; in quantum mechanics, symmetries are continuous operations that "mix" electronic orbitals of the same energy, i.e., degenerate energy levels. A conserved quantity is usually associated with such symmetries.[2] For example, every central force is symmetric under the rotation group SO(3), leading to the conservation of angular momentum L. Classically, an overall rotation of the system does not affect the energy of an orbit; quantum mechanically, rotations mix the spherical harmonics of the same quantum number l without changing the energy.

Figure 7: The family of circular momentum hodographs for a given energy E.  All the circles pass through the same two points  on the px-axis (cf. Figure 3).  This family of hodographs corresponds to one family of Apollonian circles, and the σ isosurfaces of bipolar coordinates.
Enlarge
Figure 7: The family of circular momentum hodographs for a given energy E. All the circles pass through the same two points \pm p_{0} = \pm \sqrt{2m\left| E \right|} on the px-axis (cf. Figure 3). This family of hodographs corresponds to one family of Apollonian circles, and the σ isosurfaces of bipolar coordinates.

The symmetry for the inverse-square central force is higher and more subtle. The peculiar symmetry of the Kepler problem results in the conservation of both the angular momentum vector L and the Laplace-Runge-Lenz vector A (as defined above) and, quantum mechanically, ensures that the energy levels of hydrogen do not depend on the angular momentum quantum numbers l and m. The symmetry is more subtle, however, because the symmetry operation must take place in a higher-dimensional space; such symmetries are often called "hidden symmetries".[31] Classically, the higher symmetry of the Kepler problem allows for continuous alterations of the orbits that preserve energy but not angular momentum; expressed another way, orbits of the same energy but different angular momentum (eccentricity) can be transformed continuously into one another. Quantum mechanically, this corresponds to mixing orbitals that differ in the l and m quantum numbers, such as the s (l=0) and p (l=1) atomic orbitals. Such mixing cannot be done with ordinary three-dimensional translations or rotations, but is equivalent to a rotation in a higher dimension.

For negative energies — i.e., for bound systems — the higher symmetry group is SO(4), which preserves the length of four-dimensional vectors

\left| \mathbf{e} \right|^{2} = e_{1}^{2} + e_{2}^{2} + e_{3}^{2} + e_{4}^{2}

In 1935, Vladimir Fock showed that the quantum mechanical bound Kepler problem is equivalent to the problem of a free particle confined to a four-dimensional hypersphere.[4] Specifically, Fock showed that the Schrödinger wavefunction in momentum space for the Kepler problem was the stereographic projection of the spherical harmonics on the hypersphere. Rotation of the hypersphere and reprojection results in a continuous mapping of the elliptical orbits without changing the energy; quantum mechanically, this corresponds to a mixing of all orbitals of the same energy quantum number n. Valentine Bargmann noted subsequently that the Poisson brackets for the angular momentum vector L and the scaled Laplace-Runge-Lenz vector D formed the Lie algebra for SO(4).[5] Simply put, the six quantities D and L correspond to the six conserved angular momenta in four dimensions, associated with the six possible simple rotations in that space (there are six ways of choosing two axes from four). This conclusion does not imply that our universe is a four-dimensional hypersphere; it merely means that this particular physics problem (the two-body problem for inverse-square central forces) is mathematically equivalent to a free particle on a four-dimensional hypersphere.

For positive energies — i.e., for unbound, "scattered" systems — the higher symmetry group is SO(3,1), which preserves the Minkowski length of 4-vectors

ds^{2} = e_{1}^{2} + e_{2}^{2} + e_{3}^{2} - e_{4}^{2}

Both the negative- and positive-energy cases were considered by Fock[4] and Bargmann[5] and have been reviewed encyclopedically by Bander and Itzykson.[32][33]

The orbits of central-force systems — and those of the Kepler problem in particular — are also symmetric under reflection. Therefore, the SO(3), SO(4) and SO(3,1) groups cited above are not the full symmetry groups of their orbits; the full groups are O(3), O(4) and O(3,1), respectively. Nevertheless, only the connected subgroups, SO(3), SO(4) and SO(3,1), are needed to demonstrate the conservation of the angular momentum and Laplace-Runge-Lenz vectors; the reflection symmetry is irrelevant for conservation, which may be derived from the Lie algebra of the group.

[edit] Rotational symmetry in four dimensions

Figure 8: The momentum hodographs of Figure 7 correspond to stereographic projections of great circles on the four-dimensional η unit sphere.  All of the great circles intersect the ηx axis, which is perpendicular to the page; the projection is from the North pole (the w unit vector) to the ηx-ηy plane, as shown here for the magenta hodograph by the dashed black lines.  The great circle at a latitude α corresponds to an eccentricity e = sin α.  The colors of the great circles shown here correspond to their matching hodographs in Figure 7.
Enlarge
Figure 8: The momentum hodographs of Figure 7 correspond to stereographic projections of great circles on the four-dimensional η unit sphere. All of the great circles intersect the ηx axis, which is perpendicular to the page; the projection is from the North pole (the w unit vector) to the ηxy plane, as shown here for the magenta hodograph by the dashed black lines. The great circle at a latitude α corresponds to an eccentricity e = sin α. The colors of the great circles shown here correspond to their matching hodographs in Figure 7.

The connection between the Kepler problem and four-dimensional rotational symmetry SO(4) can be readily visualized.[32][34][35] Let the four-dimensional Cartesian coordinates be denoted (w, x, y, z) where (x, y, z) represent the Cartesian coordinates of the normal position vector r. The three-dimensional momentum vector p is associated with a four-dimensional vector \boldsymbol\eta on a four-dimensional unit sphere

\boldsymbol\eta =  \frac{p^{2} - p_{0}^{2}}{p^{2} + p_{0}^{2}} \mathbf{\hat{w}} + \frac{2 p_{0}}{p^{2} + p_{0}^{2}} \mathbf{p} =  \frac{mk - r p_{0}^{2}}{mk} \mathbf{\hat{w}} + \frac{rp_{0}}{mk} \mathbf{p}

where \mathbf{\hat{w}} is the unit vector along the new w-axis. The transformation mapping p to η can be uniquely inverted; for example, the x-component of the momentum equals

p_{x} = p_{0} \frac{\eta_{x}}{1 - \eta_{w}}

and similarly for py and pz. In other words, the three-dimensional vector p is a stereographic projection of the four-dimensional \boldsymbol\eta vector, scaled by p0 (Figure 8).

Without loss of generality, we may eliminate the normal rotational symmetry by choosing the Cartesian coordinates such that the z-axis is aligned with the angular momentum vector L and the momentum hodographs are aligned as they are in Figure 7, with the centers of the circles on the y-axis. Since the motion is planar, and p and L are perpendicular, pz = ηz = 0 and attention may be restricted to the three-dimensional vector \boldsymbol\eta = (ηw, ηx, ηy). The family of Apollonian circles of momentum hodographs (Figure 7) correspond to a family of great circles on the three-dimensional \boldsymbol\eta sphere, all of which intersect the ηx-axis at the two foci ηx = ±1, corresponding to the momentum hodograph foci at px = ±p0. These great circles are related by a simple rotation about the ηx-axis (Figure 8). This rotational symmetry transforms all the orbits of the same energy into one another; however, such a rotation is orthogonal to the usual three-dimensional rotations, since it transforms the fourth dimension ηw. This higher symmetry is characteristic of the Kepler problem and corresponds to the conservation of the Laplace-Runge-Lenz vector.

An elegant action-angle variables solution for the Kepler problem can be obtained by eliminating the redundant four-dimensional coordinates \boldsymbol\etain favor of elliptic cylindrical coordinates (χ, ψ, φ)[36]

\eta_{w} = \mathrm{cn}\, \chi \  \mathrm{cn}\, \psi
\eta_{x} = \mathrm{sn}\, \chi \  \mathrm{dn}\, \psi \  \cos \phi
\eta_{y} = \mathrm{sn}\, \chi \  \mathrm{dn}\, \psi \  \sin \phi
\eta_{z} = \mathrm{dn}\, \chi \  \mathrm{sn}\, \psi

where sn, cn and dn are Jacobi's elliptic functions.

[edit] Noether's theorem

The connection between the rotational symmetry just described and the conservation of the Laplace-Runge-Lenz vector can be made quantitative as follows. Noether's theorem states that any infinitesimal variation of the generalized coordinates of a physical system

\delta q_{i} = \epsilon g_{i}(\mathbf{q}, \mathbf{\dot{q}}, t)

that causes the Lagrangian to vary to first order by a total time derivative

\delta L = \epsilon \frac{d}{dt} G(\mathbf{q}, t)

corresponds to a conserved quantity Γ

\Gamma = -G + \sum_{i} g_{i} \left( \frac{\partial L}{\partial \dot{q}_{i}}\right)

In particular, the conserved Laplace-Runge-Lenz vector component As corresponds to the variation in the coordinates[37]

\delta x_{i} = \frac{\epsilon}{2} \left[ 2 p_{i} x_{s} - x_{i} p_{s} - \delta_{is} \left( \mathbf{r} \cdot \mathbf{p} \right) \right]

where i equals 1, 2 and 3, with xi and pi being the ith components of the position and momentum vectors r and p, respectively; as usual, δis represents the Kronecker delta. The resulting first-order change in the Lagrangian is

\delta L = \epsilon mk\frac{d}{dt} \left( \frac{x_{s}}{r} \right)

Substitution into the general formula for the conserved quantity Γ yields the conserved component As of the Laplace-Runge-Lenz vector

A_{s} = \left[ p^{2} x_{s} - p_{s} \ \left(\mathbf{r} \cdot \mathbf{p}\right) \right] - mk \left( \frac{x_{s}}{r} \right) =  \left[ \mathbf{p} \times \mathbf{r} \times \mathbf{p} \right]_{s} - mk \left( \frac{x_{s}}{r} \right)

[edit] Lie transformation

Figure 9: The Lie transformation from which the conservation of the Laplace-Runge-Lenz vector A is derived.  As the scaling parameter λ varies, the energy and angular momentum changes, but the eccentricity e and the magnitude and direction of A do not.
Enlarge
Figure 9: The Lie transformation from which the conservation of the Laplace-Runge-Lenz vector A is derived. As the scaling parameter λ varies, the energy and angular momentum changes, but the eccentricity e and the magnitude and direction of A do not.

The Noether theorem derivation of the conservation of the Laplace-Runge-Lenz vector A is elegant, but has one drawback: the coordinate variation δxi involves not only the position r, but also the momentum p or, equivalently, the velocity v.[38] This drawback may be eliminated by instead deriving the conservation of A using an approach pioneered by Sophus Lie.[39][40] Specifically, one may define a Lie transformation[31] in which the coordinates r and the time t are scaled by different powers of a parameter λ (Figure 9)

t \rightarrow \lambda^{3}t, \  \mathbf{r} \rightarrow \lambda^{2}\mathbf{r}, \  \mathbf{p} \rightarrow \frac{1}{\lambda}\mathbf{p}

This transformation changes the total angular momentum L and energy E

L \rightarrow \lambda L, \  E \rightarrow \frac{1}{\lambda^{2}} E

but preserves their product EL2. Therefore, the eccentricity e and the magnitude A are preserved, as may be seen from the equation for A2

A2 = m2k2e2 = m2k2 + 2mEL2

The direction of A is preserved as well, since the semiaxes are not altered by a global scaling. This transformation also preserves Kepler's third law, namely, that the semiaxis a and the period T form a constant T2/a3.

[edit] Generalizations to other potentials and relativity

The Laplace-Runge-Lenz vector has been generalized to other potentials and even to special relativity. The most general form can be written as[9]

\mathcal{A} =  \left( \frac{\partial \xi}{\partial u} \right) \left(\mathbf{p} \times \mathbf{L}\right)  +  \left[ \xi - u \left( \frac{\partial \xi}{\partial u} \right)\right] L^{2}  \mathbf{\hat{r}}

where u = 1/r (cf. Bertrand's theorem) and ξ = cos θ, with the angle θ defined by

\theta = L \int^{u} \frac{du}{\sqrt{m^{2} c^{2} \left(\gamma^{2} - 1 \right) - L^{2} u^{2}}}

and γ is the Lorentz factor. As before, we may obtain a conserved binormal vector B by taking the cross product with the conserved angular momentum vector

\mathcal{B} = \mathbf{L} \times \mathcal{A}

These two vectors may likewise be combined into a conserved dyadic tensor W

\mathcal{W} = \alpha \mathcal{A} \otimes \mathcal{A} + \beta \, \mathcal{B} \otimes \mathcal{B}

For illustration, the Laplace-Runge-Lenz vector for a non-relativistic, isotopic harmonic oscillator can be calculated.[9] Since the force is central

\mathbf{F}(r)= -k \mathbf{r}

the angular momentum vector is conserved and the motion lies in a plane. The conserved dyadic tensor can be written in a simple form

\mathcal{W} = \frac{1}{2m} \mathbf{p} \otimes \mathbf{p} + \frac{k}{2} \, \mathbf{r} \otimes \mathbf{r}

although it should be noted that p and r are not necessarily perpendicular. The corresponding Runge-Lenz vector is more complicated

\mathcal{A} = \frac{1}{\sqrt{mr^{2}\omega_{0} A - mr^{2}E + L^{2}}} \left\{ \left( \mathbf{p} \times \mathbf{L} \right) + \left(mr\omega_{0} A - mrE \right) \mathbf{\hat{r}} \right\}

where \omega_{0} = \sqrt{\frac{k}{m}} is the natural oscillation frequency.

[edit] See also

[edit] References

  1. ^ Arnold, VI (1989). Mathematical Methods of Classical Mechanics, 2nd ed.. New York: Springer-Verlag, 38. ISBN 0-387-96890-3.
  2. ^ a b c d e Goldstein, H. (1980). Classical Mechanics, 2nd edition, Addison Wesley, 102–105,421–422.
  3. ^ a b Hamilton, WR (1847). "Unknown title". Proceedings of the Royal Irish Academy 3: 344ff.
  4. ^ a b c Fock, V (1935). "Zur Theorie des Wasserstoffatoms". Zeitschrift für Physik 98: 145–154.
  5. ^ a b c Bargmann, V (1936). "Zur Theorie des Wasserstoffatoms: Bemerkungen zur gleichnamigen Arbeit von V. Fock". Zeitschrift für Physik 99: 576–582.
  6. ^ a b c Pauli, W (1926). "Über das Wasserstoffspektrum vom Standpunkt der neuen Quantenmechanik". Zeitschrift für Physik 36: 336–363.
  7. ^ a b Goldstein, H. (1975). "Prehistory of the Runge-Lenz vector". American Journal of Physics 43: 735–738.
    Goldstein, H. (1976). "More on the prehistory of the Runge-Lenz vector". American Journal of Physics 44: 1123–1124.
  8. ^ a b c Hamilton, WR (1847). "Applications of Quaternions to Some Dynamical Questions". Proceedings of the Royal Irish Academy 3: Appendix III.
  9. ^ a b c d e Fradkin, DM (1967). "Existence of the Dynamic Symmetries O4 and SU3 for All Classical Central Potential Problems". Progress of Theoretical Physics 37: 798–812.
  10. ^ a b Yoshida, T (1987). "Two methods of generalisation of the Laplace-Runge-Lenz vector". European Journal of Physics 8: 258–259.
  11. ^ Hermann, J (1710). "Unknown title". Giornale de Letterati D'Italia 2: 447–467.
    Hermann, J (1710). "Extrait d'une lettre de M. Herman à M. Bernoulli datée de Padoüe le 12. Juillet 1710". Histoire de l'academie royale des sciences (Paris) 1732: 519–521.
  12. ^ Bernoulli, J (1710). "Extrait de la Réponse de M. Bernoulli à M. Herman datée de Basle le 7. Octobre 1710". Histoire de l'academie royale des sciences (Paris) 1732: 521–544.
  13. ^ Laplace, PS (1799). Traité de mécanique celeste, Tome I, Premiere Partie, Livre II, pp.165ff.
  14. ^ Gibbs, JW, Wilson EB (1901). Vector Analysis. New York: Scribners, p. 135.
  15. ^ Runge, C (1919). Vektoranalysis. Leipzig: Hirzel, Volume I.
  16. ^ Lenz, W (1924). "Über den Bewegungsverlauf und Quantenzustände der gestörten Keplerbewegung". Zeitschrift für Physik 24: 197–207.
  17. ^ Evans, NW (1990). "Superintegrability in classical mechanics". Physical Review A 41: 5666–5676.
  18. ^ Sommerfeld, A (1923). Atomic Structure and Spectral Lines. London: Methuen, 118.
  19. ^ a b c Landau, LD, Lifshitz EM (1976). Mechanics, 3rd edition, Pergamon Press, p. 154. ISBN 0-08-021022-8 (hardcover) and ISBN 0-08-029141-4 (softcover).
  20. ^ Evans, NW (1991). "Group theory of the Smorodinsky-Winternitz system". Journal of Mathematical Physics 32: 3369–3375.
  21. ^ a b Einstein, A (1915). "Erklärung der Perihelbewegung des Merkur aus der allgemeinen Relativitätstheorie". Sitzungsberichte der Preussischen Akademie der Wissenschaften 1915: 831–839.
  22. ^ Le Verrier, UJJ (1859). "Lettre de M. Le Verrier à M. Faye sur la Théorie de Mercure et sur le Mouvement du Périhélie de cette Planète". Comptes Rendus de l'Academie de Sciences (Paris) 49: 379–383.
  23. ^ Will, CM (1979). General Relativity, an Einstein Century Survey, SW Hawking and W Israel, eds., Cambridge: Cambridge University Press, Chapter 2.
  24. ^ Pais, A. (1982). Subtle is the Lord: The Science and the Life of Albert Einstein. Oxford University Press.
  25. ^ Roseveare, NT (1982). Mercury's Perihelion from Le Verrier to Einstein. Oxford University Press.
  26. ^ Dirac, PAM (1958). Principles of Quantum Mechanics, 4th revised edition. Oxford University Press.
  27. ^ Schrödinger, E (1926). "Quantisierung als Eigenwertproblem". Annalen der Physik 384: 361–376.
  28. ^ a b Bohm, A. (1986). Quantum Mechanics: Foundations and Applications, 2nd edition, Springer Verlag, 208–222.
  29. ^ Dulock, VA, McIntosh HV (1966). "On the Degeneracy of the Kepler Problem". Pacific Journal of Mathematics 19: 39–55.
  30. ^ Redmond, PJ (1964). "Generalization of the Runge-Lenz Vector in the Presence of an Electric Field". Physical Review 133: B1352–B1353.
  31. ^ a b Prince, GE, Eliezer CJ (1981). "On the Lie symmetries of the classical Kepler problem". Journal of Physics A: Mathematical and General 14: 587–596.
  32. ^ a b Bander, M, Itzykson C (1966). "Group Theory and the Hydrogen Atom (I)". Reviews of Modern Physics 38: 330–345.
  33. ^ Bander, M, Itzykson C (1966). "Group Theory and the Hydrogen Atom (II)". Reviews of Modern Physics 38: 346–358.
  34. ^ Rogers, HH (1973). "Symmetry transformations of the clasical Kepler problem". Journal of Mathematical Physics 14: 1125–1129.
  35. ^ Guillemin, V, Sternberg S (1990). Variations on a Theme by Kepler. American Mathematical Society Colloquium Publications, volume 42. ISBN 0-8218-1042-1.
  36. ^ Lakshmanan, M, Hasegawa H. "On the canonical equivalence of the Kepler problem in coordinate and momentum spaces". Journal of Physics A 17: L889–L893.
  37. ^ Lévy-Leblond, JM (1971). "Conservation Laws for Gauge-Invariant Lagrangians in Classical Mechanics". American Journal of Physics 39: 502–506.
  38. ^ Gonzalez-Gascon, F (1977). "Notes on the symmetries of systems of differential equations". Journal of Mathematical Physics 18: 1763–1767.
  39. ^ Lie, S (1891). Vorlesungen über Differentialgleichungen. Leipzig: Teubner.
  40. ^ Ince, EL (1926). Ordinary Differential Equations. New York: Dover (1956 reprint), 93–113.

[edit] Further reading

In other languages