- Boxes
- definitions
- Ellipses
- theorems and lemmas
- Blue border
- the statement of this result is ready to be formalized; all prerequisites are done
- Orange border
- the statement of this result is not ready to be formalized; the blueprint needs more work
- Blue background
- the proof of this result is ready to be formalized; all prerequisites are done
- Green border
- the statement of this result is formalized
- Green background
- the proof of this result is formalized
- Dark green background
- the proof of this result and all its ancestors are formalized
- Dark green border
- this is in Mathlib
The signed areal velocity along a curve \(\gamma \) is
where \(L(0) = q(0) \times p(0)\). The swept area on a time interval \([t_1, t_2]\) is:
For a phase point \(z = (q, p)\), define:
Here \(H\) is the total energy (kinetic minus potential), \(L\) is the angular momentum vector, and \(A\) is the Laplace–Runge–Lenz vector. All three are proved constant along every non-collision solution in Chapter 4.
For a phase point \(z = (q, p)\) away from collision, define:
Here \(e\) is the eccentricity. \(\ell \) is the semi-latus rectum. \(a\) is the semi-major axis, meaning half the length of the long axis of the ellipse. \(b\) is the semi-minor axis, meaning half the length of the short axis of the ellipse. Since \(L\), \(A\), and \(H\) are conserved, all four quantities are constant along a solution. Geometric data. The center of the ellipse is not at the origin. The origin is a focus. The distance from the center to the focus is \(ae\). The relation \(b^2 = a\ell \) will be used later.
Fix \(h {\lt} 0\). The negative-energy shell \(M_h\) is the set of all phase points \((q, p)\) with \(H(q, p) = h\) and \(r \neq 0\). Equivalently, it is the fixed-energy surface cut out by the equation \(H = h\) inside the non-collision phase space, so it has dimension five.
On \(M_h\), define the rescaled Lenz vector:
This is well-defined because \(h\) is a fixed negative number, so \(\sqrt{-2mh}\) is an honest real constant on the whole surface. The rescaling is chosen so that \(\{ K_u, K_v\} = L_{u \times v}\) on \(M_h\) (proved in Theorem 21).
The radial distance is \(r = \| q\| \). The non-collision phase space is the subset where \(r \neq 0\), i.e., where the planet is not at the origin.
A configuration period is a minimal positive time \(T {\gt} 0\) such that \(q(t + T) = q(t)\) for all \(t\). An orbital period adds the condition that the swept area over \([0, T]\) equals \(\pi ab\).
The phase space is \(\mathbb {R}^3 \times \mathbb {R}^3\). A point is written \((q, p)\) where \(q \in \mathbb {R}^3\) is the position and \(p \in \mathbb {R}^3\) is the momentum. The mass \(m {\gt} 0\) and gravitational parameter \(\mu {\gt} 0\) are positive real constants.
For a smooth function \(F\) on phase space, write \(\partial _{q_i}F\) for its partial derivative in the \(q_i\) direction and \(\partial _{p_i}F\) in the \(p_i\) direction. The Poisson bracket is:
If \(H\) is the Hamiltonian, then \(\{ F,H\} \) is the time derivative of \(F\) along the motion.
Suppose a Lie algebra \(\mathfrak g\) acts on a phase space \(M\) by infinitesimal symmetries. A momentum map is a map
such that pairing with \(\xi \in \mathfrak g\) gives the Hamiltonian for the infinitesimal motion generated by \(\xi \). The associated comoment map is the linear map
Lean uses the comoment because the declarations are scalar observables.
In the Kepler problem we identify \(\mathfrak {so}(3)\) with \(\mathbb {R}^3\) using the cross product. For \(u \in \mathbb {R}^3\), define the scalar functions
The map \(u \mapsto L_u\) is the rotational comoment built from the angular momentum.
The Kepler equations of motion are
The first equation says velocity equals momentum divided by mass. The second is Newton’s law for the inverse-square gravitational force. In Lean, the second component is defined by cases so the expression makes sense on all of \(\mathbb {R}^3 \times \mathbb {R}^3\). The case \(q = 0\) is never reached by a non-collision solution.
Along every solution, \(L = q \times p\) is constant.
Differentiate \(L = q \times p\) using the product rule:
Substitute the equations of motion:
The first term vanishes because \(p \times p = 0\). The second vanishes because \(q \times q = 0\). So \(\dot L = 0\) and \(L\) is constant.
The coordinate functions satisfy:
From the definition, \(\{ q_i, q_j\} = \sum _k(\delta _{ik} \cdot 0 - 0 \cdot \delta _{jk}) = 0\), and \(\{ p_i, p_j\} = 0\) similarly. For the mixed bracket: \(\{ q_i, p_j\} = \sum _k \delta _{ik}\delta _{jk} = \delta _{ij}\).
Away from collision, the following four identities hold:
\(L \cdot A = 0\) (\(L\) and \(A\) are perpendicular)
\(A \in \mathrm{span}\{ q, p\} \) (\(A\) lies in the orbital plane)
\(A \cdot q = \| L\| ^2 - m\mu r\)
\(\| A\| ^2 = (m\mu )^2 + 2mH\| L\| ^2\).
(i). Since \(L = q \times p\), the vector \(L\) is perpendicular to both \(q\) and \(p\). Therefore:
(The scalar triple product \(L \cdot (p \times L) = (L \times p) \cdot L\) vanishes because a cross product is perpendicular to both factors.)
(ii). Apply the BAC-CAB identity to \(p \times L = p \times (q \times p)\):
Both terms are scalar multiples of \(q\) and \(p\), so \(A = p \times L - (m\mu /r)q \in \mathrm{span}\{ q, p\} \).
(iii). Use the cyclic property of the scalar triple product:
Therefore:
(iv). Since \(p \perp L\), we have \(\| p \times L\| ^2 = \| p\| ^2\| L\| ^2\). Using the result of (iii) for the cross term:
For \(H {\lt} 0\), \(L \neq 0\), and \(A \neq 0\), the orbit in orthonormal coordinates \((x, y)\) in \(\Pi \) (with \(x\)-axis along \(A\)) satisfies
If \(A = 0\), the orbit is a circle of radius \(a\).
Start from the polar equation and write \(x = r\cos \theta \) in the chosen coordinates. Then
Since \(r^2 = x^2 + y^2\), squaring both sides removes the remaining polar variable:
Move everything to one side:
Because \(H {\lt} 0\), Proposition 24 gives \(0 \le e {\lt} 1\), so \(1-e^2 {\gt} 0\) and we may divide by it. Then complete the square in \(x\). Using \(a = \ell /(1-e^2)\) and \(b^2 = a\ell \), we obtain
Assume \(H {\lt} 0\) and \(L \neq 0\). Then:
Planarity. The position \(q(t)\) and momentum \(p(t)\) stay in the fixed plane \(\Pi = \{ y \in \mathbb {R}^3 : L \cdot y = 0\} \). The Lenz vector \(A\) also lies in \(\Pi \).
\(0 \le e {\lt} 1\), \(a {\gt} 0\), \(b {\gt} 0\).
\(b^2 = a\ell \).
\(b = a\sqrt{1-e^2}\).
\(\sqrt{a^2 - b^2} = ae\) (focal distance formula).
(a). Since \(L = q \times p\), the vector \(L\) is perpendicular to both \(q\) and \(p\) at every instant. Because \(L\) is constant, the plane \(\Pi \) is fixed, and \(q(t)\) stays in it for all time. The vector \(A\) lies in \(\Pi \) by identity (i): \(L \cdot A = 0\).
(b). From identity (iv):
Since \(H {\lt} 0\) and \(\| L\| {\gt} 0\), we get \(e^2 {\lt} 1\), so \(0 \le e {\lt} 1\). Since \(H {\lt} 0\), \(a = -\mu /(2H) {\gt} 0\). Also \(1-e^2 {\gt} 0\), so \(\sqrt{1-e^2} {\gt} 0\). Hence \(b = a\sqrt{1-e^2} {\gt} 0\).
(c). Substitute the definitions:
(d). From (b) and (c): \(e^2 = 1 - \ell /a = 1 - b^2/a^2\), so \(b^2 = a^2(1-e^2)\), giving \(b = a\sqrt{1-e^2}\).
(e). From (d): \(a^2 - b^2 = a^2 - a^2(1-e^2) = a^2 e^2\), so \(\sqrt{a^2-b^2} = ae\).
For a bounded orbit with configuration period \(T\), the orbit traces the ellipse exactly once over \([0, T]\).
Lean proves that there are orthonormal vectors \(e_A\) and \(e_B\) in the orbital plane and a function \(\theta (t)\) such that
So \(\theta (t)\) parametrizes the ellipse. The same theorem also gives a derivative formula for \(\theta \). Since \(x(t) = a(\cos \theta (t) - e)\) and \(y(t) = b\sin \theta (t)\) together determine \(q(t)\) in the orbital plane, the angle \(\theta (t)\) determines \(q(t)\). Note that for the eccentric anomaly, \(r(t) = a(1 - e\cos \theta (t))\).
The Lean theorem gives the areal velocity formula in terms of \(\theta \):
The second law says this equals the positive constant \(\| L\| /(2m)\). Since \(0 \le e {\lt} 1\), we have \(1 - e\cos \theta {\gt} 0\), so it follows that
So \(\theta (t)\) is strictly increasing. The orbit is traversed in one direction.
The Lean theorem also states that \(\theta (T) = \theta (0) + 2\pi \). Therefore \(q\) traces the ellipse exactly once.
Along every solution, \(H = \| p\| ^2/(2m) - \mu /r\) is constant.
Differentiate \(H\) along the trajectory:
Substitute \(\dot p = -\mu q/r^3\) and \(\dot r = (q \cdot p)/(mr)\):
Two time intervals of the same length sweep the same area.
The swept area on \([t_1, t_1 + \Delta t]\) equals \((\| L\| /2m) \cdot \Delta t\), which depends only on the length \(\Delta t\). So any two intervals of length \(\Delta t\) sweep equal areas.
For the noncircular case, the ellipse \(\tfrac {(x+ae)^2}{a^2} + \tfrac {y^2}{b^2} = 1\) has its center at \((-ae, 0)\). The focal half-distance is \(\sqrt{a^2 - b^2} = ae\), so the two foci are at \((-ae \pm ae, 0)\), that is \((0, 0)\) and \((-2ae, 0)\). The origin is a focus.
The center is at \((-ae, 0)\) by inspection of the equation. The focal half-distance is \(\sqrt{a^2 - b^2} = ae\) by Proposition 24(e). Moving from the center by \(\pm ae\) along the \(x\)-axis gives foci at \((0, 0)\) and \((-2ae, 0)\). So the origin is a focus.
For a non-collision solution with \(H {\lt} 0\) and \(L \neq 0\):
The orbit lies in the plane \(\Pi \) perpendicular to \(L\).
The eccentricity satisfies \(0 \le e {\lt} 1\) and the parameters \(e, \ell , a, b\) are given by the formulas in Definition 23.
The orbit satisfies the polar equation \(r = \ell /(1 + e\cos \theta )\).
The orbit is an ellipse with the origin at one focus (or a circle of radius \(a\) if \(A = 0\)).
On the shell \(H = h {\lt} 0\), the map
is an \(\mathfrak {so}(4)\)-comoment map for the bracket of Proposition 20.
We check directly that \(\{ \mu _h(u,v), \mu _h(u',v')\} = \mu _h([(u,v),(u',v')])\). The only subtlety is that the formula
holds on the full phase space, where \(H\) is still a function. We may replace \(H\) by the constant \(h\) only after restricting to the surface \(M_h\).
From Theorem 18 and the definition of \(K_u\):
Also \(\{ L_u, K_v\} = K_{u \times v}\) because \(L_u\) rotates \(A\) (and hence \(K\)) by the same formula as for \(L\). Therefore:
Away from collision, for all \(u, v \in \mathbb {R}^3\):
Both \(L_u\) and \(A_u\) Poisson-commute with \(H\).
The first identity is Theorem 17. The second follows because \(A\) is built from \(q\) and \(p\) by vector operations that commute with rotations. Since \(L_u\) rotates both \(q\) and \(p\), it also rotates every vector expression built from them. Explicitly, using the same Leibniz argument: \(\{ L_u, A\} = u \times A\), so \(\{ L_u, A_v\} = v \cdot (u \times A) = (u \times v) \cdot A = A_{u \times v}\).
For the third identity, \(\{ A_u, A_v\} = -2mH\, L_{u \times v}\), we expand \(A = F - m\mu U\). The right-hand side still contains the energy function \(H\). Only after restricting to a fixed-energy surface will that coefficient become a constant.
Write \(A = F - m\mu U\) where
For constant \(u\), let \(F_u = u \cdot F\) and \(U_u = u \cdot U\). Expand:
So we compute three smaller brackets.
Step 1: \(\{ F_u, F_v\} = -\| p\| ^2 L_{u \times v}\). Write \(F_u = \| p\| ^2 X_u - (q \cdot p) P_u\) where \(X_u = u \cdot q\) and \(P_u = u \cdot p\). From the canonical brackets and the Leibniz rule:
Expanding \(\{ F_u, F_v\} \) by bilinearity and collecting all terms gives:
Step 2: \(\{ F_u, U_v\} + \{ U_u, F_v\} = -(2/r)\, L_{u \times v}\). Using the Leibniz rule on \(U_v = X_v/r\):
Computing \(\{ F_u, X_v\} \) and \(\{ F_u, r\} \) from the Leibniz rule, then adding the antisymmetric term \(\{ U_u, F_v\} \), all remaining terms cancel except:
Step 3: \(\{ U_u, U_v\} = 0\). Both \(U_u\) and \(U_v\) depend only on \(q\), so all \(p\)-derivatives vanish and the bracket is zero.
Combining:
Along every solution, \(A = p \times L - (m\mu /r)q\) is constant.
Differentiate \(A = p \times L - (m\mu /r)q\). Since \(L\) is constant (\(\dot L = 0\)):
First term. Substitute \(\dot p = -(\mu /r^3)q\). Apply the BAC-CAB identity \(a \times (b \times c) = b(a \cdot c) - c(a \cdot b)\) to expand:
Therefore:
Second term. Since \(r^2 = \| q\| ^2\), differentiating gives \(\dot r = (q \cdot \dot q)/r = (q \cdot p)/(mr)\). Therefore:
Multiplying by \(m\mu \):
This equals the first term exactly, so they cancel:
On \(\mathbb {R}^3 \oplus \mathbb {R}^3\), define the bracket:
This is isomorphic to \(\mathfrak {so}(4)\) via the linear bijection \(\Phi \colon \mathbb {R}^3 \oplus \mathbb {R}^3 \to \mathfrak {so}(4)\):
The matrix \(\Phi (u, v)\) is antisymmetric, so it lies in \(\mathfrak {so}(4)\). The map \(\Phi \) is linear and bijective (both spaces are six-dimensional). A direct matrix multiplication verifies the bracket identity \([\Phi (u,v), \Phi (u',v')] = \Phi ([(u,v),(u',v')])\), so \(\Phi \) is a Lie algebra isomorphism.
For every solution, \(\dot p = -\mu q/\| q\| ^3\).
Again \(\gamma '(t)\) equals the Kepler vector field at \(\gamma (t)\), so taking the second component gives the momentum equation. In the formalization the vector field is defined by cases so that it makes sense on all of \(\mathbb {R}^3 \times \mathbb {R}^3\). However, a non-collision solution never reaches \(q = 0\), so along the trajectory we are always in the branch
If \(L \neq 0\), then in polar coordinates in the orbital plane \(\Pi \), with the polar axis along \(A\), the orbit satisfies
If \(A = 0\) then \(e = 0\) and \(r = \ell \) (constant), so the orbit is a circle.
The starting point is identity (iii) from Proposition 12:
Assume first that \(A \neq 0\). Choose polar coordinates in the fixed orbital plane so that the polar axis points along \(A\). If \(\theta \) is the angle between \(q\) and \(A\), then
Substituting this into the identity above gives
Now collect the terms involving \(r\):
Finally divide numerator and denominator by \(m\mu \):
This is the standard polar equation of a conic with focus at the origin.
If \(A = 0\), then \(e = 0\) and the same identity reduces to
so \(r = \ell \) is constant. Thus the orbit is the circular special case.
The Kepler equations of motion arise from the Poisson bracket with \(H\):
By the canonical brackets and the chain rule:
For the momentum bracket:
For every solution, \(q \times \dot p = 0\).
By the momentum equation, \(\dot p = (-\mu /\| q\| ^3)q\) is a scalar multiple of \(q\). The cross product of any vector with a scalar multiple of itself is zero:
The map \(u \mapsto L_u\) is an \(\mathfrak {so}(3)\)-comoment map. Concretely:
We first identify the infinitesimal motion generated by \(L_u\). Since
its derivative with respect to \(p_i\) is \((u \times q)_i\). Therefore
So the Hamiltonian flow of \(L_u\) rotates the position vector about the axis \(u\).
Next rewrite \(L_u\) as
Then
and therefore
So the same Hamiltonian simultaneously rotates the momentum vector about the same axis.
This proves the infinitesimal symmetry statement. To prove the comoment property, we must show that Poisson brackets of the functions \(L_u\) reproduce the Lie bracket on \(\mathfrak {so}(3)\). Apply the Leibniz rule to \(L = q \times p\):
Therefore \(\{ L_u, L_v\} = v \cdot \{ L_u, L\} = v \cdot (u \times L) = (u \times v) \cdot L = L_{u \times v}\). That is exactly the Lie bracket relation for \(\mathfrak {so}(3) \cong \mathbb {R}^3\) with bracket \(u \times v\).
Finally, \(L_u\) is conserved because it generates rotations and the Hamiltonian only depends on the rotation-invariant quantities \(\| p\| ^2\) and \(r = \| q\| \). Indeed,
Hence \(\{ H, L_u\} = 0\), and by antisymmetry also \(\{ L_u, H\} = 0\).
If \(L \neq 0\), the signed areal velocity is constant and equal to \(\| L\| /(2m)\).
By definition,
where \(\hat L_0 = L(0)/\| L(0)\| \). By conservation of angular momentum, \(q(t) \times p(t) = L(0)\). Therefore
For the same solution, the map \(u \mapsto L_u\) is an \(\mathfrak {so}(3)\)-comoment (Theorem 17), and on the shell \(H = h {\lt} 0\) the map \((u, v) \mapsto L_u + K_v\) is an \(\mathfrak {so}(4)\)-comoment (Theorem 21). Combining these statements with Theorem 36 gives the full comoment-map proof of Kepler’s laws.
For an orbital period \(T\),
By the second law, the area swept over one period is:
By Theorem 34, this equals \(\pi ab\). So:
Squaring:
From Proposition 24(c), \(b^2 = a\ell = a\| L\| ^2/(m\mu )\), so \(\| L\| ^2 = mb^2\mu /a\). Substituting:
Let \(\gamma \) be a non-collision solution with \(H {\lt} 0\) and \(L \neq 0\). Then:
First law. The orbit lies in the plane \(\Pi \) perpendicular to \(L\) and is an ellipse with the origin at one focus. Its eccentricity is \(e = \| A\| /(m\mu ) {\lt} 1\), its semi-major axis is \(a = -\mu /(2H)\), and its semi-latus rectum is \(\ell = \| L\| ^2/(m\mu )\).
Second law. The areal velocity \(d\mathcal{A}/dt = \| L\| /(2m)\) is constant, so equal areas are swept in equal times.
Third law. Every configuration period \(T\) satisfies \(T^2 = (4\pi ^2 m/\mu )\, a^3\).
On the shell \(M_h\), define:
Then \(J^+\) and \(J^-\) each satisfy the \(\mathfrak {so}(3)\) bracket, and they commute with each other:
So \(\mathfrak {so}(4) \cong \mathfrak {so}(3) \oplus \mathfrak {so}(3)\) via the splitting \((L, K) \mapsto (J^+, J^-)\).
For the commutator, expand using the brackets from Theorem 21:
The self-brackets \(\{ J^+_u, J^+_v\} = J^+_{u \times v}\) and \(\{ J^-_u, J^-_v\} = J^-_{u \times v}\) follow similarly.