Möbius transformation

Möbius transformations should not be confused with the Möbius transform or the Möbius function.

In geometry, a Möbius transformation is a rational function of the form:

f(z) = \frac{a z + b}{c z + d}

where z, a, b, c, d are complex numbers satisfying adbc ≠ 0. Möbius transformations are named in honor of August Ferdinand Möbius, although they are also called homographic transformations, linear fractional transformations, or fractional linear transformations.

Contents

Overview

A Möbius transformation is a bijective conformal map of the extended complex plane (i.e. the complex plane augmented by the point at infinity, also known as Riemann sphere):

\widehat{\mathbb{C}} = \mathbb{C}\cup\{\infty\}.

The set of all Möbius transformations forms a group under composition called the Möbius group.

The Möbius group is the automorphism group of the Riemann sphere, sometimes denoted

\mbox{Aut}(\widehat\mathbb C).

Certain subgroups of the Möbius group form the automorphism groups of the other simply-connected Riemann surfaces (the complex plane and the hyperbolic plane). As such, Möbius transformations play an important role in the theory of Riemann surfaces. The fundamental group of every Riemann surface is a discrete subgroup of the Möbius group (see Fuchsian group and Kleinian group). Möbius transformations are also closely related to isometries of hyperbolic 3-manifolds.

A particularly important subgroup of the Möbius group is the modular group; it is central to the theory of many fractals, modular forms, elliptic curves and Pellian equations.

In physics, the identity component of the Lorentz group acts on the celestial sphere the same way that the Möbius group acts on the Riemann sphere. In fact, these two groups are isomorphic. An observer who accelerates to relativistic velocities will see the pattern of constellations as seen near the Earth continuously transform according to infinitesimal Möbius transformations. This observation is often taken as the starting point of twistor theory.

Definition

The general form of a Möbius transformation is given by

z \mapsto \frac{a z + b}{c z + d}

where a, b, c, d are any complex numbers satisfying adbc ≠ 0. (If ad = bc the rational function defined above is a constant.) This definition can be extended to the whole Riemann sphere (the complex plane plus the point at infinity).

The set of all Möbius transformations forms a group under composition. This group can be given the structure of a complex manifold in such a way that composition and inversion are holomorphic maps. The Möbius group is then a complex Lie group. The Möbius group is usually denoted \mbox{Aut}(\widehat\mathbb C) as it is the automorphism group of the Riemann sphere.

Decomposition and elementary properties

A Möbius transformation is equivalent to a sequence of simpler transformations. Let:

then these functions can be composed on each other, giving

 f_4\circ f_3\circ f_2\circ f_1 (z)= \frac{az+b}{cz+d}.\!

This decomposition makes many properties of the Möbius transform obvious.

For example, the preservation of angles is reduced to proving the angle preservation property of circle inversion, since all other transformation are dilations or isometries, which trivially preserve angles.

The existence of an inverse Möbius transformation function and its explicit formula is easily derived by a composition of the inverse function of the simpler transformations. That is, define functions g_1, g_2, g_3, g_4 such that g_i is the inverse of f_i, then composition g_1\circ g_2\circ g_3\circ g_4 (z) would be the explicit expression for the inverse Möbius transformation:

\frac{dz-b}{-cz+a}

From this decomposition, we also see that Möbius transformation carries over all non-trivial properties of circle inversion. Namely, that circles are mapped to circles, and angles are preserved. Also, because of the circle inversion, is carried over the convenience of defining Möbius transformation over a plane with a point at infinity, which makes statements and concepts of Möbius transformation's properties simpler.

For another example, look at f_3. If ad-bc= 0, then the transformation collapses to the point 0, then f_4 moves to a/c. Collapsing to a point is not an interesting transformation, thus we require in the definition of Möbius transformation that ad-bc \ne 0.

Preservation of angles and circles

As seen from the above decomposition, Möbius transformation contains this transformation 1/z, called complex inversion. Geometrically, a complex inversion is a circle inversion followed by a reflection around the x-axis.

In circle inversion, circles are mapped to circles (here, lines are considered as circles with infinite radius), and angles are preserved. See circle inversion for various properties and proofs.

Cross-ratio preservation

The cross-ratio preservation theorem states that the cross-ratio


\frac{(z_1-z_3)(z_2-z_4)}{(z_1-z_4)(z_2-z_3)} =
\frac{(w_1-w_3)(w_2-w_4)}{(w_1-w_4)(w_2-w_3)}

is invariant under a Möbius transformation that maps from z to w.

The action of the Möbius group on the Riemann sphere is sharply 3-transitive in the sense that there is a unique Möbius transformation which takes any three distinct points on the Riemann sphere to any other set of three distinct points. See the section below on specifying a transformation by three points.

Projective matrix representations

The transformation

f(z) = \frac{a z + b}{c z + d}

is determined by the matrix of complex numbers

\mathfrak{H} = \begin{pmatrix} a & b \\ c & d \end{pmatrix}.

The condition adbc ≠ 0 is equivalent to the condition that the determinant of above matrix be nonzero (i.e. the matrix should be non-singular). Note that any matrix obtained by multiplying \mathfrak H by a complex scalar λ determines the same transformation, so the transformation does not determine the matrix. The freedom of choosing the matrix for a given transformation can be restricted by requiring that the determinant of \mathfrak H be equal to 1; then \mathfrak H will be unique up to sign.

The usefulness of using matrices to describe Möbius transformations stems from the fact that matrix multiplication gives rise to composition of the corresponding Möbius transformations. In other words, the map

\pi\colon \mbox{GL}(2,\mathbb C) \to \mbox{Aut}(\widehat\mathbb C)

from the general linear group GL(2,C) to the Möbius group, which sends the matrix \mathfrak{H} to the transformation f is a group homomorphism. This map is called a projective representation of GL(2,C) for reasons explained below.

The map \pi is not an isomorphism, since it maps any scalar multiple of \mathfrak{H} to the same transformation. The kernel of this homomorphism is then the set of all scalar multiples of the identity matrix, which is the center Z(GL(2,C)) of GL(2,C). The quotient group GL(2,C)/Z(GL(2,C)) is called the projective linear group and is usually denoted PGL(2,C). By the first isomorphism theorem of group theory we conclude that the Möbius group is isomorphic to PGL(2,C). Since Z(GL(2,C)) is the kernel of the group action given by GL(2,C) acting on itself by conjugation, PGL(2, C) is isomorphic to the inner automorphism group of GL(2,C). Moreover, the natural action of PGL(2,C) on the complex projective line CP1 is exactly the natural action of the Möbius group on the Riemann sphere, where the projective line CP1 and the Riemann sphere are identified as follows:

[z_1�: z_2]\leftrightarrow z_1/z_2.

Here [z1:z2] are homogeneous coordinates on CP1; the point [1:0] corresponds to the point ∞ of the Riemann sphere.

If one restricts \mathfrak{H} to matrices of determinant one, the map \pi restricts to a surjective map from the special linear group SL(2,C) to the Möbius group; in the restricted setting the kernel (formed by plus and minus the identity) is still the center of the group. The Möbius group is therefore also isomorphic to PSL(2,C). We then have the following isomorphisms:

\mbox{Aut}(\widehat\mathbb C) \cong \mbox{PGL}(2,\mathbb C) \cong \mbox{PSL}(2,\mathbb C).

From the last identification we see that the Möbius group is a 3-dimensional complex Lie group (or a 6-dimensional real Lie group).

Note that there are precisely two matrices with unit determinant which can be used to represent any given Möbius transformation. That is, SL(2,C) is a double cover of PSL(2,C). Since SL(2,C) is simply-connected it is the universal cover of the Möbius group. Therefore the fundamental group of the Möbius group is Z2.

Classification

Möbius transformations are commonly classified into four types, parabolic, elliptic, hyperbolic and loxodromic (actually hyperbolic is a special case of loxodromic). The classification has both algebraic and geometric significance. Geometrically, the different types result in different transformations of the complex plane, as the figures below illustrate. These types can be distinguished by looking at the trace \mbox{tr}\,\mathfrak{H}=a+d. Note that the trace is invariant under conjugation, that is,

\mbox{tr}\,\mathfrak{GHG}^{-1} = \mbox{tr}\,\mathfrak{H},

and so every member of a conjugacy class will have the same trace. Every Möbius transformation can be written such that its representing matrix \mathfrak{H} has determinant one (by multiplying the entries with a suitable scalar). Two Möbius transformations  \mathfrak{H}, \mathfrak{H}' (both not equal to the identity transform) with  \det \mathfrak{H}=\det\mathfrak{H}'=1 are conjugate if and only if  \mbox{tr}^2\,\mathfrak{H}= \mbox{tr}^2\,\mathfrak{H}' .

In the following discussion we will always assume that the representing matrix  \mathfrak{H} is normalized such that  \det{\mathfrak{H}}=ad-bc=1 .

Parabolic transforms

A non-identity Möbius transformation defined by a matrix \mathfrak{H} of determinant one is said to be parabolic if

\mbox{tr}^2\mathfrak{H} = (a+d)^2 = 4

(so the trace is plus or minus 2; either can occur for a given transformation since \mathfrak{H} is determined only up to sign). In fact one of the choices for \mathfrak{H} has the same characteristic polynomial X2−2X+1 as the identity matrix, and is therefore unipotent. A Möbius transform is parabolic if and only if it has exactly one fixed point in the compactified complex plane \widehat{\mathbb{C}}=\mathbb{C}\cup\{\infty\}, which happens if and only if it can be defined by a matrix conjugate to

\begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}.

The set of all parabolic Möbius transformations with a given fixed point in \widehat{\mathbb{C}}, together with the identity, forms a subgroup isomorphic to the group of matrices

\begin{pmatrix} 1 & b \\ 0 & 1 \end{pmatrix}

for b\in\mathbb C; this is an example of a Borel subgroup (of the Möbius group, or of SL(2,C) for the matrix group; the notion is defined for any reductive Lie group).

All other non-identity transformations have two fixed points. All non-parabolic (non-identity) transformations are defined by a matrix conjugate to

\begin{pmatrix} \lambda & 0 \\ 0 & \lambda^{-1} \end{pmatrix}

with \lambda not equal to 0, 1 or −1. The square k=\lambda^2 is called the characteristic constant or multiplier of the transformation.

Elliptic transforms

The transformation is said to be elliptic if it can be represented by a matrix \mathfrak H whose trace is real with

0 \le \mbox{tr}^2\mathfrak{H} < 4.\,

A transform is elliptic if and only if |\lambda|=1. Writing \lambda=e^{i\alpha}, an elliptic transform is conjugate to

\begin{pmatrix} \cos\alpha & \sin\alpha \\ 
                      -\sin\alpha & \cos\alpha \end{pmatrix}

with \alpha real. Note that for any \mathfrak{H} with characteristic constant k, the characteristic constant of \mathfrak{H}^n is k^n. Thus, the only Möbius transformations of finite order are the elliptic transformations, and these only when λ is a root of unity; equivalently, when α is a rational multiple of π.

Hyperbolic transforms

The transform is said to be hyperbolic if it can be represented by a matrix \mathfrak H whose trace is real with

\mbox{tr}^2\mathfrak{H} > 4.\,

A transform is hyperbolic if and only if λ is real and positive.

Loxodromic transforms

The transform is said to be loxodromic if \mbox{tr}^2\mathfrak{H} is not in [0,∞). A transformation is loxodromic if and only if |\lambda|\ne 1. Historically, navigation by loxodrome or rhumb line refers to a path of constant bearing; the resulting path is a logarithmic spiral, similar in shape to the transformations of the complex plane that a loxodromic Möbius transformation makes. See the geometric figures below.

Transformation Trace squared Multipliers Class representative
Elliptic 0 \leq \sigma < 4 |k| = 1
k = e^{\pm i\theta} \neq 1
\begin{pmatrix}e^{i\theta/2} & 0 \\ 0 & e^{-i\theta/2}\end{pmatrix} z\mapsto e^{i\theta}z
Parabolic \sigma = 4 k = 1 \begin{pmatrix}1 & a \\ 0 & 1\end{pmatrix} z\mapsto z + a
Hyperbolic 4 < \sigma < \infty k \in \mathbb R^{+}
k = e^{\pm \theta} \neq 1
\begin{pmatrix}e^{\theta/2} & 0 \\ 0 & e^{-\theta/2}\end{pmatrix} z \mapsto e^\theta z
Loxodromic \sigma\in\mathbb C, \sigma \not\in [0,\infty) |k| \neq 1, k\not\in\mathbb{R}^+
k = \lambda^{2}, \lambda^{-2}
\begin{pmatrix}\lambda & 0 \\ 0 & \lambda^{-1}\end{pmatrix} z \mapsto k z

Fixed points

Every non-identity Möbius transformation has two fixed points \gamma_1, \gamma_2 on the Riemann sphere. Note that the fixed points are counted here with multiplicity; for parabolic transformations, the fixed points coincide. Either or both of these fixed points may be the point at infinity.

The fixed points of the transformation

f(z) = \frac{az + b}{cz + d}

are obtained by solving the fixed point equation f(\gamma) = \gamma. For c\ne 0, this has two roots (proof):

\gamma_{1,2} = \frac{(a - d) \pm \sqrt{(a-d)^2 + 4bc}}{2c} = \frac{(a - d) \pm \sqrt{(a+d)^2 - 4(ad-bc)}}{2c}.

Note that for parabolic transformations, which satisfy (a+d)^2 = 4(ad-bc), the fixed points coincide.

When c = 0, one of the fixed points is at infinity; the other is given by

\gamma=-\frac{b}{a-d}.

The transformation will be a simple transformation composed of translations, rotations, and dilations:

z \mapsto \alpha z + \beta.\,

If c = 0 and a = d, then both fixed points are at infinity, and the Möbius transformation corresponds to a pure translation: z \mapsto z + \beta.

Normal form

Möbius transformations are also sometimes written in terms of their fixed points in so-called normal form. We first treat the non-parabolic case, for which there are two distinct fixed points.

Non-parabolic case:

Every non-parabolic transformation is conjugate to a dilation, i.e. a transformation of the form

z \mapsto k z

with fixed points at 0 and ∞. To see this define a map

g(z) = \frac{z - \gamma_1}{z - \gamma_2}

which sends the points (\gamma_1, \gamma_2) to (0,\infty). Here we assume that both \gamma_1 and \gamma_2 are finite. If one of them is already at infinity then g can be modified so as to fix infinity and send the other point to 0.

If f has distinct fixed points (\gamma_1, \gamma_2) then the transformation gfg^{-1} has fixed points at 0 and ∞ and is therefore a dilation: gfg^{-1}(z) = kz. The fixed point equation for the transformation f can then be written

\frac{f(z)-\gamma_1}{f(z)-\gamma_2} = k \frac{z-\gamma_1}{z-\gamma_2}.

Solving for f gives (in matrix form):

\mathfrak{H}(k; \gamma_1, \gamma_2) =
\begin{pmatrix}
 \gamma_1 - k\gamma_2 & (k - 1) \gamma_1\gamma_2 \\
 1 - k                &  k\gamma_1 - \gamma_2
\end{pmatrix}

or, if one of the fixed points is at infinity:

\mathfrak{H}(k; \gamma, \infty) =
\begin{pmatrix}
 k & (1 - k) \gamma \\
 0 &  1
\end{pmatrix}.

From the above expressions one can calculate the derivatives of f at the fixed points:

f'(\gamma_1)= k\, and f'(\gamma_2)= 1/k.\,

Observe that, given an ordering of the fixed points, we can distinguish one of the multipliers (k) of f as the characteristic constant of f. Reversing the order of the fixed points is equivalent to taking the inverse multiplier for the characteristic constant:

\mathfrak{H}(k; \gamma_1, \gamma_2) = \mathfrak{H}(1/k; \gamma_2, \gamma_1).

For loxodromic transformations, whenever |k|>1, one says that \gamma_1 is the repulsive fixed point, and \gamma_2 is the attractive fixed point. For |k|<1, the roles are reversed.

Parabolic case:

In the parabolic case there is only one fixed point \gamma. The transformation sending that point to ∞ is

g(z) = \frac{1}{z - \gamma}

or the identity if \gamma is already at infinity. The transformation gfg^{-1} fixes infinity and is therefore a translation:

gfg^{-1}(z) = z + \beta\,.

Here, β is called the translation length. The fixed point formula for a parabolic transformation is then

\frac{1}{f(z)-\gamma} = \frac{1}{z-\gamma} + \beta.

Solving for f (in matrix form) gives

\mathfrak{H}(\beta; \gamma) =
\begin{pmatrix}
   1+\gamma\beta    & - \beta \gamma^2  \\
   \beta            &   1-\gamma \beta 
\end{pmatrix}

or, if \gamma = \infty:

\mathfrak{H}(\beta; \infty) =
\begin{pmatrix}
   1 & \beta  \\
   0 & 1 
\end{pmatrix}

Note that \beta is not the characteristic constant of f, which is always 1 for a parabolic transformation. From the above expressions one can calculate:

f'(\gamma) = 1.\,

Geometric interpretation of the characteristic constant

The following picture depicts (after stereographic transformation from the sphere to the plane) the two fixed points of a Möbius transformation in the non-parabolic case:

Mobius Identity.jpeg

The characteristic constant can be expressed in terms of its logarithm:

e^{\rho + \alpha i} = k. \;

When expressed in this way, the real number \rho becomes an expansion factor. It indicates how repulsive the fixed point \gamma_1 is, and how attractive \gamma_2 is. The real number \alpha is a rotation factor, indicating to what extent the transform rotates the plane anti-clockwise about \gamma_1 and clockwise about \gamma_2.

Elliptic transformations

If \rho = 0, then the fixed points are neither attractive nor repulsive but indifferent, and the transformation is said to be elliptical. These transformations tend to move all points in circles around the two fixed points. If one of the fixed points is at infinity, this is equivalent to doing an affine rotation around a point.

If we take the one-parameter subgroup generated by any elliptic Möbius transformation, we obtain a continuous transformation, such that every transformation in the subgroup fixes the same two points. All other points flow along a family of circles which is nested between the two fixed points on the Riemann sphere. In general, the two fixed points can be any two distinct points.

This has an important physical interpretation. Imagine that some observer rotates with constant angular velocity about some axis. Then we can take the two fixed points to be the North and South poles of the celestial sphere. The appearance of the night sky is now transformed continuously in exactly the manner described by the one-parameter subgroup of elliptic transformations sharing the fixed points 0, \infty, and with the number \alpha corresponding to the constant angular velocity of our observer.

Here are some figures illustrating the effect of an elliptic Möbius transformation on the Riemann sphere (after stereographic projection to the plane):

Mobius Small Neg Elliptical.jpeg
Mobius Large Pos Elliptical.jpeg

These pictures illustrate the effect of a single Möbius transformation. The one-parameter subgroup which it generates continuously moves points along the family of circular arcs suggested by the pictures.

Hyperbolic transformations

If \alpha is zero (or a multiple of 2 \pi), then the transformation is said to be hyperbolic. These transformations tend to move points along circular paths from one fixed point toward the other.

If we take the one-parameter subgroup generated by any hyperbolic Möbius transformation, we obtain a continuous transformation, such that every transformation in the subgroup fixes the same two points. All other points flow along a certain family of circular arcs away from the first fixed point and toward the second fixed point. In general, the two fixed points may be any two distinct points on the Riemann sphere.

This too has an important physical interpretation. Imagine that an observer accelerates (with constant magnitude of acceleration) in the direction of the North pole on his celestial sphere. Then the appearance of the night sky is transformed in exactly the manner described by the one-parameter subgroup of hyperbolic transformations sharing the fixed points 0,\infty, with the real number \rho corresponding to the magnitude of his acceleration vector. The stars seem to move along longitudes, away from the South pole toward the North pole. (The longitudes appear as circular arcs under stereographic projection from the sphere to the plane).

Here are some figures illustrating the effect of a hyperbolic Möbius transformation on the Riemann sphere (after stereographic projection to the plane):

Mobius Small Neg Hyperbolic.jpeg
Mobius Large Pos Hyperbolic.jpeg

These pictures resemble the field lines of bar magnets, because the circular flow lines subtend a constant angle between the two fixed points.

Loxodromic transformations

If both ρ and α are nonzero, then the transformation is said to be loxodromic. These transformations tend to move all points in S-shaped paths from one fixed point to the other.

The word "loxodrome" is from the Greek: "λοξος (loxos), slanting + δρόμος (dromos), course". When sailing on a constant bearing - if you maintain a heading of (say) north-east, you will eventually wind up sailing around the north pole in a logarithmic spiral. On the mercator projection such a course is a straight line, as the north and south poles project to infinity. The angle that the loxodrome subtends relative to the lines of longitude (i.e. its slope, the "tightness" of the spiral) is the argument of k. Of course, Möbius transformations may have their two fixed points anywhere, not just at the north and south poles. But any loxodromic transformation will be conjugate to a transform that moves all points along such loxodromes.

If we take the one-parameter subgroup generated by any loxodromic Möbius transformation, we obtain a continuous transformation, such that every transformation in the subgroup fixes the same two points. All other points flow along a certain family of curves, away from the first fixed point and toward the second fixed point. Unlike the hyperbolic case, these curves are not circular arcs, but certain curves which under stereographic projection from the sphere to the plane appear as spiral curves which twist counterclockwise infinitely often around one fixed point and twist clockwise infinitely often around the other fixed point. In general, the two fixed points may be any two distinct points on the Riemann sphere.

You can probably guess the physical interpretation in the case when the two fixed points are 0, \infty: an observer who is both rotating (with constant angular velocity) about some axis and boosting (with constant magnitude acceleration) along the same axis, will see the appearance of the night sky transform according to the one-parameter subgroup of loxodromic transformations with fixed points 0, \infty, and with \rho, \alpha determined respectively by the magnitude of acceleration and angular velocity.

Stereographic projection

These images show Möbius transformations stereographically projected onto the Riemann sphere. Note in particular that when projected onto a sphere, the special case of a fixed point at infinity looks no different from having the fixed points in an arbitrary location.

Elliptic Hyperbolic Loxodromic
One fixed point at infinity Mob3d-elip-inf-200.png Mob3d-hyp-inf-200.png Mob3d-lox-inf-200.png
Full size Full size Full size
Fixed points diametrically opposite Mob3d-elip-opp-200.png Mob3d-hyp-opp-200.png Mob3d-lox-opp-200.png
Full size Full size Full size
Fixed points in an arbitrary location Mob3d-elip-arb-200.png Mob3d-hyp-arb-200.png Mob3d-lox-arb-200.png
Full size Full size Full size

Iterating a transformation

If a transformation \mathfrak{H} has fixed points \gamma_1, \gamma_2, and characteristic constant k, then \mathfrak{H}' = \mathfrak{H}^n will have \gamma_1' = \gamma_1, \gamma_2' = \gamma_2, k' = k^n.

This can be used to iterate a transformation, or to animate one by breaking it up into steps.

These images show three points (red, blue and black) continuously iterated under transformations with various characteristic constants.

Mobius23621.jpeg Mobius23622.jpeg
Mobius23623.jpeg

And these images demonstrate what happens when you transform a circle under Hyperbolic, Elliptical, and Loxodromic transforms. Note that in the elliptical and loxodromic images, the α value is 1/10 .

IteratedHyperbolicTsfm.png IteratedEllipticalTsfm.png IteratedLoxodromicTsfm.png

Poles of the transformation

The point

z_\infty = - \frac{d}{c}

is called the pole of \mathfrak{H}; it is that point which is transformed to the point at infinity under \mathfrak{H}.

The inverse pole

Z_\infty = \frac{a}{c}

is that point to which the point at infinity is transformed. The point midway between the two poles is always the same as the point midway between the two fixed points:

\gamma_1 + \gamma_2 = z_\infty + Z_\infty.

These four points are the vertices of a parallelogram which is sometimes called the characteristic parallelogram of the transformation.

A transform \mathfrak{H} can be specified with two fixed points \gamma_1, \gamma_2 and the pole z_\infty.

\mathfrak{H} =
\begin{pmatrix}
  Z_\infty & - \gamma_1 \gamma_2 \\
  1        & - z_\infty
\end{pmatrix}, \;\;
  Z_\infty = \gamma_1 + \gamma_2 - z_\infty.

This allows us to derive a formula for conversion between k and z_\infty given \gamma_1, \gamma_2:

z_\infty = \frac{k \gamma_1 - \gamma_2}{1 - k}
k 
= \frac{\gamma_2 - z_\infty}{\gamma_1 - z_\infty}
= \frac{Z_\infty - \gamma_1}{Z_\infty - \gamma_2}
= \frac {a - c \gamma_1}{a - c \gamma_2},

which reduces down to

k = \frac{(a + d) + \sqrt {(a - d)^2 + 4 b c}}{(a + d) - \sqrt {(a - d)^2 + 4 b c}}.

The last expression coincides with one of the (mutually reciprocal) eigenvalue ratios  \lambda_1\over \lambda_2 of the matrix

\mathfrak{H} =
\begin{pmatrix}
  a & b \\
  c & d
\end{pmatrix}

representing the transform (compare the discussion in the preceding section about the characteristic constant of a transformation). Its characteristic polynomial is equal to


 \mbox{det} (\lambda I_2- \mathfrak{H})
=\lambda^2-\mbox{tr} \mathfrak{H}\,\lambda+
\mbox{det} \mathfrak{H}
=\lambda^2-(a+d)\lambda+(ad-bc)

which has roots

 \lambda_{i}=\frac{(a + d) \pm \sqrt {(a - d)^2 + 4 b c}}{2}=\frac{(a + d) \pm \sqrt {(a + d)^2 - 4(ad-b c)}}{2}=c\gamma_i+d \ .

Specifying a transformation by three points

Direct cross-ratio

Fixing three points of a cross-ratio, one obtains a transformation

cr(w)= \frac {(w-z_3)(z_2-z_4)}{(w-z_4)(z_2-z_3)}

This method allows to directly specify the anti-images of 1, 0, and ∞ respectively as z2, z3, and z4.

The function cr as defined above is obviously a Möbius transformation. It can be easily seen that any Möbius transformation can be written this way. Besides theoretical considerations, it is practical to note that specifying ∞ as one of the zi points cancels out the relevant pair of factors. For example, to get rid of the denominator, that is, to set coefficients c = 0 and d = 1, it is enough to choose ∞ as the anti-image of itself:

 \lim_{z_4 \to \infty} \frac {(w-z_3)(z_2-z_4)}{(w-z_4)(z_2-z_3)} = \frac {w-z_3}{z_2-z_3}

Direct approach

Any set of three points


  Z_1 = \mathfrak{H}(z_1), \;\;
  Z_2 = \mathfrak{H}(z_2), \;\;
  Z_3 = \mathfrak{H}(z_3)

uniquely defines a transformation \mathfrak{H}. To calculate this out, it is handy to make use of a transformation that is able to map three points onto (0,0), (1, 0) and the point at infinity.

\mathfrak{H}_1 = \begin{pmatrix} 
\frac{z_2 - z_3}{z_2 - z_1} & -z_1 \frac{z_2 - z_3}{z_2 - z_1} 
\\ 1 & -z_3 
\end{pmatrix}, \;\;
\mathfrak{H}_2 = \begin{pmatrix} 
\frac{Z_2 - Z_3}{Z_2 - Z_1} & -Z_1 \frac{Z_2 - Z_3}{Z_2 - Z_1} 
\\ 1 & -Z_3 
\end{pmatrix}

One can get rid of the infinities by multiplying out by z_2 - z_1 and Z_2 - Z_1 as previously noted.

\mathfrak{H}_1 = \begin{pmatrix} 
z_2 - z_3 & z_1 z_3 - z_1 z_2 
\\ z_2 - z_1 & z_1 z_3 - z_3 z_2 
\end{pmatrix}
, \;\;
\mathfrak{H}_2 = \begin{pmatrix} 
Z_2 - Z_3 & Z_1 Z_3 - Z_1 Z_2 
\\ Z_2 - Z_1 & Z_1 Z_3 - Z_3 Z_2 
\end{pmatrix}

The matrix \mathfrak{H} to map z_{1,2,3} onto Z_{1,2,3} then becomes

\mathfrak{H} = \mathfrak{H}_2^{-1} \mathfrak{H}_1

You can multiply this out, if you want, but if you are writing code then it's easier to use temporary variables for the middle terms.

Explicit determinant formula

The problem of constructing a Möbius transformation  \mathfrak{H}(z) mapping a triple  (z_1, z_2, z_3 ) to another triple ( w_1, w_2, w_3 ) is equivalent to finding the equation of a standard hyperbola

\, c wz -az+dw -b=0

in the (z,w)-plane passing through the points  (z_i, w_i ) . An explicit equation can be found by evaluating the determinant

\det \begin{pmatrix} zw & z & w & 1 \\ z_1w_1 & z_1 & w_1 & 1 \\   z_2w_2 & z_2 & w_2 & 1 \\   z_3w_3 & z_3 & w_3 & 1 
\end{pmatrix} 
\,

by means of a Laplace expansion along the first row. This results in the determinant formulae

a=\det \begin{pmatrix} z_1w_1 & w_1 & 1 \\   z_2w_2 & w_2 & 1 \\   z_3w_3 & w_3 & 1 
\end{pmatrix} 
\,
b=\det \begin{pmatrix} z_1w_1 & z_1 & w_1 \\   z_2w_2 & z_2 & w_2 \\   z_3w_3 & z_3 & w_3 
\end{pmatrix} 
\,
c=\det \begin{pmatrix} z_1 & w_1 & 1 \\   z_2 & w_2 & 1 \\   z_3 & w_3 & 1 
\end{pmatrix} 
\,
d=\det \begin{pmatrix} z_1w_1 & z_1 & 1 \\  z_2w_2 & z_2 & 1 \\   z_3w_3 & z_3 & 1 
\end{pmatrix}

for the coefficients  a,b,c,d of the representing matrix \, \mathfrak{H} =\begin{pmatrix} a & b \\  c & d \end{pmatrix} . The constructed matrix  \mathfrak{H} has determinant equal to  (z_1-z_2) (z_1-z_3)(z_2-z_3)(w_1-w_2) (w_1-w_3)(w_2-w_3) which does not vanish if the zi resp. wi are pairwise different thus the Möbius transformation is well-defined.

Remark: A similar determinant (with  wz replaced by  w^2+z^2 ) leads to the equation of a circle through three different (non-collinear) points in the plane.

Alternate method using cross-ratios of point quadruples

This construction exploits the fact (mentioned in the first section) that the cross-ratio

 
\mbox{cr}(z_1,z_2,z_3,z_4)=
{{(z_1-z_3)(z_2-z_4)}\over{(z_1-z_4)(z_2-z_3)}}

is invariant under a Möbius transformation mapping a quadruple  (z_1,z_2,z_3,z_4) to  (w_1,w_2,w_3,w_4) via w_i=\mathfrak{H}z_i. If \mathfrak{H} maps a triple  (z_1,z_2,z_3) of pairwise different zi to another triple  (w_1,w_2,w_3) , then the

Möbius transformation \mathfrak{H} is determined by the equation

 
\mbox{cr}(\mathfrak{H}(z),w_1,w_2,w_3)=\mbox{cr}(z,z_1,z_2,z_3),

or written out in concrete terms:


{{(\mathfrak{H}(z)-w_2)(w_1-w_3)}
\over{(\mathfrak{H}(z)-w_3)(w_1-w_2)}}
={{(z-z_2)(z_1-z_3)}\over{(z-z_3)(z_1-z_2)}}\ .

The last equation can be transformed into


{{\mathfrak{H}(z)-w_2}
\over{\mathfrak{H}(z)-w_3}}
={{(z-z_2)(w_1-w_2)(z_1-z_3)}\over{(z-z_3)(w_1-w_3)(z_1-z_2)}} \ .

Solving this equation for  \mathfrak{H}(z) one obtains the sought transformation.

Relation to the fixed point normal form

Assume that the points  z_2,\, z_3 are the two (different) fixed points of the Möbius transform  \mathfrak{H} i.e.  w_2=z_2, \, w_3=z_3. Write  z_2 =\gamma_1,\, z_3 =\gamma_2 . The last equation


{{\mathfrak{H}(z)-w_2}
\over{\mathfrak{H}(z)-w_3}}
={{(z-z_2)(w_1-w_2)(z_1-z_3)}\over{(z-z_3)(w_1-w_3)(z_1-z_2)}}

then reads


{{\mathfrak{H}(z)-\gamma_1}
\over{\mathfrak{H}(z)-\gamma_2}}
={{(w_1-\gamma_1)(z_1-\gamma_2)}\over {(w_1-\gamma_2)(z_1-\gamma_1)}}\cdot  {{z-\gamma_1}\over {z-\gamma_2}}\ .

In the previous section on normal form a Möbius transform with two fixed points  \gamma_1, \gamma_2 was expressed using the characteristic constant k of the transform as


{{\mathfrak{H}(z)-\gamma_1}
\over{\mathfrak{H}(z)-\gamma_2}}
=k\,{{z-\gamma_1}\over {z-\gamma_2}}\ .

Comparing both expressions one derives the equality

 k={{(w_1-\gamma_1)(z_1-\gamma_2)}\over {(w_1-\gamma_2)(z_1-\gamma_1)}}=\mbox{cr}(w_1,z_1,\gamma_1,\gamma_2)  \ ,

where  z_1 is different from the fixed points  \gamma_1 ,\, \gamma_2 and  w_1=\mathfrak{H}(z_1) is the image of z1 under  \mathfrak{H} . In particular the cross-ratio  \mbox{cr}(\mathfrak{H}(z),z,\gamma_1,\gamma_2) does not depend on the choice of the point z (different from the two fixed points) and is equal to the characteristic constant.

Lorentz transformations

The real Minkowski space consists of a the four-dimensional real coordinate space R4 consisting of the space of ordered quadruples (x0,x1,x2,x3) of real numbers, together with a quadratic form

Q(x_0,x_1,x_2,x_3) = x_0^2-x_1^2-x_2^2-x_3^2.

Borrowing terminology from special relativity, points with Q > 0 are considered timelike; in addition, if x0 > 0, then the point is called future-pointing. Points with Q < 0 are called spacelike. The null cone S consists of those points where Q = 0; the future null cone N+ are those points on the null cone with x0 > 0. The celestial sphere is then identified with the collection of rays in N+ whose initial point is the origin of R4. The collection of linear transformations on R4 with positive determinant preserving the quadratic form Q and preserving the time direction form the orthochronous Lorentz group SO+(1,3).

In connection with the geometry of the celestial sphere, the group of transformations SO+(1,3) is identified with the group Möbius transformations of the sphere PSL(2,C) by exhibiting the action of the spin group on spinors (Penrose & Rindler 1986). To each (x0,x1,x2,x3) ∈ R4, associate the hermitian matrix

X=\begin{bmatrix}
x_0+x_1 & x_2+ix_3\\
x_2-ix_3 & x_0-x_1
\end{bmatrix}.

The determinant of the matrix X is equal to Q(x0,x1,x2,x3). The special linear group acts on the space of such matrices via

X\mapsto AXA^* 

 

 ( 1 )

 

for each A ∈ SL(2,C), and this action of SL(2,C) preserves the determinant of X because det A = 1. Since the determinant of X is identified with the quadratic form Q, SL(2,C) acts by Lorentz transformations. On dimensional grounds, SL(2,C) covers a neighborhood of the identity of SO(1,3). Since SL(2,C) is connected, it covers the entire orthochronous Lorentz group SO+(1,3). Furthermore, if is that the kernel of the action (1 ) is the subgroup {±I}, then passing to the quotient group gives the group isomorphism

PSL(2,\mathbf{C})\cong SO^+(1,3). 

 

 ( 2 )

 

Focussing now attention on the case when (x0,x1,x2,x3) is null, the matrix X has zero determinant, and therefore splits as the outer product of a complex two-vector ξ with its complex conjugate:

X = \xi\bar{\xi}^T=\xi\xi^*. 

 

 ( 3 )

 

The two-component vector ξ is acted upon by SL(2,C) in a manner compatible with (1 ). It is now clear that the kernel of the representation of SL(2,C) on hermitian matrices is {±I}.

The action of PSL(2,C) on the celestial sphere may also be described geometrically using stereographic projection. Consider first the hyperplane in R4 given by x0 = 1. The celestial sphere may be identified with the sphere S+ of intersection of the hyperplane with the future null cone N+. The stereographic projection from the north pole (1,0,0,1) of this sphere onto the plane x3 = 0 takes a point with coordinates (1,x1,x2,x3) with

x_1^2+x_2^2+x_3^2=1

to the point

\left(1, \frac{x_1}{1-x_3}, \frac{x_2}{1-x_3},0\right).

Introducing the complex coordinate

\zeta = \frac{x_1+ix_2}{1-x_3},

the inverse stereographic projection gives the following formula for a point (x1, x2, x3) on S+:


\begin{align}
x_1 &= \frac{\zeta+\bar{\zeta}}{\zeta\bar{\zeta}+1}\\
x_2 &= \frac{\zeta-\bar{\zeta}}{i(\zeta\bar{\zeta}+1)}\\
x_3 &= \frac{\zeta\bar{\zeta}-1}{\zeta\bar{\zeta}+1}.
\end{align}
 

 

 ( 4 )

 

The action of SO+(1,3) on the points of N+ does not preserve the hyperplane S+, but acting on points in S+ and then rescaling so that the result is again in S+ gives an action of SO+(1,3) on the sphere which goes over to an action on the complex variable ζ. In fact, this action is by fractional linear transformations, although this is not easily seen from this representation of the celestial sphere. Conversely, for any fractional linear transformation of ζ variable goes over to a unique Lorentz transformation on N+, possibly after a suitable (uniquely determined) rescaling.

A more invariant description of the stereographic projection which allows the action to be more clearly seen is to consider the variable ζ = z:w as a ratio of a pair of homogeneous coordinates for the complex projective line CP1. The stereographic projection goes over to a transformation from C2 − {0} to N+ which is homogeneous of degree two with respect to real scalings

(z,w)\mapsto (x_0,x_1,x_2,x_3)=(z\bar{z}+w\bar{w}, z\bar{w}+w\bar{z}, i^{-1}(z\bar{w}-w\bar{z}), z\bar{z}-w\bar{w}) 

 

 ( 5 )

 

which agrees with (4 ) upon restriction to scales in which \scriptstyle{z\bar{z}+w\bar{w}=1}. The components of (5 ) are precisely those obtained from the outer product


\begin{bmatrix}
x_0+x_1 & x_2+ix_3\\
x_2-ix_3 & x_0-x_1
\end{bmatrix} = 
2\begin{bmatrix}
z\\ w
\end{bmatrix}
\begin{bmatrix}
\bar{z}&\bar{w}
\end{bmatrix}.

In summary, the action of the orthochronous Lorentz group SO+(1,3) agrees with that of the Möbius group PSL(2,C). This motivates the following definition. In dimension n ≥ , the Möbius group Möb(n) is the group of all orientation-preserving conformal isometries of the round sphere Sn to itself. By realizing the conformal sphere as the space of future-pointing rays of the null cone in the Minkowski space R1,n+1, there is an isomorphism of Möb(n) with the orthochronous Lorentz group SO+(1,n+1) of Lorentz transformations with positive determinant, preserving the direction of time.

See also

  • Bilinear transform
  • Conformal geometry
  • Fuchsian group
  • Generalised circle
  • Hyperbolic geometry
  • Inversive geometry
  • Inversive ring geometry
  • Inversion transformation
  • Kleinian group
  • Lie sphere geometry
  • Lorentz group
  • Modular group
  • Poincaré half-plane model
  • Projective geometry

References

External links