Quaternions, in mathematics, are a non-commutative extension of complex numbers. They were first described by the Irish mathematician Sir William Rowan Hamilton in 1843 and applied to mechanics in three-dimensional space. They find uses in both theoretical and applied mathematics, in particular for calculations involving three-dimensional rotations, such as in 3D computer graphics, although they have been superseded in many applications by vectors and matrices.
In modern language, quaternions form a 4-dimensional normed division algebra over the real numbers. The algebra of quaternions is often denoted by H (for Hamilton), or in blackboard bold by (Unicode ℍ). It can also be given by the Clifford algebra classifications Cℓ0,2(R) = Cℓ03,0(R). The algebra H holds a special place in analysis since, according to the Frobenius theorem, it is one of only three finite-dimensional division rings containing the real numbers as a subring.
Quaternions were introduced by Irish mathematician Sir William Rowan Hamilton in 1843. Hamilton knew that the complex numbers could be viewed as points in a plane, and he was looking for a way to do the same for points in space. Points in space can be represented by their coordinates, which are triples of numbers, and for many years Hamilton had known how to add and multiply triples of numbers. But he had been stuck on the problem of division: He did not know how to take the quotient of two points in space.
On October 16, 1843, Hamilton and his wife took a walk along the Royal Canal in Dublin. While they walked across Brougham Bridge (now Broom Bridge), a solution suddenly occurred to him. He could not divide triples, but he could divide quadruples. By using three of the numbers in the quadruple as the points of a coordinate in space, Hamilton could represent points in space by his new system of numbers. He then carved the basic rules for multiplication into the bridge:
Hamilton called a quadruple with these rules of multiplication a quaternion, and he devoted the remainder of his life to studying and teaching them. He founded a school of "quaternionists" and popularized them in several books. The last and longest, Elements of Quaternions, had 800 pages and was published shortly after his death.
After Hamilton's death, his pupil Peter Tait continued promoting quaternions. At this time, quaternions were a mandatory examination topic in Dublin, and in some American universities they were the only advanced mathematics topic taught. Topics in physics and geometry that would now be described using vectors, such as kinematics in space and Maxwell's equations, were described entirely in terms of quaternions. There was even a professional research association, the Quaternion Society (1899 - 1913), exclusively devoted to the study of quaternions.
From the mid 1880s, quaternions began to be displaced by vector analysis, which had been developed by Josiah Willard Gibbs and Oliver Heaviside. Vector analysis described the same phenomena as quaternions, so it borrowed ideas and terms liberally from the classical quaternion literature. However, vector analysis was conceptually simpler and notationally cleaner, and eventually quaternions were relegated to a minor role in mathematics and physics. A side effect of this transition is that works on classical Hamiltonian quaternions are difficult to comprehend for many modern readers because they use familiar terms from vector analysis in unfamiliar and fundamentally different ways.
However, quaternions have had a revival in the late 20th century, primarily due to their utility in describing spatial rotations. Representations of rotations by quaternions are more compact and faster to compute than representations by matrices. For this reason, quaternions are used in computer graphics,[1] control theory, signal processing, attitude control, physics, bioinformatics, and orbital mechanics. For example, it is common for spacecraft attitude-control systems to be commanded in terms of quaternions. Quaternions have received another boost from number theory because of their relation to quadratic forms.
Since 1989, the Department of Mathematics of the National University of Ireland, Maynooth has organized a pilgrimage, where scientists (including physicists Murray Gell-Mann in 2002, Steven Weinberg in 2005, and mathematician Andrew Wiles in 2003) take a walk from Dunsink Observatory to the Royal Canal bridge where, unfortunately, no trace of Hamilton's carving remains.
As a set, the quaternions H are equal to R4, a four-dimensional vector space over the real numbers. H has three operations: Addition, scalar multiplication, and quaternion multiplication. The sum of two elements of H is defined to be their sum as elements of R4. Similarly the product of an element of H by a real number is defined to be the same as the product in R4. To define multiplication in H requires a choice of basis for R4. The elements of this basis are customarily denoted as 1, i, j, and k. Every element of H can be uniquely written as a linear combination of these basis elements, that is, as a1 + bi + cj + dk, where a, b, c, and d are real numbers. The basis element 1 will be the identity element of H, meaning that multiplication by 1 does nothing, and for this reason, elements of H are usually written a + bi + cj + dk, suppressing the basis element 1. Given this basis, quaternion multiplication is defined by first defining the products of basis elements and then defining all other products using the distributive law.
The equations
where i, j, and k are imaginary numbers, determine all the possible products of i, j, and k. For example, since
right-multiplying both sides by k gives
All the other possible products can be determined by similar methods, and this gives the following table:
For two elements a1 + b1i + c1j + d1k and a2 + b2i + c2j + d2k, their Hamilton product (a1 + b1i + c1j + d1k)(a2 + b2i + c2j + d2k) is determined by the products of the basis elements and the distributive law. The distributive law makes it possible to expand the product so that it is a sum of products of basis elements. This gives the following expression:
Now the basis elements can be multiplied using the rules given above to get:
Using the basis 1, i, j, k of H makes it possible to write H as a set of quadruples:
Then the basis elements are:
and the formulas for addition and multiplication are:
A number of the form a + 0i + 0j + 0k, where a is a real number, is called real, and a number of the form 0 + bi + cj + dk, where b, c, and d are real numbers (possibly all zero), is called pure imaginary. If a + bi + cj + dk is any quaternion, then a is called its scalar part and bi + cj + dk is called its vector part. The scalar part of a quaternion is always real, and the vector part is always pure imaginary. Even though every quaternion is a vector in a four-dimensional vector space, it is common to define a vector to mean a pure imaginary quaternion. With this convention, a vector is the same as an element of the vector space R3. This identification leads to many applications.
Hamilton called pure imaginary quaternions right quaternions[2][3] and real numbers (considered as quaternions with zero vector part) scalar quaternions.
Unlike multiplication of real or complex numbers, multiplication of quaternions is not commutative: For example, , while . The non-commutativity of multiplication has some unexpected consequences, among them that polynomial equations over the quaternions can have more distinct solutions than the degree of the polynomial. The equation , for instance, has infinitely many quaternion solutions with , so that these solutions form a two-dimensional sphere centered on zero in the three-dimensional pure imaginary subspace of quaternions. This sphere intersects the complex plane at the two poles and .
Conjugation of quaternions is analogous to conjugation of complex numbers and to transposition (also known as reversal) of elements of Clifford algebras. To define it, let q = a +bi +cj + dk be a quaternion. The conjugate of q is the quaternion a − bi − cj − dk. It is denoted by q*, , qt, or . Conjugation is an involution, meaning that it is its own inverse, so conjugating an element twice returns the original element. The conjugate of a product of two quaternions is the product of the conjugates in the reverse order. That is, if p and q are quaternions, then (pq)* = q*p*, not p*q*.
Conjugation can be used to extract the scalar and vector parts of a quaternion. The scalar part of p is (p + p*)/2, and the vector part of p is (p − p*)/2.
The square root of the product of a quaternion with its conjugate is called its norm and is denoted ||q||. (Hamilton called this quantity the tensor of q, but this conflicts with modern usage. See tensor.) It has the formula
This is always a non-negative real number, and it is same as the Euclidean norm on H considered as the vector space R4. Multiplying a quaternion by a real number scales its norm by the absolute value of the number. That is, if α is real, then
This is a special case of the fact that the norm is multiplicative, meaning that
for any two quaternions p and q. Multiplicativity is a consequence of the formula for the conjugate of a product. This norm makes it possible to define the distance d(p, q) between p and q as the norm of their difference:
This makes H into a metric space. Addition and multiplication are continuous in the metric topology.
A unit quaternion is a quaternion of norm one. Dividing a non-zero quaternion q by its norm produces a unit quaternion Uq called the versor of q:
Using conjugation and the norm makes it possible to define the reciprocal of a quaternion. The product of a quaternion with its reciprocal should equal 1, and the considerations above imply that the product of and (in either order) is 1. So the reciprocal of q is defined to be
This makes it possible to divide two quaternions p and q in two different ways. That is, their quotient can be either pq−1 or q−1p. The notation p/q is ambiguous because it does not specify whether q divides on the left or the right.
The set H of all quaternions is a vector space over the real numbers with dimension 4. (In comparison, the real numbers have dimension 1, the complex numbers have dimension 2, and the octonions have dimension 8.) They are an associative algebra over the real numbers. Even though they contain copies of the complex numbers, they are not an associative algebra over the complex numbers because multiplying a quaternion by a complex number is not always commutative.
The norm makes the quaternions into a normed algebra, and even into a composition algebra and a unital Banach algebra. Composition algebras are very rare. Hurwitz's theorem states that there are only four composition algebras over the real numbers: R, C, H, and O. Because it is possible to divide quaternions, they form a division algebra. This is a structure similar to a field except for the commutativity of multiplication. Finite-dimensional division algebras over the real numbers are also very rare. The Frobenius theorem states that there are exactly three: R, C, and H.
Because the product of any two basis vectors is plus or minus another basis vector, the set {±1, ±i, ±j, ±k} forms a group under multiplication. This group is called the quaternion group and is denoted Q8. The real group ring of Q8 is a ring RQ8 which is also an eight-dimensional vector space over R. It has one basis vector for each element of Q8. The quaternions are the quotient ring of RQ8 by the ideal generated by the elements 1 − (−1), i − (−i), j − (−j), and k − (−k). Here the first term in each of the differences is one of the basis elements 1, i, j, and k, and the second term is one of basis elements −1, −i, −j, and −k, not the additive inverses of 1, i, j, and k.
Because the vector part of a quaternion is a vector in R3, the geometry of R3 is reflected in the algebraic structure of the quaternions. Many operations on vectors can be defined in terms of quaternions, and this makes it possible to apply quaternion techniques wherever spatial vectors arise. For instance, this is true in electrodynamics, general relativity, and 3D computer graphics.
For the remainder of this section, i, j, and k will denote both imaginary[4] basis vectors of H and a basis for R3. Notice that replacing i by −i, j by −j, and k by −k sends a vector to its additive inverse, so the additive inverse of a vector is the same as its conjugate as a quaternion. For this reason, conjugation is sometimes called the spatial inverse.
Choose two imaginary quaternions p = b1i + c1j + d1k and q = b2i + c2j + d2k. Their dot product is
This is equal to the scalar parts of p*q, qp*, pq*, and q*p. (Note that the vector parts of these four products are different.) It also has the formulas
The cross product of p and q relative to the orientation determined by the ordered basis i, j, and k is
(Recall that the orientation is necessary to determine the sign.) This is equal to the vector part of the product pq (as quaternions), as well as the vector part of −q*p*. It also has the formula
In general, let p and q be quaternions (possibly non-imaginary), and write
where ps and qs are the scalar parts of p and q and and are the vector parts of p and q. Then we have the formula
This shows that the non-commutativity of quaternion multiplication comes from the multiplication of right quaternions. It also shows that the product of two quaternions is commutative if and only if their vector parts are collinear.
There are at least two ways of representing quaternions as matrices in such a way that quaternion addition and multiplication correspond to matrix addition and matrix multiplication. One is to use 2×2 complex matrices, and the other is to use 4×4 real matrices. In the terminology of abstract algebra, these are injective homomorphisms from H to the matrix rings M2(C) and M4(R), respectively.
Using 2×2 complex matrices, the quaternion a + bi + cj + dk can be represented as
This representation has the following properties:
Using 4×4 real matrices, that same quaternion can be written as
In this representation, the conjugate of a quaternion corresponds to the transpose of the matrix. The fourth power of the norm of a quaternion is the determinant of the corresponding matrix. Complex numbers are block diagonal matrices with two 2×2 blocks.
Quaternions can be represented as pairs of complex numbers. From this perspective, quaternions are the result of applying the Cayley–Dickson construction to the complex numbers. This is a generalization of the construction of the complex numbers as pairs of real numbers.
Let C2 be a two-dimensional vector space over the complex numbers. Choose a basis consisting of two elements 1 and j. A vector in C2 can be written in terms of the basis elements 1 and j as
If we define j2 = −1 and ij = −ji, then we can multiply two vectors using the distributive law. Writing k in place of the product ij leads to the same rules for multiplication as the usual quaternions. Therefore the above vector of complex numbers corresponds to the quaternion a + bi + cj + dk. If we write the elements of C2 as ordered pairs and quaternions as quadruples, then the correspondence is
In the complex numbers, there are just two numbers, i and −i, whose square is −1 . In H there are an infinity of square roots of minus one: the quaternion solution for the square root of −1 includes every point on the surface of the unit sphere in 3-space. To see this, let q = a + bi + cj + dk be a quaternion, and assume that its square is −1. In terms of a, b, c, and d, this means
To satisfy the last three equations, either a = 0 or b, c, and d are all 0. The latter is impossible because a is a real number and the first equation would imply that a2 = −1. Therefore a = 0 and b2 + c2 + d2 = 1. In other words, a quaternion squares to −1 if and only if it is a vector (that is, pure imaginary) with norm 1. By definition, the set of all such vectors forms the unit sphere.
Each square root of −1 creates a distinct copy of the complex numbers inside the quaternions. If q2 = −1, then the copy is determined by the function
In the language of abstract algebra, this is an injective ring homomorphism from C to H.
Every non-real quaternion lies in a unique copy of C. Write q as the sum of its scalar part and its vector part:
Decompose the vector part further as the product of its norm and its versor:
(Note that this is not the same as .) The versor of the vector part of q, , is a pure imaginary unit quaternion, so its square is −1. Therefore it determines a copy of the complex numbers by the function
Under this function, q is the image of the complex number . Thus H is the union of complex planes intersecting in a common real line, where the union is taken over the sphere of square roots of minus one.
The relationship of quaternions to each other within the complex subplanes of H can also be identified and expressed in terms of commutative subrings. Specifically, since two quaternions p and q commute (p q = q p) only if they lie in the same complex subplane of H, the profile of H as a union of complex planes arises when one seeks to find all commutative subrings of the quaternion ring. Ian R. Porteous's book Clifford Algebras and the Classical Groups (Cambridge, 1995) describes the role of the sphere of unit vectors in proposition 8.13 on page 60.
Functions of a complex variable can be extended to functions of a quaternion variable as follows:
Let the complex function be written
where u and v are real-valued functions of two real variables. According to the above profile, any quaternion can be written
Then the extension is given by .
This is called Fueter's construction.
The multiplicative group of non-zero quaternions acts by conjugation on the copy of R³ consisting of quaternions with real part equal to zero. The conjugation by a unit quaternion (a quaternion of absolute value 1) with real part cos(t) is a rotation by an angle 2t, the axis of the rotation being the direction of the imaginary part. The advantages of quaternions are:
The set of all unit quaternions (versors) forms a 3-dimensional sphere S³ and a group (a Lie group) under multiplication. S³ is the double cover of the group SO(3,R) of real orthogonal 3×3 matrices of determinant 1 since two unit quaternions correspond to every rotation under the above correspondence.
The image of a subgroup of S³ is a point group, and conversely, the preimage of a point group is a subgroup of S³. The preimage of a finite point group is called by the same name, with the prefix binary. For instance, the preimage of the icosahedral group is the binary icosahedral group.
The group S³ is isomorphic to SU(2), the group of complex unitary 2×2 matrices of determinant 1. Let A be the set of quaternions of the form a + bi + cj + dk where a, b, c, and d are either all integers or all rational numbers with odd numerator and denominator 2. The set A is a ring and a lattice. There are 24 unit quaternions in this ring, and they are the vertices of a 24-cell regular polytope with Schläfli symbol {3,4,3}.
If F is any field with characteristic different from 2, and a and b are elements of F, one may define a four-dimensional unitary associative algebra over F with basis 1, i, j, and ij, where i2 = a, j2 = b and ij = −ji (so ij2 = −ab). These algebras are called quaternion algebras and are isomorphic to the algebra of 2×2 matrices over F or form division algebras over F, depending on the choice of a and b.
The usefulness of quaternions for geometrical computations can be generalised to other dimensions, by identifying the quaternions as the even part Cℓ+3,0(R) of the Clifford algebra Cℓ3,0(R). This is an associative multivector algebra built up from fundamental basis elements σ1, σ2, σ3 using the product rules
If these fundamental basis elements are taken to represent vectors in 3D space, then it turns out that the reflection of a vector r in a plane perpendicular to a unit vector w can be written:
Now in geometry, two reflections make a rotation, by an angle twice the angle between the two reflection planes, so
corresponds to a rotation of 180° in the plane containing σ1 and σ2
But this is very similar to the corresponding quaternion formula,
In fact, the two are identical, if we make the identification
and it is straightforward to confirm that this preserves the Hamilton relations
In this picture, quaternions correspond not to vectors but to bivectors, quantities with magnitude and orientations associated with particular 2D planes rather than 1D directions. The relation to complex numbers becomes clearer, too: in 2D, with two vector directions σ1 and σ2, there is only one bivector basis element σ1σ2, so only one imaginary. But in 3D, with three vector directions, there are three bivector basis elements σ1σ2, σ2σ3, σ3σ1, so three imaginaries.
With this recognition, the sequence can be continued. So in the Clifford algebra Cℓ4,0(R), there are six bivector basis elements, since with four different basic vector directions, six different pairs and therefore six different linearly independent planes can be defined. Rotations in such spaces using these generalisations of quaternions, called rotors, can be very useful for applications involving homogeneous coordinates. But it is only in 3D that the number of basis bivectors equals the number of basis vectors, and each bivector can be identified as a pseudovector.
Dorst et al identify the following advantages for placing quaternions in this wider setting:[5]
For further detail about the geometrical uses of Clifford algebras, see Geometric algebra.
As a linear algebra over the reals, quaternions constitute a real vector space with a rank-three tensor, S, on it, sometimes called the structure tensor. This once contravariant twice covariant tensor converts a one-form and vectors and to a real number S. For each one-form , S is a twice covariant tensor, which, if symmetric, is an inner product on H. Since any real vector space can also be considered a linear manifold, such an inner product is naturally extended to a tensor field, and in case of its nondegeneracy, becomes a (pseudo- or proper-) Euclidean metric . For quaternions this inner product[6] is indefinite, its signature is independent of the one-form , and the corresponding pseudo-Euclidean metric is Minkowski [1]. This metric is automatically extended over the Lie group of nonzero quaternions along its left invariant vector fields resulting in a closed FLRW metric [2] – an important solution of the Einstein equations. These results have some implications for the problem of compatibility between quantum mechanics and general relativity within the framework of quantum gravity [3].
|