Rotation representation (mathematics)
From Wikipedia, the free encyclopedia
In geometry a rotation representation expresses the orientation of an object (or coordinate frame) relative to a coordinate reference frame. This concept extends to classical mechanics where rotational (or angular) kinematics is the science of describing the purely rotational motion of an object.
According to Euler's rotation theorem the general displacement of a rigid body (or coordinate frame) with one point fixed is described by a rotation about some axis. Furthermore, such a rotation may be uniquely described by a minimum of three parameters. However, for various reasons, there are several ways to represent such a rotation. Many of these definitions use more than the necessary minimum of three parameters, although any such representation still has only three degrees of freedom.
An example where rotation representation is used is in computer vision, where an automated observer needs to track a target. Consider a rigid body with an orthogonal right-handed triad $\hat{\mathbf{u}}$, $\hat{\mathbf{v}}$ and $\hat{\mathbf{w}}$ of unit vectors fixed to its body (the object coordinate system). The basic problem is to specify the orientation of this triad, and hence of the rigid body, in terms of the reference coordinate frame (in our case the camera coordinate system).
Rotation matrix (or direction cosine matrix)
Specifying the coordinates (scalar components) of a basis for the new (rotated) set of coordinate axes in terms of the original (non-rotated) coordinate axes completely describes the rotation. The three unit vectors $\hat{\mathbf{u}}$, $\hat{\mathbf{v}}$ and $\hat{\mathbf{w}}$ which form the basis of the new system each consist of 3 coordinates, yielding a total of 9 parameters. These parameters can be written as the elements of a 3 × 3 matrix A, called the rotation matrix or direction cosine matrix (DCM).
Each of the elements is the cosine of the angle between a rotated unit basis vector and one of the reference axes, which is why A is often called the direction cosine matrix. The elements of the DCM are not all independent: as Euler's rotation theorem dictates, the DCM has only three degrees of freedom. The DCM has the following properties:
- A is a real, orthogonal matrix.
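- The eigenvalues of A are $\{1, e^{\pm i\theta}\}$, or equivalently $\{1, \cos\theta + i\sin\theta, \cos\theta - i\sin\theta\}$, where i is the imaginary unit.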
- The determinant (being the product of the eigenvalues) of A is +1.
The angle θ which appears in the eigenvalue expression corresponds to the angle of the Euler axis and angle representation. The eigenvector corresponding to the eigenvalue 1 is the accompanying Euler axis, since the axis is the only (nonzero) direction which remains unchanged when left-multiplied (rotated) by the rotation matrix.
Two successive rotations represented by matrices $A_1$ and $A_2$ are easily combined as follows: $A_{\text{total}} = A_2 A_1$ (note the order: since the vector being rotated is multiplied from the right, the first rotation $A_1$ stands on the right). The ease with which vectors can be rotated using a DCM, as well as the ease of combining successive rotations, makes the DCM a very useful and popular way to represent rotations, even though it is less concise than other representations.
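The properties listed above and the composition rule can be checked numerically. The following is a minimal sketch in plain Python; the helper names (`rot_z`, `rot_x`, `matmul`, `det3`, ...) are illustrative and not taken from any library, and the sign placed on the sine terms is an assumed convention (the transposed matrices are the other common choice).

```python
import math

def rot_z(angle):
    """DCM for a rotation by `angle` radians about Z (assumed sign convention)."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]

def rot_x(angle):
    """DCM for a rotation by `angle` radians about X (same sign convention)."""
    c, s = math.cos(angle), math.sin(angle)
    return [[1.0, 0.0, 0.0], [0.0, c, s], [0.0, -s, c]]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def transpose(a):
    return [[a[j][i] for j in range(3)] for i in range(3)]

def det3(a):
    return (a[0][0] * (a[1][1] * a[2][2] - a[1][2] * a[2][1])
            - a[0][1] * (a[1][0] * a[2][2] - a[1][2] * a[2][0])
            + a[0][2] * (a[1][0] * a[2][1] - a[1][1] * a[2][0]))

def max_abs_diff(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(3) for j in range(3))

a1 = rot_z(0.3)            # first rotation
a2 = rot_x(0.5)            # second rotation
a_total = matmul(a2, a1)   # the first rotation stands on the right

identity = [[float(i == j) for j in range(3)] for i in range(3)]
orthogonality_error = max_abs_diff(matmul(transpose(a_total), a_total), identity)
determinant = det3(a_total)
# Swapping the factors gives a different matrix: rotations do not commute.
order_matters = max_abs_diff(a_total, matmul(a1, a2)) > 1e-6
```

Here `orthogonality_error` stays at round-off level and `determinant` stays at +1 up to round-off, while `order_matters` is true because the two rotations are about different axes.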
Euler axis and angle (rotation vector)
From Euler's rotation theorem we know that any rotation can be expressed as a single rotation about some axis. The axis is the unit vector (unique except for sign) which remains unchanged by the rotation. The magnitude of the angle is also unique, with its sign being determined by the sign of the rotation axis.
The axis can be represented as a three-dimensional unit vector $\hat{\mathbf{e}} = [e_1\ e_2\ e_3]^T$, and the angle by a scalar θ.
Since the axis is normalized, it has only two degrees of freedom. The angle adds the third degree of freedom to this rotation representation.
A similar but more concise way to express a rotation is by using a rotation vector, a non-normalized three-dimensional vector whose direction specifies the axis and whose length is θ: $\mathbf{r} = \theta\,\hat{\mathbf{e}}$
If the rotation angle θ is zero, the axis is not uniquely defined.
Combining two successive rotations, each represented by an Euler axis and angle, is not straightforward. It is usual to convert to direction cosine matrix (DCM) or quaternion notation, calculate the product, and then convert back to Euler axis and angle.
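The relationship between the axis and angle pair and the rotation vector can be sketched as follows. The function names are hypothetical, and returning the X axis for a zero rotation is an arbitrary choice made here because the axis is not defined in that case.

```python
import math

def to_rotation_vector(axis, angle):
    """Scale the unit axis by the rotation angle (radians)."""
    return [angle * c for c in axis]

def from_rotation_vector(r, eps=1e-12):
    """Recover (axis, angle) from a rotation vector.

    For a (near-)zero rotation the axis is undefined; an arbitrary
    axis is returned in that case (an assumption of this sketch).
    """
    angle = math.sqrt(sum(c * c for c in r))
    if angle < eps:
        return [1.0, 0.0, 0.0], 0.0
    return [c / angle for c in r], angle

r = to_rotation_vector([0.0, 0.0, 1.0], 0.5)
axis, angle = from_rotation_vector(r)
```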
Euler angles
The idea behind Euler angles is to split the complete rotation of the coordinate system into three simpler constitutive rotations, in such a way that the complete rotation matrix is the product of three simpler matrices. Notice that the outer matrix represents a rotation around one of the axes of the reference frame, and the inner matrix represents a rotation around one of the axes of the moving frame. The middle matrix represents a rotation around an intermediate axis called the line of nodes.
Unfortunately, the definition of Euler angles is not unique and in the literature many different conventions are used. These conventions depend on the axes about which the rotations are carried out, and their sequence (since rotations are not commutative).
The convention being used is usually indicated by specifying the axes about which the consecutive rotations (before being composed) take place, referring to them by index (1,2,3) or letter (X,Y,Z). The engineering and robotics communities typically use 3-1-3 Euler angles. Notice that after composing the independent rotations, they no longer rotate about the original axes. The most external matrix rotates the other two, leaving the second rotation matrix over the line of nodes, and the third one in a frame comoving with the body. There are 3 × 3 × 3 = 27 possible axis sequences for Euler angle rotations, but not all of them represent real decompositions: a sequence that uses the same axis for two consecutive rotations collapses to fewer than three independent rotations, which leaves 12 usable sequences.
The Euler angles are never expressed in terms of the external frame, or in terms of the co-moving rotated body frame, but in a mixture. Other angle conventions (e.g., Tait-Bryan angles, Fick angles, Helmholtz angles, roll-pitch-yaw, dip-slip-rake, azimuth-elevation-skew) are used to avoid this problem.
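The count of usable axis sequences can be verified by enumeration. This is a small illustrative sketch; the split into "proper Euler" sequences (first and last axis equal, such as 3-1-3) and Tait-Bryan sequences (three distinct axes) follows the usual naming.

```python
from itertools import product

AXES = "XYZ"
all_sequences = ["".join(s) for s in product(AXES, repeat=3)]

# A sequence is usable only if no two consecutive rotations share an axis;
# otherwise the two rotations merge into one and a degree of freedom is lost.
valid = [s for s in all_sequences if s[0] != s[1] and s[1] != s[2]]

proper_euler = [s for s in valid if s[0] == s[2]]   # e.g. ZXZ (3-1-3)
tait_bryan = [s for s in valid if s[0] != s[2]]     # e.g. XYZ (1-2-3)

print(len(all_sequences), len(valid), len(proper_euler), len(tait_bryan))
# prints: 27 12 6 6
```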
Quaternions
Quaternions (Euler symmetric parameters) have proven very useful in representing rotations due to several advantages over the other representations mentioned in this article.
A quaternion representation of rotation is written as a normalized four-dimensional vector $\mathbf{q} = [q_1\ q_2\ q_3\ q_4]^T$. In terms of the Euler axis $\hat{\mathbf{e}} = [e_1\ e_2\ e_3]^T$ and angle θ, this vector's elements are expressed as follows:
$$q_1 = e_1 \sin(\theta/2), \quad q_2 = e_2 \sin(\theta/2), \quad q_3 = e_3 \sin(\theta/2), \quad q_4 = \cos(\theta/2)$$
The above definition follows the convention used in (Wertz 1980) and (Markley 2003). An alternative definition used in some publications defines the "scalar" term as the first quaternion element, with the other elements shifted down one position (Coutsias 1999), (Schmidt 2001).
Inspection shows that the quaternion parametrization obeys the following constraint:
$$q_1^2 + q_2^2 + q_3^2 + q_4^2 = 1$$
The last term (in our definition) is often called the scalar term, which has its origin in quaternions when understood as the mathematical extension of the complex numbers, written as
$a + bi + cj + dk$ with $a = q_4$, $b = q_1$, $c = q_2$, $d = q_3$,
and where {i,j,k} are the hypercomplex numbers satisfying
$$i^2 = j^2 = k^2 = -1, \quad ij = -ji = k, \quad jk = -kj = i, \quad ki = -ik = j$$
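Quaternion multiplication is performed in the same manner as multiplication of complex numbers, except that the order of elements must be taken into account, since multiplication is not commutative. In matrix notation we can write the product of $\tilde{\mathbf{q}}$ (left factor) and $\mathbf{q}$ (right factor) as
$$\tilde{\mathbf{q}} \otimes \mathbf{q} = \begin{bmatrix} q_4 & q_3 & -q_2 & q_1 \\ -q_3 & q_4 & q_1 & q_2 \\ q_2 & -q_1 & q_4 & q_3 \\ -q_1 & -q_2 & -q_3 & q_4 \end{bmatrix} \begin{bmatrix} \tilde{q}_1 \\ \tilde{q}_2 \\ \tilde{q}_3 \\ \tilde{q}_4 \end{bmatrix}$$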
Combining two consecutive quaternion rotations is therefore just as simple as using DCMs. Recall that two successive DCM rotations, $A_1$ followed by $A_2$, are combined as follows: $A_3 = A_2 A_1$
The same combination can be expressed with quaternion parameters in a similarly concise way: $\mathbf{q}_3 = \mathbf{q}_1 \otimes \mathbf{q}_2$. Note the inverse ordering of quaternion multiplication when compared to DCM multiplication.
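The multiplication rule can be illustrated with a short sketch. It assumes the scalar-last storage order [q1, q2, q3, q4] used in this article and implements the ordinary Hamilton product that follows from the rules for i, j and k; the function name `quat_mul` is hypothetical.

```python
def quat_mul(p, q):
    """Hamilton product p*q of two scalar-last quaternions [q1, q2, q3, q4]."""
    p1, p2, p3, p4 = p
    q1, q2, q3, q4 = q
    return [
        p4 * q1 + q4 * p1 + p2 * q3 - p3 * q2,   # i component
        p4 * q2 + q4 * p2 + p3 * q1 - p1 * q3,   # j component
        p4 * q3 + q4 * p3 + p1 * q2 - p2 * q1,   # k component
        p4 * q4 - p1 * q1 - p2 * q2 - p3 * q3,   # scalar component
    ]

UNIT_I = [1, 0, 0, 0]
UNIT_J = [0, 1, 0, 0]
UNIT_K = [0, 0, 1, 0]

ij = quat_mul(UNIT_I, UNIT_J)   # equals k
ji = quat_mul(UNIT_J, UNIT_I)   # equals -k, so the order matters
ii = quat_mul(UNIT_I, UNIT_I)   # equals -1 (scalar part only)
```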
Quaternions are a very popular parametrization due to the following properties:
- More compact than the DCM representation and less susceptible to round-off errors
- The quaternion elements vary continuously over the unit sphere in $\mathbb{R}^4$ (denoted by $S^3$) as the orientation changes, avoiding discontinuous jumps (inherent to three-dimensional parameterizations)
- Expression of the DCM in terms of quaternion parameters involves no trigonometric functions
- It is simple to combine two individual rotations represented as quaternions using a quaternion product
Like DCMs, quaternions must sometimes be re-normalized due to rounding errors, to make sure that they correspond to valid rotations. The computational cost of re-normalizing a quaternion, however, is much less than that of re-normalizing a DCM.
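Re-normalizing a quaternion amounts to dividing its four elements by their Euclidean norm, as in this minimal sketch (the function name is illustrative):

```python
import math

def normalize_quaternion(q):
    """Return q scaled to unit length so that it represents a valid rotation."""
    norm = math.sqrt(sum(c * c for c in q))
    if norm == 0.0:
        raise ValueError("the zero quaternion cannot be normalized")
    return [c / norm for c in q]

drifted = [0.0, 0.0, 0.6, 0.9]          # norm is about 1.08, not 1
q_unit = normalize_quaternion(drifted)
```

A DCM, by contrast, has to be re-orthogonalized, which involves all nine elements and six constraints rather than a single norm.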
Rodrigues parameters
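Rodrigues parameters (also called the Gibbs vector) can be expressed in terms of the Euler axis $\hat{\mathbf{e}}$ and angle θ as follows:
$$\mathbf{g} = \hat{\mathbf{e}} \tan(\theta/2)$$
The Gibbs vector is undefined for 180° rotations, since tan(θ/2) is unbounded as θ approaches π; this is undesirable for global attitude representation.
Modified Rodrigues parameters (MRPs) can be expressed in terms of the Euler axis and angle by:
$$\mathbf{p} = \hat{\mathbf{e}} \tan(\theta/4)$$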
The modified Rodrigues parametrization shares many characteristics with the rotation vector parametrization, including the occurrence of discontinuous jumps in the parameter space when incrementing the rotation.
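The different behaviour of the two parametrizations near a half turn can be seen numerically. The sketch below uses hypothetical function names and takes the Euler axis and angle as inputs.

```python
import math

def gibbs_vector(axis, angle):
    """Rodrigues parameters: axis scaled by tan(angle / 2)."""
    return [c * math.tan(angle / 2.0) for c in axis]

def modified_rodrigues(axis, angle):
    """Modified Rodrigues parameters: axis scaled by tan(angle / 4)."""
    return [c * math.tan(angle / 4.0) for c in axis]

def length(v):
    return math.sqrt(sum(c * c for c in v))

axis = [0.0, 0.0, 1.0]
near_half_turn = 3.1                                          # slightly less than pi
gibbs_len = length(gibbs_vector(axis, near_half_turn))        # grows without bound
mrp_len = length(modified_rodrigues(axis, near_half_turn))    # stays close to 1
```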
- See also: Rodrigues' rotation formula
Cayley-Klein parameters
See the definition at Wolfram MathWorld.
Higher dimensional analogues
- See also: SO(4)
Conversion formulae between representations
DCM -> Euler angles
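The Euler angles (φ,θ,ψ) can be extracted from the rotation matrix by inspecting the combined DCM (rotation matrix) in analytical form.
Using the x-convention, the 3-1-3 Euler angles φ, θ and ψ (around the Z, X and again the Z axis) can be obtained as follows when sin θ ≠ 0 (sign convention of (Wertz 1980)):
$$\phi = \arctan(A_{31}, -A_{32}), \quad \theta = \arccos(A_{33}), \quad \psi = \arctan(A_{13}, A_{23})$$
Note that arctan(a,b) is equivalent to arctan(a / b), except that the signs of a and b are also used to select the quadrant of the result (the two-argument arctangent, often written atan2).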
DCM -> Euler axis/angle
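If the rotation angle θ is not a multiple of π, the Euler axis $\hat{\mathbf{e}} = [e_1\ e_2\ e_3]^T$ and angle θ can be computed from the elements of the rotation matrix as follows (sign convention of (Wertz 1980)):
$$\theta = \arccos\left(\tfrac{1}{2}(A_{11} + A_{22} + A_{33} - 1)\right)$$
$$e_1 = \frac{A_{23} - A_{32}}{2\sin\theta}, \quad e_2 = \frac{A_{31} - A_{13}}{2\sin\theta}, \quad e_3 = \frac{A_{12} - A_{21}}{2\sin\theta}$$
Alternatively, the following method can be used:
Eigen-decomposition of the DCM yields the eigenvalues 1, cos θ + i sin θ and cos θ − i sin θ. The Euler axis is the eigenvector corresponding to the eigenvalue 1, and θ can be computed from the remaining eigenvalues.
The Euler axis can also be found using singular value decomposition, since it is the normalized vector spanning the null space of the matrix I − A.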
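A minimal sketch of the trace-based extraction is given below. It assumes the sign convention in which a rotation about the Z axis has +sin θ in row 1, column 2 of the DCM; for the transposed convention the subtractions are swapped. The function name and the tolerance are illustrative choices.

```python
import math

def axis_angle_from_dcm(a, eps=1e-9):
    """Return (axis, angle) for a 3x3 DCM `a`; fails when sin(angle) is ~0."""
    cos_theta = (a[0][0] + a[1][1] + a[2][2] - 1.0) / 2.0
    theta = math.acos(max(-1.0, min(1.0, cos_theta)))   # clamp against round-off
    denom = 2.0 * math.sin(theta)
    if abs(denom) < eps:
        raise ValueError("angle is a multiple of pi; use another method")
    axis = [(a[1][2] - a[2][1]) / denom,
            (a[2][0] - a[0][2]) / denom,
            (a[0][1] - a[1][0]) / denom]
    return axis, theta

# Quarter turn about Z in the assumed sign convention.
quarter_turn_z = [[0.0, 1.0, 0.0],
                  [-1.0, 0.0, 0.0],
                  [0.0, 0.0, 1.0]]
axis, theta = axis_angle_from_dcm(quarter_turn_z)
```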
DCM -> quaternion
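When computing a quaternion from the DCM parametrization there is a sign ambiguity, since $\mathbf{q}$ and $-\mathbf{q}$ represent the same rotation.
One way of computing the quaternion $\mathbf{q} = [q_1\ q_2\ q_3\ q_4]^T$ from the rotation matrix is as follows (sign convention of (Wertz 1980)):
$$q_4 = \pm\tfrac{1}{2}\sqrt{1 + A_{11} + A_{22} + A_{33}}, \quad q_1 = \frac{A_{23} - A_{32}}{4 q_4}, \quad q_2 = \frac{A_{31} - A_{13}}{4 q_4}, \quad q_3 = \frac{A_{12} - A_{21}}{4 q_4}$$
There are three other mathematically equivalent ways to compute $\mathbf{q}$. Numerical inaccuracy can be reduced by avoiding situations in which the denominator (in this case q4) is close to zero, which happens for rotation angles near π; for small angles q4 is close to 1. One of the other three methods looks as follows:
$$q_1 = \pm\tfrac{1}{2}\sqrt{1 + A_{11} - A_{22} - A_{33}}, \quad q_2 = \frac{A_{12} + A_{21}}{4 q_1}, \quad q_3 = \frac{A_{13} + A_{31}}{4 q_1}, \quad q_4 = \frac{A_{23} - A_{32}}{4 q_1}$$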
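One way to avoid a small denominator is to compute all four candidate pivots and divide by the largest. The sketch below does this under two assumptions: scalar-last quaternions, and a DCM sign convention in which a rotation about Z has +sin θ in row 1, column 2. It is an illustration of the idea, not the specific algorithm of any of the cited references.

```python
import math

def quaternion_from_dcm(a):
    """Scalar-last unit quaternion [q1, q2, q3, q4] from a 3x3 DCM `a`."""
    # 4*q_i**2 for i = 1..4; the four values sum to 4, so the largest is >= 1.
    four_q_sq = [
        1.0 + a[0][0] - a[1][1] - a[2][2],
        1.0 - a[0][0] + a[1][1] - a[2][2],
        1.0 - a[0][0] - a[1][1] + a[2][2],
        1.0 + a[0][0] + a[1][1] + a[2][2],
    ]
    k = max(range(4), key=lambda i: four_q_sq[i])
    d = 2.0 * math.sqrt(four_q_sq[k])          # d = 4 * q_k, with q_k > 0
    if k == 3:
        return [(a[1][2] - a[2][1]) / d, (a[2][0] - a[0][2]) / d,
                (a[0][1] - a[1][0]) / d, d / 4.0]
    if k == 0:
        return [d / 4.0, (a[0][1] + a[1][0]) / d,
                (a[0][2] + a[2][0]) / d, (a[1][2] - a[2][1]) / d]
    if k == 1:
        return [(a[0][1] + a[1][0]) / d, d / 4.0,
                (a[1][2] + a[2][1]) / d, (a[2][0] - a[0][2]) / d]
    return [(a[0][2] + a[2][0]) / d, (a[1][2] + a[2][1]) / d,
            d / 4.0, (a[0][1] - a[1][0]) / d]

# Half turn about Z: q4 = 0, so the q4-based formula would divide by zero.
half_turn_z = [[-1.0, 0.0, 0.0], [0.0, -1.0, 0.0], [0.0, 0.0, 1.0]]
q_half = quaternion_from_dcm(half_turn_z)
```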
Euler angles -> DCM
We will consider the Euler Angle x-convention for the following algorithm.
The DCM of the complete rotation, A, is the matrix product of the three matrices for the individual rotations: $A = A_3 A_2 A_1$, where $A_1$ is the matrix of the first rotation and $A_3$ that of the last. The first rotation's matrix is on the right and the last on the left, since we left-multiply the orientation vector by the rotation matrix.
If we use the general convention instead of the x-convention, the order of multiplication is reversed (this can be proven by writing out the equations).
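The axes of the rotation depend on the specific convention being used. For rotations about the X, Y and Z axes with angles φ, θ and ψ, the individual matrices are as follows (sign convention of (Wertz 1980); the opposite convention uses the transposes):
$$A_X(\phi) = \begin{bmatrix} 1 & 0 & 0 \\ 0 & \cos\phi & \sin\phi \\ 0 & -\sin\phi & \cos\phi \end{bmatrix}, \quad A_Y(\theta) = \begin{bmatrix} \cos\theta & 0 & -\sin\theta \\ 0 & 1 & 0 \\ \sin\theta & 0 & \cos\theta \end{bmatrix}, \quad A_Z(\psi) = \begin{bmatrix} \cos\psi & \sin\psi & 0 \\ -\sin\psi & \cos\psi & 0 \\ 0 & 0 & 1 \end{bmatrix}$$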
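The construction can be sketched for the 3-1-3 sequence, together with the inverse extraction of the angles as a round-trip check. The sketch assumes the sign convention in which a rotation about Z has +sin in row 1, column 2, and that the first rotation (φ about Z) is the rightmost factor; all function names are illustrative.

```python
import math

def rot_x(t):
    c, s = math.cos(t), math.sin(t)
    return [[1.0, 0.0, 0.0], [0.0, c, s], [0.0, -s, c]]

def rot_z(t):
    c, s = math.cos(t), math.sin(t)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def dcm_from_euler_313(phi, theta, psi):
    """First rotation (phi about Z) on the right, last (psi about Z) on the left."""
    return matmul(rot_z(psi), matmul(rot_x(theta), rot_z(phi)))

def euler_313_from_dcm(a):
    """Inverse of dcm_from_euler_313; valid only when sin(theta) != 0."""
    theta = math.acos(max(-1.0, min(1.0, a[2][2])))
    phi = math.atan2(a[2][0], -a[2][1])
    psi = math.atan2(a[0][2], a[1][2])
    return phi, theta, psi

angles_in = (0.4, 1.1, -0.7)
angles_out = euler_313_from_dcm(dcm_from_euler_313(*angles_in))
```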
Euler angles -> quaternion
We will consider the x-convention 3-1-3 Euler Angles for the following algorithm.
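We can compute the quaternion $\mathbf{q} = [q_1\ q_2\ q_3\ q_4]^T$ from the Euler angles (φ,θ,ψ) as follows:
$$q_1 = \cos\tfrac{\phi - \psi}{2} \sin\tfrac{\theta}{2}, \quad q_2 = \sin\tfrac{\phi - \psi}{2} \sin\tfrac{\theta}{2}, \quad q_3 = \sin\tfrac{\phi + \psi}{2} \cos\tfrac{\theta}{2}, \quad q_4 = \cos\tfrac{\phi + \psi}{2} \cos\tfrac{\theta}{2}$$
As a check, the Euler angles (0,0,ψ) give $\mathbf{q} = [0\ 0\ \sin(\psi/2)\ \cos(\psi/2)]^T$, a rotation by ψ about the Z axis, as expected.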
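A minimal sketch of one such conversion for the 3-1-3 sequence follows, assuming the scalar-last quaternion [q1, q2, q3, q4] and angles in radians. The function name is hypothetical. The special case of angles (0, 0, ψ), which must reduce to a single rotation about the Z axis, serves as a sanity check.

```python
import math

def quaternion_from_euler_313(phi, theta, psi):
    """Scalar-last unit quaternion for 3-1-3 (Z, X, Z) Euler angles."""
    half_sum = (phi + psi) / 2.0
    half_diff = (phi - psi) / 2.0
    s, c = math.sin(theta / 2.0), math.cos(theta / 2.0)
    return [s * math.cos(half_diff),
            s * math.sin(half_diff),
            c * math.sin(half_sum),
            c * math.cos(half_sum)]

q_z = quaternion_from_euler_313(0.0, 0.0, 0.8)     # pure rotation about Z
q_any = quaternion_from_euler_313(0.4, 1.1, -0.7)  # general case
```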
Euler axis/angle -> DCM
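The DCM corresponding to an Euler axis $\hat{\mathbf{e}} = [e_1\ e_2\ e_3]^T$ and angle θ can be computed as follows (sign convention of (Wertz 1980)):
$$A = I \cos\theta + (1 - \cos\theta)\,\hat{\mathbf{e}}\hat{\mathbf{e}}^T - E \sin\theta$$
with I the 3 × 3 identity matrix, and
$$E = \begin{bmatrix} 0 & -e_3 & e_2 \\ e_3 & 0 & -e_1 \\ -e_2 & e_1 & 0 \end{bmatrix}$$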
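This construction can be sketched in a few lines. The sketch assumes the convention in which the sin θ term enters with a minus sign (flip that sign to obtain the transposed matrix used by the opposite convention); the function name is illustrative.

```python
import math

def dcm_from_axis_angle(e, theta):
    """3x3 DCM from a unit axis `e` and angle `theta` (radians)."""
    c, s = math.cos(theta), math.sin(theta)
    e1, e2, e3 = e
    cross = [[0.0, -e3, e2],
             [e3, 0.0, -e1],
             [-e2, e1, 0.0]]
    return [[c * (i == j) + (1.0 - c) * e[i] * e[j] - s * cross[i][j]
             for j in range(3)] for i in range(3)]

axis = [1.0 / 3.0, 2.0 / 3.0, 2.0 / 3.0]            # unit length
a = dcm_from_axis_angle(axis, 0.9)
# The axis itself is left unchanged by the matrix.
rotated_axis = [sum(a[i][j] * axis[j] for j in range(3)) for i in range(3)]
```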
Euler axis/angle -> quaternion
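Given the Euler axis $\hat{\mathbf{e}} = [e_1\ e_2\ e_3]^T$ and angle θ, the quaternion $\mathbf{q} = [q_1\ q_2\ q_3\ q_4]^T$ can be computed by
$$q_1 = e_1 \sin(\theta/2), \quad q_2 = e_2 \sin(\theta/2), \quad q_3 = e_3 \sin(\theta/2), \quad q_4 = \cos(\theta/2)$$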
Quaternion -> DCM
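The DCM corresponding to the quaternion $\mathbf{q} = [q_1\ q_2\ q_3\ q_4]^T$, with vector part $\check{\mathbf{q}} = [q_1\ q_2\ q_3]^T$, can be computed as follows (sign convention of (Wertz 1980)):
$$A = (q_4^2 - \check{\mathbf{q}}^T\check{\mathbf{q}})\,I + 2\,\check{\mathbf{q}}\check{\mathbf{q}}^T - 2 q_4 Q$$
with I the 3 × 3 identity matrix, and
$$Q = \begin{bmatrix} 0 & -q_3 & q_2 \\ q_3 & 0 & -q_1 \\ -q_2 & q_1 & 0 \end{bmatrix}$$
which gives
$$A = \begin{bmatrix} q_1^2 - q_2^2 - q_3^2 + q_4^2 & 2(q_1 q_2 + q_3 q_4) & 2(q_1 q_3 - q_2 q_4) \\ 2(q_1 q_2 - q_3 q_4) & -q_1^2 + q_2^2 - q_3^2 + q_4^2 & 2(q_2 q_3 + q_1 q_4) \\ 2(q_1 q_3 + q_2 q_4) & 2(q_2 q_3 - q_1 q_4) & -q_1^2 - q_2^2 + q_3^2 + q_4^2 \end{bmatrix}$$
or equivalently, using the unit-norm constraint,
$$A = \begin{bmatrix} 1 - 2q_2^2 - 2q_3^2 & 2(q_1 q_2 + q_3 q_4) & 2(q_1 q_3 - q_2 q_4) \\ 2(q_1 q_2 - q_3 q_4) & 1 - 2q_1^2 - 2q_3^2 & 2(q_2 q_3 + q_1 q_4) \\ 2(q_1 q_3 + q_2 q_4) & 2(q_2 q_3 - q_1 q_4) & 1 - 2q_1^2 - 2q_2^2 \end{bmatrix}$$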
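As the formula involves only products of the quaternion elements, the conversion needs no trigonometric functions. The sketch below assumes a unit, scalar-last quaternion and the sign convention in which a rotation about Z yields +sin θ in row 1, column 2 (transpose the result for the opposite convention); the function name is hypothetical.

```python
import math

def dcm_from_quaternion(q):
    """3x3 DCM from a unit scalar-last quaternion [q1, q2, q3, q4]."""
    q1, q2, q3, q4 = q
    return [
        [1.0 - 2.0 * (q2 * q2 + q3 * q3), 2.0 * (q1 * q2 + q3 * q4), 2.0 * (q1 * q3 - q2 * q4)],
        [2.0 * (q1 * q2 - q3 * q4), 1.0 - 2.0 * (q1 * q1 + q3 * q3), 2.0 * (q2 * q3 + q1 * q4)],
        [2.0 * (q1 * q3 + q2 * q4), 2.0 * (q2 * q3 - q1 * q4), 1.0 - 2.0 * (q1 * q1 + q2 * q2)],
    ]

# Quarter turn about Z: q = [0, 0, sin(pi/4), cos(pi/4)].
h = math.sqrt(0.5)
a = dcm_from_quaternion([0.0, 0.0, h, h])
# q and -q give the same matrix, which is the sign ambiguity of the quaternion.
a_neg = dcm_from_quaternion([0.0, 0.0, -h, -h])
```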
Quaternion -> Euler angles
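Given the quaternion $\mathbf{q} = [q_1\ q_2\ q_3\ q_4]^T$, the x-convention 3-1-3 Euler angles (φ,θ,ψ) can be computed by
$$\phi = \arctan(q_1 q_3 + q_2 q_4,\ q_1 q_4 - q_2 q_3), \quad \theta = \arccos(-q_1^2 - q_2^2 + q_3^2 + q_4^2), \quad \psi = \arctan(q_1 q_3 - q_2 q_4,\ q_2 q_3 + q_1 q_4)$$
where arctan(a,b) is the two-argument arctangent. These formulae are degenerate when θ is 0 or π. For a rotation about the Z axis, i.e. q1 = q2 = 0, both arguments of each arctangent vanish and only the sum φ + ψ = 2 arctan(q3, q4) is determined; one of the two angles can then be chosen freely, for example ψ = 0.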
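The degenerate cases need explicit handling in code. The following sketch assumes a unit, scalar-last quaternion and 3-1-3 angles with θ in [0, π]; it recovers the half-sum and half-difference of φ and ψ separately, and in the degenerate cases it assigns the whole determinable angle to φ, which is an arbitrary choice of this sketch. The function name and tolerance are illustrative.

```python
import math

def euler_313_from_quaternion(q, eps=1e-12):
    """3-1-3 Euler angles (phi, theta, psi) from a unit scalar-last quaternion."""
    q1, q2, q3, q4 = q
    half_sum = math.atan2(q3, q4)     # (phi + psi) / 2
    half_diff = math.atan2(q2, q1)    # (phi - psi) / 2
    if q1 * q1 + q2 * q2 < eps:       # theta ~ 0: only phi + psi is defined
        return 2.0 * half_sum, 0.0, 0.0
    if q3 * q3 + q4 * q4 < eps:       # theta ~ pi: only phi - psi is defined
        return 2.0 * half_diff, math.pi, 0.0
    cos_theta = q4 * q4 + q3 * q3 - q1 * q1 - q2 * q2
    theta = math.acos(max(-1.0, min(1.0, cos_theta)))
    return half_sum + half_diff, theta, half_sum - half_diff

# Pure rotation by 0.8 rad about Z: the degenerate branch returns (0.8, 0, 0).
angles_z = euler_313_from_quaternion([0.0, 0.0, math.sin(0.4), math.cos(0.4)])
```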
Quaternion -> Euler axis/angle
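Given the quaternion $\mathbf{q} = [q_1\ q_2\ q_3\ q_4]^T$, define $\check{\mathbf{q}} = [q_1\ q_2\ q_3]^T$. Then the Euler axis $\hat{\mathbf{e}}$ and angle θ can be computed by
$$\hat{\mathbf{e}} = \frac{\check{\mathbf{q}}}{\lVert\check{\mathbf{q}}\rVert}, \quad \theta = 2\arccos(q_4)$$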
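Both directions of this conversion can be sketched together as a round trip. The sketch assumes a unit, scalar-last quaternion; the function names are hypothetical, and returning the X axis for the identity rotation is an arbitrary choice, since the axis is undefined when the vector part vanishes.

```python
import math

def quaternion_from_axis_angle(e, theta):
    """Scalar-last quaternion from a unit axis `e` and angle `theta` (radians)."""
    s = math.sin(theta / 2.0)
    return [e[0] * s, e[1] * s, e[2] * s, math.cos(theta / 2.0)]

def axis_angle_from_quaternion(q, eps=1e-12):
    """Return (axis, angle); the axis is arbitrary for the identity rotation."""
    norm_v = math.sqrt(q[0] ** 2 + q[1] ** 2 + q[2] ** 2)
    theta = 2.0 * math.acos(max(-1.0, min(1.0, q[3])))   # clamp against round-off
    if norm_v < eps:
        return [1.0, 0.0, 0.0], 0.0
    return [q[0] / norm_v, q[1] / norm_v, q[2] / norm_v], theta

q = quaternion_from_axis_angle([0.0, 1.0, 0.0], 1.2)
axis, theta = axis_angle_from_quaternion(q)
```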
See also
- Euler's rotation theorem
- Rotation (mathematics)
- Rotation matrix
- Euler angles
- Euler filter
- Rotation around a fixed axis
- Conversion between quaternions and Euler angles
- Euler-Rodrigues parameters
- Charts on SO(3)
- SO(4)#The Euler-Rodrigues formula for 3D rotations
References
- Evangelos A. Coutsias and Louis Romero, (1999) The Quaternions with an application to Rigid Body Dynamics, Department of Mathematics and Statistics, University of New Mexico.
- H. Goldstein, (1980) Classical Mechanics, 2nd. ed., Addison-Wesley. ISBN 0-201-02918-9
- Klumpp, A. R., Singularity-Free Extraction of a Quaternion from a Direction-Cosine Matrix, Journal of Spacecraft and Rockets, vol. 13, Dec. 1976, p. 754, 755.
- Lev D. Landau and E. M. Lifshitz, (1976) Mechanics, 3rd. ed., Pergamon Press. ISBN 0-08-021022-8 (hardcover) and ISBN 0-08-029141-4 (softcover).
- F. Landis Markley, (2003) Attitude Error Representations for Kalman Filtering, Journal of Guidance, Control and Dynamics
- J. Schmidt and H. Niemann, (2001) Using Quaternions for Parametrizing 3-D Rotations in Unconstrained Nonlinear Optimization, Vision, Modeling and Visualization (VMV01)
- James R. Wertz, (1980) Spacecraft Attitude Determination and Control, D Reidel Publishing Company. ISBN 90-277-1204-2