Karhunen-Loève theorem
From Wikipedia, the free encyclopedia
In the theory of stochastic processes, the Karhunen-Loève theorem (named after Kari Karhunen and Michel Loève) is a representation of a stochastic process as an infinite linear combination of orthogonal functions, analogous to a Fourier series representation of a function on a bounded interval. In contrast to a Fourier series where the coefficients are real numbers and the expansion basis consists of sinusoidal functions (that is, sine and cosine functions), the coefficients in the Karhunen-Loève theorem are random variables and the expansion basis depends on the process. In fact, the orthogonal basis functions used in this representation are determined by the covariance function of the process. If we regard a stochastic process as a random function F, that is, one in which the random value is a function on an interval [a, b], then this theorem can be considered as a random orthonormal expansion of F.
In the case of a centered stochastic process {Xt}t ∈ [a, b] (where centered means that the expectations E(Xt) are defined and equal to 0 for all t) satisfying a technical continuity condition, admits a decomposition
where Zk are pairwise uncorrelated random variables and the functions ek are continuous real-valued functions on [a, b] which are pairwise orthogonal in L2[a, b]. The general case of a process which is not centered can be represented by expanding the expectation function (which is a non-random function) in the basis ek .
Moreover, if the process is Gaussian, then the random variables Zk are Gaussian and stochastically independent. This result generalizes the Karhunen-Loève transform. An important example of a centered real stochastic process on [0,1] is the Wiener process and the Karhunen-Loève theorem can be used to provide a canonical orthogonal representation for it. In this case the expansion consists of sinusoidal functions.
The above expansion into uncorrelated random variables is also known as the Karhunen-Loève expansion.
[edit] Detailed formulation
We will formulate the result in terms of real random variables, although it is applicable without change to vector-valued random variables.
If X and Y are random variables, the inner product is defined by
This is defined if both X and Y have finite second moments i.e., are square integrable. Note that the inner product is related to covariance and correlation. In particular, for random variables of mean zero, covariance and inner product coincide. If {Xt}t is a centered process, the covariance function of {Xt}t is
Note that if {Xt}t is centered and t1, ≤ t2, ..., ≤ tN are points in [a, b], then
Theorem. Consider a centered stochastic process {Xt}t indexed by t in the interval [a, b] with covariance function CovX. Suppose the covariance function CovX(t,s) is jointly continuous in t, s. Then CovX can be regarded as a positive definite kernel and so by Mercer's theorem, the corresponding integral operator T on L2[a,b] (relative to Lebesgue measure on [a,b]) has an orthonormal basis of eigenvectors. Let {ei}i be the eigenvectors of T corresponding to non-zero eigenvalues and
Then Zi are centered orthogonal random variables and
where the convergence is in the mean and is uniform in t. Moreover
where λi is the eigenvalue corresponding to the eigenvector ei.
In the statement of the theorem, the integral defining Zi, can be defined as the limit in the mean of Cauchy sums of random variables:
where
Since the limit in the mean of jointly Gaussian random variables is jointly Gaussian, and jointly Gaussian random (centered) variables are independent if and only if they are orthogonal, we can also conclude:
Theorem. The variables Zi have a joint Gaussian distribution and are stochastically independent if the original process {Xt}t is Gaussian.
In the gaussian case, since the variables Zi are independent, we can say more:
almost surely.
Note that by generalizations of Mercer's theorem we can replace the interval [a, b] with other compact spaces C and Lebesgue measure on [a, b] with a Borel measure whose support is C.
[edit] The Wiener process
There are numerous equivalent characterizations of the Wiener process which is a mathematical formalization of Brownian motion. Here we regard it as the centered Gaussian process {Bt} with covariance function
The eigenvectors of the covariance kernel are easily determined. These are
and the corresponding eigenvalues are
This gives the following representation of the Wiener process:
Theorem. There is a sequence {Wi}i of independent Gaussian random variables with mean zero and variance 1 such that
Convergence is uniform in t and in the L2 norm, that is
uniformly in t.
[edit] References
- I. Guikhman, A. Skorokhod, ´'Introduction a la Théorie des Processus Aléatoires´' Éditions MIR, 1977
- B. Simon, Functional Integration and Quantum Physics, Academic Press, 1979