Min-max theorem

From Wikipedia, the free encyclopedia

"Variational theorem" redirects here. The term is also sometimes applied to the variational principle.

In linear algebra and functional analysis, the min-max theorem, or variational theorem, or Courant-Fischer-Weyl min-max principle, is a result that gives a variational characterization of eigenvalues of compact Hermitian operators on Hilbert spaces. It can be viewed as the starting point of many results of similar nature.

This article first discusses the finite-dimensional case and its applications before considering compact operators on infinite-dimensional Hilbert spaces. For compact operators, the proof of the main theorem uses essentially the same idea as the finite-dimensional argument.

The min-max theorem can be extended to self-adjoint operators that are bounded below.


Matrices

Let A be an n × n Hermitian matrix. As with many other variational results on eigenvalues, one considers the Rayleigh-Ritz quotient R: Cⁿ \ {0} → ℝ defined by

$R(x) = \frac{(Ax, x)}{\|x\|^2}$

where (·, ·) denotes the Euclidean inner product on Cⁿ. Since R(x) is unchanged when x is rescaled, one can equivalently restrict to unit vectors and consider

$f(x) = (Ax, x), \; \|x\| = 1.$

For Hermitian matrices, the range of the continuous function R(x), or f(x), is a compact interval [a, b] of the real line. The maximum b and the minimum a are the largest and smallest eigenvalues of A, respectively. The min-max theorem is a refinement of this fact.
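The statement about the range of R can be checked numerically. The following is a minimal sketch, assuming NumPy is available; the random test matrix, the sample count and the helper name `rayleigh_quotient` are illustrative choices, not part of the theorem.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 5

# Build a random Hermitian test matrix A = (B + B*) / 2 (illustrative choice).
B = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
A = (B + B.conj().T) / 2


def rayleigh_quotient(A, x):
    """Return R(x) = (Ax, x) / ||x||^2, which is real when A is Hermitian."""
    return (np.vdot(x, A @ x) / np.vdot(x, x)).real


# eigvalsh returns the eigenvalues of a Hermitian matrix in ascending order.
eigenvalues = np.linalg.eigvalsh(A)
a, b = eigenvalues[0], eigenvalues[-1]

# Evaluate R at many random nonzero vectors.
samples = [
    rayleigh_quotient(A, rng.standard_normal(n) + 1j * rng.standard_normal(n))
    for _ in range(1000)
]

# Every sampled value should lie in [a, b], up to rounding error.
tol = 1e-10
all_inside = all(a - tol <= r <= b + tol for r in samples)
print(all_inside)
```

Random sampling only illustrates the bounds; the endpoints a and b themselves are attained at eigenvectors for the smallest and largest eigenvalues.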

Lemma Let S_k be a k dimensional subspace of Cⁿ.

1. If the eigenvalues of A are listed in increasing order λ₁ ≤ ... ≤ λ_k ≤ ... ≤ λ_n, then there exists x ∈ S_k, ||x|| = 1 such that (Ax, x) ≥ λ_k.
2. Similarly, if the eigenvalues of A are listed in decreasing order λ₁ ≥ ... ≥ λ_k ≥ ... ≥ λ_n, then there exists y ∈ S_k, ||y|| = 1 such that (Ay, y) ≤ λ_k.

Proof:

1. Let u₁, ..., u_n be orthonormal eigenvectors of A, with u_i corresponding to λ_i. Consider the subspace S' = span{u_k, ..., u_n}, which has dimension n - k + 1. Since dim S' + dim S_k = n + 1 > n, the subspace S' ∩ S_k contains a nonzero vector. So there exists x ∈ S' ∩ S_k with ||x|| = 1. But for all unit vectors x ∈ S', (Ax, x) ≥ λ_k. So the claim holds.
2. Exactly the same as 1. above, with the inequalities reversed.
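The inequality on S' used in the proof can be written out by expanding in the eigenbasis. This is only an expanded restatement, assuming the eigenvectors u_i are chosen orthonormal (always possible for a Hermitian matrix); the coefficients c_i are notation introduced here. For a unit vector x = c_k u_k + ... + c_n u_n in S', with the eigenvalues in increasing order,

$(Ax, x) = \sum_{i=k}^{n} \lambda_i |c_i|^2 \geq \lambda_k \sum_{i=k}^{n} |c_i|^2 = \lambda_k.$

With the eigenvalues in decreasing order, the same expansion gives

$(Ay, y) = \sum_{i=k}^{n} \lambda_i |c_i|^2 \leq \lambda_k \sum_{i=k}^{n} |c_i|^2 = \lambda_k.$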

The min-max theorem follows readily from the above lemma. If the eigenvalues of A are listed in increasing order λ₁ ≤ ... ≤ λ_k ≤ ... ≤ λ_n, then by the first part of the lemma, for every k dimensional subspace S_k,

$\max_{x \in S_k, \|x\| = 1} (Ax, x) \geq \lambda_k.$

This implies

$\inf_{S_k} \max_{x \in S_k, \|x\| = 1} (Ax, x) \geq \lambda_k.$

Choosing S_k = span{u₁, ..., u_k}, the maximum over S_k equals λ_k, so equality is achieved and the infimum is attained. Therefore

$\lambda_k ^{\uparrow} = \min_{S_k} \max_{x \in S_k, \|x\| = 1} (Ax, x),$

where the ↑ indicates that it is the k-th eigenvalue in increasing order. Similarly, the second part of the lemma gives

$\lambda_k ^{\downarrow} = \max_{S_k} \min_{x \in S_k, \|x\| = 1} (Ax, x).$

The min-max theorem consists of the above two equalities.
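The first equality can be illustrated numerically. The sketch below assumes NumPy; the random matrix, the values of n and k and the helper `max_on_subspace` are illustrative choices. It uses the fact stated earlier that the maximum of (Ax, x) over unit vectors equals the largest eigenvalue, applied to the k × k matrix Q*AQ, where the columns of Q form an orthonormal basis of S_k.

```python
import numpy as np

rng = np.random.default_rng(1)
n, k = 6, 3

# Random real symmetric (hence Hermitian) test matrix.
B = rng.standard_normal((n, n))
A = (B + B.T) / 2

# Ascending eigenvalues; the columns of U are orthonormal eigenvectors.
eigenvalues, U = np.linalg.eigh(A)
lambda_k = eigenvalues[k - 1]  # k-th eigenvalue in increasing order (1-based k)


def max_on_subspace(A, Q):
    """Max of (Ax, x) over unit x in the column span of Q.

    Q must have orthonormal columns. Writing x = Q c with ||c|| = 1 gives
    (Ax, x) = (Q* A Q c, c), so the maximum is the largest eigenvalue of Q* A Q.
    """
    return np.linalg.eigvalsh(Q.conj().T @ A @ Q)[-1]


# On random k dimensional subspaces the maximum is never below lambda_k.
random_maxima = []
for _ in range(200):
    Q = np.linalg.qr(rng.standard_normal((n, k)))[0]  # orthonormal basis
    random_maxima.append(max_on_subspace(A, Q))

# The subspace span{u_1, ..., u_k} attains the bound.
attained = max_on_subspace(A, U[:, :k])

lower_bound_holds = min(random_maxima) >= lambda_k - 1e-10
equality_attained = abs(attained - lambda_k) < 1e-10
print(lower_bound_holds, equality_attained)
```

Sampling subspaces cannot prove the minimum, but it shows both halves of the argument: the lower bound on every subspace tried, and equality on the span of the first k eigenvectors.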

Applications

Min-max principle for singular values

The singular values {σ_k} of a square matrix M are the square roots of the eigenvalues of M*M (equivalently MM*). An immediate consequence of the first equality of the min-max theorem is

$\sigma_k ^{\uparrow} = \min_{S_k} \max_{x \in S_k, \|x\| = 1} (M^* Mx, x)^{\frac{1}{2}}= \min_{S_k} \max_{x \in S_k, \|x\| = 1} \| Mx \|.$

Similarly,

$\sigma_k ^{\downarrow} = \max_{S_k} \min_{x \in S_k, \|x\| = 1} \| Mx \|.$
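A numerical sketch of the max-min form, assuming NumPy and a random real test matrix; the helper `min_norm_on_subspace` is a hypothetical name introduced for illustration. For a matrix Q with orthonormal columns spanning S_k, the minimum of ||Mx|| over unit x in S_k is the smallest singular value of MQ.

```python
import numpy as np

rng = np.random.default_rng(2)
n, k = 5, 2
M = rng.standard_normal((n, n))

# np.linalg.svd returns the singular values in descending order.
sigma_desc = np.linalg.svd(M, compute_uv=False)

# Square roots of the eigenvalues of M* M, reordered to descending.
eig_MtM = np.linalg.eigvalsh(M.T @ M)  # ascending
sigma_from_eig = np.sqrt(np.clip(eig_MtM, 0, None))[::-1]


def min_norm_on_subspace(M, Q):
    """Min of ||Mx|| over unit x in span(Q), for Q with orthonormal columns."""
    return np.linalg.svd(M @ Q, compute_uv=False)[-1]


# On random k dimensional subspaces the minimum never exceeds sigma_k.
minima = [
    min_norm_on_subspace(M, np.linalg.qr(rng.standard_normal((n, k)))[0])
    for _ in range(200)
]

# The span of the top k right singular vectors attains sigma_k.
Vh = np.linalg.svd(M)[2]
attained = min_norm_on_subspace(M, Vh[:k].conj().T)

sigma_k = sigma_desc[k - 1]  # k-th singular value in decreasing order
matches_eigs = np.allclose(sigma_desc, sigma_from_eig)
upper_bound_holds = max(minima) <= sigma_k + 1e-10
equality_attained = abs(attained - sigma_k) < 1e-10
print(matches_eigs, upper_bound_holds, equality_attained)
```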

Cauchy interlacing theorem

Let A be an n × n Hermitian matrix. An m × m matrix B, where m ≤ n, is called a compression of A if there exists an orthogonal projection P onto a subspace of dimension m such that PAP = B, where PAP is regarded as an operator on that subspace. The Cauchy interlacing theorem states:

Theorem If the eigenvalues of A are α₁ ≤ ... ≤ α_n, and those of B are β₁ ≤ ... ≤ β_j ≤ ... ≤ β_m, then for all j,

$\alpha_j \leq \beta_j \leq \alpha_{n-m+j}.$

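This can be proven using the min-max principle. Let b₁, ..., b_m be orthonormal eigenvectors of B, with b_i corresponding to β_i, and let S_j be the j dimensional subspace S_j = span{b₁, ..., b_j}. Every x ∈ S_j lies in the range of P, so Px = x and (PAPx, x) = (Ax, x). Hence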

$\beta_j = \max_{x \in S_j, \|x\| = 1} (Bx, x) = \max_{x \in S_j, \|x\| = 1} (PAPx, x)= \max_{x \in S_j, \|x\| = 1} (Ax, x).$

By the first part of the min-max theorem, α_j is the minimum of such maxima over all j dimensional subspaces, so

$\alpha_j \leq \beta_j.$

On the other hand, if we define S_{m-j+1} = span{b_j, ..., b_m}, then

$\beta_j = \min_{x \in S_{m-j+1}, \|x\| = 1} (Bx, x) = \min_{x \in S_{m-j+1}, \|x\| = 1} (PAPx, x)= \min_{x \in S_{m-j+1}, \|x\| = 1} (Ax, x) \leq \alpha_{n-m+j},$

where the last inequality is given by the second part of the min-max theorem, since α_{n-m+j} is the (m - j + 1)-th eigenvalue of A in decreasing order.

Notice that, when n - m = 1, we have

$\alpha_j \leq \beta_j \leq \alpha_{j+1}.$

Hence the name interlacing theorem.
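The interlacing inequalities can be checked numerically. This is a minimal sketch, assuming NumPy and assuming the compression is represented as B = Q*AQ, where the columns of Q are an orthonormal basis of the range of P; the random matrices and the sizes n and m are illustrative.

```python
import numpy as np

rng = np.random.default_rng(3)
n, m = 6, 4

# Random real symmetric (hence Hermitian) test matrix A.
C = rng.standard_normal((n, n))
A = (C + C.T) / 2

# Orthonormal basis Q of a random m dimensional subspace; P = Q Q* is the
# orthogonal projection onto it, and B = Q* A Q is the matrix of PAP
# restricted to that subspace (assumed representation of the compression).
Q = np.linalg.qr(rng.standard_normal((n, m)))[0]
B = Q.T @ A @ Q

alpha = np.linalg.eigvalsh(A)  # alpha_1 <= ... <= alpha_n
beta = np.linalg.eigvalsh(B)   # beta_1 <= ... <= beta_m

# Check alpha_j <= beta_j <= alpha_{n-m+j} for all j (0-based indices here).
tol = 1e-10
interlaced = all(
    alpha[j] - tol <= beta[j] <= alpha[n - m + j] + tol
    for j in range(m)
)
print(interlaced)
```

Taking Q to be m columns of the identity matrix gives the familiar special case in which B is a principal submatrix of A.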

Compact operators

Let A be a compact, Hermitian operator on a Hilbert space H. Recall that the spectrum of such an operator forms a set of real numbers whose only possible cluster point is zero. Every nonzero number in the spectrum is an eigenvalue. Since the positive eigenvalues may accumulate at zero, it no longer makes sense to list them in increasing order. Let the positive eigenvalues of A be

$\cdots \le \lambda_k \le \cdots \le \lambda_1,$

where multiplicity is taken into account as in the matrix case. When H is infinite dimensional, this sequence may be infinite; the argument below assumes that A has at least k positive eigenvalues. We now apply the same reasoning as in the matrix case. Let u_i be orthonormal eigenvectors with Au_i = λ_i u_i, let S_k ⊂ H be a k dimensional subspace, and let S' be the orthogonal complement of span{u₁, ..., u_{k-1}}; when the u_i form an orthonormal basis of H, this is the closure of the linear span of {u_k, u_{k+1}, ...}. The subspace S' has codimension k - 1. By the same dimension count argument as in the matrix case, S' ∩ S_k contains a nonzero vector. So there exists x ∈ S' ∩ S_k with ||x|| = 1. Since it is an element of S', such an x necessarily satisfies

$(Ax, x) \le \lambda_k.$

Therefore, for all S_k

$\inf_{x \in S_k, \|x\| = 1}(Ax,x) \le \lambda_k$

Because A is compact, the function f(x) = (Ax, x) is weakly continuous on bounded sets. Here, moreover, S_k is finite dimensional, so its unit sphere is compact and the infimum is attained. This lets us replace the infimum by a minimum:

$\min_{x \in S_k, \|x\| = 1}(Ax,x) \le \lambda_k.$