Matrix norm

From Wikipedia, the free encyclopedia

In mathematics, a matrix norm is a natural extension of the notion of a vector norm to matrices.

Contents

[edit] Properties of matrix norms

In what follows, K will denote the field of real or complex numbers. Consider the space K^{m \times n} of all matrices with m rows and n columns with entries in K.

A matrix norm on K^{m \times n} satisfies all the properties of vector norms. That is, if \|A\| is the norm of the matrix A, then,

  • \|A\|\ge 0 with equality if and only if A = 0
  • \|\alpha A\|=|\alpha| \|A\| for all α in K and all matrices A in K^{m \times n}
  • \|A+B\| \le \|A\|+\|B\| for all matrices A and B in K^{m \times n}.

Additionally, some matrix norms defined on n-by-n matrices (but not all such norms) satisfy one or more of the following conditions which relate to the fact that matrices are more than just vectors:

A matrix norm that satisfies the first additional property is called a sub-multiplicative norm. The set of all n-by-n matrices, together with such a sub-multiplicative norm, is an example of a Banach algebra.

(In some books the terminology matrix norm is used only for those norms which are sub-multiplicative.)

[edit] Induced norm

If vector norms on Km and Kn are given (K is field of real or complex numbers), then one defines the corresponding induced norm or operator norm on the space of m-by-n matrices as the following maxima:

 \begin{align}
\|A\| &= \max\{\|Ax\| : x\in K^n \mbox{ with }\|x\|\le 1\} \\
&= \max\{\|Ax\| : x\in K^n \mbox{ with }\|x\| = 1\} \\
&= \max\left\{\frac{\|Ax\|}{\|x\|} : x\in K^n \mbox{ with }x\ne 0\right\}.
\end{align}

If m = n and one uses the same norm on the domain and the range, then the induced operator norm is a sub-multiplicative matrix norm.

For example, the operator norm corresponding to the p-norm for vectors is:

 \left \| A \right \| _p = \max \limits _{x \ne 0} \frac{\left \| A x\right \| _p}{\left \| x\right \| _p}.

In the case of p = 1 and p=\infty, the norms can be computed as:

 \begin{align}
& \left \| A \right \| _1 = \max \limits _{1 \leq j \leq n} \sum _{i=1} ^m | a_{ij} | \\
& \left \| A \right \| _\infty = \max \limits _{1 \leq i \leq m} \sum _{j=1} ^n | a_{ij} | .
\end{align}

These are different from the Schatten p-norms for matrices, which are also usually denoted by  \left \| A \right \| _p .

In the special case of p = 2 (the Euclidean norm) and m = n (square matrices), the induced matrix norm is the spectral norm. The spectral norm of a matrix A is the largest singular value of A or the square root of the largest eigenvalue of the positive-semidefinite matrix A*A:

\left \| A \right \| _2=\sqrt{\lambda_{\text{max}}(A^* A)}

where A* denotes the conjugate transpose of A.

Any induced norm satisfies the inequality

\left \| A \right \| \ge \rho(A),

where ρ(A) is the spectral radius of A. In fact, it turns out that ρ(A) is the infimum of all induced norms of A.

Furthermore, we have

\lim_{r\rarr\infty}\|A^r\|^{1/r}=\rho(A).

[edit] "Entrywise" norms

These vector norms treat a matrix as an m \times n vector, and use one of the familiar vector norms.

For example, using the p-norm for vectors, we get:

\Vert A \Vert_{p} = \Big( \sum_{i=1}^m \sum_{j=1}^n |a_{ij}|^p \Big)^{1/p}. \,

Note: Entrywise p norms are not to be confused with Induced p norms.

[edit] Frobenius norm

For p = 2, this is called the Frobenius norm or the Hilbert-Schmidt norm, though the latter term is often reserved for operators on Hilbert space. This norm can be defined in various ways:

\|A\|_F=\sqrt{\sum_{i=1}^m\sum_{j=1}^n |a_{ij}|^2}=\sqrt{\operatorname{trace}(A^\ast{} A)}=\sqrt{\sum_{i=1}^{\min\{m,\,n\}} \sigma_{i}^2}

where A* denotes the conjugate transpose of A, σi are the singular values of A, and the trace function is used. The Frobenius norm is very similar to the Euclidean norm on Kn and comes from an inner product on the space of all matrices.

The Frobenius norm is submultiplicative and is very useful for numerical linear algebra. This norm is often easier to compute than induced norms.

[edit] Trace norm

The trace norm is defined as

\|A\|_{tr}
=\operatorname{trace}(\sqrt{A^*A})=\sum_{i=1}^{\min\{m,\,n\}} \sigma_{i}.

[edit] Max norm

The max norm is defined as \|A\|_{max}=\max\{|a_{ij}|\}.

[edit] Consistent norms

A matrix norm \| \cdot \|_{ab} on K^{m \times n} is called consistent with a vector norm \| \cdot \|_{a} on Kn and a vector norm \| \cdot \|_{b} on Km if:

\|Ax\|_b \leq \|A\|_{ab} \|x\|_a

for all A \in K^{m \times n}, x \in K^n. All induced norms are consistent by definition.

[edit] Equivalence of norms

For any two vector norms ||·||α and ||·||β, we have

r\left\|A\right\|_\alpha\leq\left\|A\right\|_\beta\leq s\left\|A\right\|_\alpha

for some positive numbers r and s, for all matrices A in K^{m \times n}. In other words, they are equivalent norms; they induce the same topology on K^{m \times n}.

Moreover, when A\in \mathbb{R}^{n\times n}, then for any vector norm ||·||, there exists a unique positive number k such that k||A|| is a (submultiplicative) matrix norm.

A matrix norm ||·||p is said to be minimal if there exists no other matrix norm ||·||q satisfying ||·||q≤||·||p for all ||·||q.

[edit] Examples of norm equivalence

For matrix A\in\mathbb{R}^{m\times n} the following inequalities hold 1,2:

  • \|A\|_2\le\|A\|_F\le\sqrt{n}\|A\|_2
  • \|A\|_{max} \le\|A\|_2\le\sqrt{mn}\|A\|_{max}
  • \frac{1}{\sqrt{n}}\|A\|_\infty\le\|A\|_2\le\sqrt{m}\|A\|_\infty
  • \frac{1}{\sqrt{m}}\|A\|_1\le\|A\|_2\le\sqrt{n}\|A\|_1
  • \|A\|_2\le\sqrt{\|A\|_1\|A\|_\infty}

[edit] References

  1. Golub, Gene; Charles F. Van Loan (1996). Matrix Computations - Third Edition. Baltimore: The Johns Hopkins University Press, 56-57. ISBN 0-8018-5413-X.
  2. Roger Horn and Charles Johnson. Matrix Analysis, Chapter 5, Cambridge University Press, 1985. ISBN 0-521-38632-2.
  3. L. Thomas, Norms and Condition Numbers of a Matrix [1]
  4. James W. Demmel, Applied Numerical Linear Algebra, section 1.7, published by SIAM, 1997.