Young's inequality

From Wikipedia, the free encyclopedia

In mathematics, the standard form of Young's inequality states that if a and b are nonnegative real numbers and p and q are positive real numbers such that 1/p + 1/q = 1 then we have

ab \le \frac{a^p}{p} + \frac{b^q}{q}.

Equality holds if and only if ap = bq. Young's inequality is a special case of the inequality of weighted arithmetic and geometric means. It is named for William Henry Young.

An elementary case of Young's inequality is the inequality with exponent 2,

ab \le \frac{a^2}{2} + \frac{b^2}{2},

which also gives rise to the so-called Young's inequality with ε (valid for any ε > 0),

ab \le \frac{a^2}{2\varepsilon} + \frac{\varepsilon b^2}{2}.

Contents

[edit] Generalization using Legendre transforms

If f is a convex function and its Legendre transform is denoted by g, then

 ab \le f(a) + g(b). \,

This follows immediately from the definition of the Legendre transform. This inequality also holds — in the form a ·bf(a) + g(b) — if f is a convex function taking a vector argument (Arnold 1989, §14).

[edit] Examples

  • The Legendre transform of f(a) = ap/p is g(b) = bq/q with q such that 1/p + 1/q = 1, and thus the standard Young inequality mentioned above is a special case.
  • The Legendre transform of f(a) = ea – 1 is g(b) = 1 – b + b ln b, hence ab ≤ eab + b ln b for all non-negative a and b. This estimate is useful in large deviations theory under exponential moment conditions, because b ln b appears in the definition of relative entropy, which is the rate function in Sanov's theorem.

[edit] An inequality for Lp norms

In real analysis, the following result, first proved in Young (1912) is also called Young's inequality:

Suppose f is in Lp and g is in Lq and

\frac{1}{p} + \frac{1}{q} = \frac{1}{r} + 1

with 1 ≤ p, q ,r ≤ ∞ and 1/p + 1/q ≥ 1. Then

\|f*g\| _r\le\|f\|_p\|g\|_q.

Here the star denotes convolution, Lp is Lebesgue space, and

\|f\|_p = \Bigl(\int |f(x)|^p\,dx \Bigr)^{1/p}

denotes the usual Lp norm. This can be proved by use of the Hölder inequality.

An example application is that Young's inequality can be used to show that the Heat Semigroup is a contraction semigroup using the L2 norm.

The result can be strengthened to a sharp form, viz

\|f*g\| _r\le c_{p,q} \|f\|_p\|g\|_q.

where the constant cp,q<1.

[edit] Use

Young's inequality is used in the proof of Hölder's inequality. It is also used widely to estimate the norm of nonlinear terms in PDE theory, since it allows one to estimate a product of two terms by a sum of the same terms raised to a power and scaled.

[edit] Proof of the standard form

The proof is trivial if a = 0 or b = 0. Therefore, assume a, b > 0.

If ap = bq, then by the rules for exponentiation and the assumption 1/p + 1/q = 1,

ab = a(b^q)^{1/q} = aa^{p/q} =a^{p/p}a^{p/q}  
=a^{p(1/p+1/q)}= a^p = {a^p \over p} + {b^q \over q},

and we have equality in Young's inequality.

Assume in addition apbq for the remaining part of the proof. By the functional equation of the natural logarithm,


\ln ab
=\ln a +\ln b
=\frac{\ln a^p}p + \frac{\ln b^q}q.

Note that the natural logarithm is strictly increasing, because its first derivative is positive for every positive number, hence ln ap ≠ ln bq. Its inverse is the exponential function f(x) = exp(x), which is strictly convex, since its second derivative is positive for every real number. Therefore the exponential function satisfies the defining property of strictly convex functions: for every t in the open interval (0,1) and all real numbers x and y with x ≠ y,

f(tx+(1-t)y)< t f(x)+(1-t)f(y)\,.

Applying this strict inequality for t = 1/p, 1 – t = 1/q, x = ln ap and y = ln bq gives


ab
=\exp(\ln ab)
=\exp\Bigl(\frac{\ln a^p}p + \frac{\ln b^q}q\Bigr)
<\frac{\exp(\ln a^p)}p+\frac{\exp(\ln b^q)}q
= \frac{a^p}p + \frac{b^q}q,

which completes the proof.

[edit] Proof of the elementary case

Young's inequality with exponent 2 is the special case p = q = 2. However, it has a more elementary proof, just observe that

0\le (a-b)^2=a^2+b^2-2ab,

add 2ab to every side and divide by 2.

Young's inequality with ε follows by applying Young's inequality with exponent 2 to

a'=a/\sqrt{\varepsilon},\text{ }b'=\sqrt{\varepsilon}b.

[edit] References

  • Arnold, V. I. (1989), Mathematical Methods of Classical Mechanics (second ed.), Springer, ISBN 978-0-387-96890-2 .
  • Young, W.H. (1912), “On the multiplication of successions of Fourier constants”, Proc. Roy. Soc. Lond. Series A 87: 331-339 .