Convolution

Visual explanation of convolution.
1.Express each function in terms of a dummy variable \tau.
2.Transpose one of the functions: g(\tau)g(-\tau).
3.Add a time-offset, t, which allows g(t-\tau) to slide along the \tau-axis.
4.Start t at -∞ and slide it all the way to +∞. Wherever the two functions intersect, find the integral of their product. In other words, compute a sliding, weighted-average of function f(\tau), where the weighting function is g(-\tau).
The resulting waveform (not shown here) is the convolution of functions f and g. If f(t) is a unit impulse, the result of this process is simply g(t), which is therefore called the impulse response.

In mathematics and, in particular, functional analysis, convolution is a mathematical operation on two functions f and g, producing a third function that is typically viewed as a modified version of one of the original functions. Convolution is similar to cross-correlation. It has applications that include statistics, computer vision, image and signal processing, electrical engineering, and differential equations.

The convolution can be defined for functions on groups other than Euclidean space. In particular, the circular convolution can be defined for periodic functions (that is, functions on the circle), and the discrete convolution can be defined for functions on the set of integers. These generalizations of the convolution have applications in the field of numerical analysis and numerical linear algebra, and in the design and implementation of finite impulse response filters in signal processing.

Contents

Definition

The convolution of f\, and g\, is written f * g\,. It is defined as the integral of the product of the two functions after one is reversed and shifted. As such, it is a particular kind of integral transform:

(f * g )(t)\ \ \,   \stackrel{\mathrm{def}}{=}\ \int_{-\infty}^{\infty} f(\tau)\cdot g(t - \tau)\, d\tau
= \int_{-\infty}^{\infty} f(t-\tau)\cdot g(\tau)\, d\tau.       (commutativity)

While the symbol t\, is used above, it need not represent the time domain. But in that context, the convolution formula can be described as a weighted average of the function f(\tau)\, at the moment t.\,  The weighting function is g(-\tau),\, simply shifted by amount t\,.  As t\, changes, the weighting function emphasizes different parts of the input function.

More generally, if f and g are complex-valued functions on Rd, then their convolution may be defined as the integral

(f * g )(x) = \int_{\mathbf{R}^d} f(y)g(x-y)\,dy = \int_{\mathbf{R}^d} f(x-y)g(y)\,dy.

Circular convolution

Main article: Circular convolution

When a function gT is periodic, with period T (and f is not), then ƒ*gT is also periodic and identical to:

(f * g_T)(t) \equiv \int_{t_0}^{t_0+T} \left[\sum_{k=-\infty}^{\infty} f(\tau + kT)\right] g_T(t - \tau)\ d\tau,\,

where to is an arbitrary choice. The summation is called a periodic extension of the function ƒ.

If gT is a periodic extension of another function, g, then ƒ*gT is known as a circular, cyclic, or periodic convolution of f and g.

Discrete convolution

For complex-valued functions ƒ, g defined on the set of integers, the discrete convolution of ƒ and g is given by:

(f * g)[n]\ \stackrel{\mathrm{def}}{=}\ \sum_{m=-\infty}^{\infty} f[m]\cdot g[n - m]\,
= \sum_{m=-\infty}^{\infty} f[n-m]\cdot g[m].\,       (commutativity)

When multiplying two polynomials, the coefficients of the product are given by the convolution of the original coefficient sequences, extended with zeros where necessary to avoid undefined terms; this is known as the Cauchy product of the coefficients of the two polynomials.

Circular discrete convolution

When a function gN is periodic, with period N (and f is not), then ƒ*gN is also periodic and identical to:

(f * g_N)[n] \equiv \sum_{m=0}^{N-1} \left(\sum_{k=-\infty}^{\infty} {f}[m+kN] \right) g_N[n-m].\,

The summation on k is called a periodic extension of the function ƒ.

If gN is a periodic extension of another function, g, then ƒ*gN is known as a circular, cyclic, or periodic convolution of f and g.

When the non-zero durations of both f and g are limited to the interval [0, N-1],  ƒ*gN reduces to these common forms:

(f * g_N)[n] = \sum_{m=0}^{N-1} f[m]\ g_N[n-m]\, 

 

 ( Eq.1 )

 

= \sum_{m=0}^n f[m]\ g[n-m] + \sum_{m=n+1}^{N-1} f[m]\ g[N+n-m]\,
= \sum_{m=0}^{N-1} f[m]\ g[(n-m)_{\mod{N}}]\quad \stackrel{\mathrm{def}}{=}\quad (f *_N g)[n].\,

The notation (f *_N g)\, for cyclic convolution denotes convolution over the cyclic group of integers modulo N.

Fast convolution algorithms

In many situations, discrete convolutions can be converted to circular convolutions so that fast transforms with a convolution property can be used to implement the computation. For example, convolution of digit sequences is the kernel operation in multiplication of multi-digit numbers, which can therefore be efficiently implemented with transform techniques (Knuth 1997, §4.3.3.C; von zur Gathen & Gerhard 2003, §8.2).

Eq.1  requires N arithmetic operations per output value and N2 operations for N outputs. That can be significantly reduced with any of several fast algorithms. Digital signal processing and other applications typically use fast convolution algorithms to reduce the cost of the convolution to O(N log N) complexity.

The most common fast convolution algorithms use fast Fourier transform (FFT) algorithms via the circular convolution theorem. Specifically, the circular convolution of two finite-length sequences is found by taking an FFT of each sequence, multiplying pointwise, and then performing an inverse FFT. Convolutions of the type defined above are then efficiently implemented using that technique in conjunction with zero-extension and/or discarding portions of the output. Other fast convolution algorithms, such as the Schönhage-Strassen algorithm, use fast Fourier transforms in other rings.

Domain of definition

The convolution of two complex-valued functions on Rd

f*g(x) = \int_{\mathbf{R}^d}f(y)g(x-y)\,dy

is well-defined only if ƒ and g decay sufficiently rapidly at infinity in order for the integral to exist. Conditions for the existence of the convolution may be tricky, since a blow-up in g at infinity can be easily offset by sufficiently rapid decay in ƒ. The question of existence thus may involve different conditions on ƒ and g.

Compactly supported functions

If ƒ and g are compactly supported continuous functions, then their convolution exists, and is also compactly supported and continuous (Hörmander). More generally, if either function (say ƒ) is compactly supported and the other is locally integrable, then the convolution ƒ*g is well-defined and continuous.

Integrable functions

The convolution of ƒ and g exists if ƒ and g are both Lebesgue integrable functions (in L1(Rd)), and ƒ*g is also integrable (Stein & Weiss 1971, Theorem 1.3). This is a consequence of Tonelli's theorem. Likewise, if ƒ∈L1(Rd) and g ∈ Lp(Rd) where 1 ≤ p ≤ ∞, then ƒ*g ∈ Lp(Rd) and

\|{f}*g\|_p\le \|f\|_1\|g\|_p.

Functions of rapid decay

In addition to compactly supported functions and integrable functions, functions that have sufficiently rapid decay at infinity can also be convolved. An important feature of the convolution is that if ƒ and g both decay rapidly, then ƒ*g also decays rapidly. In particular, if ƒ and g are rapidly decreasing functions, then so is the convolution ƒ*g. Combined with the fact that convolution commutes with differentiation (see Properties), it follows that the class of Schwartz functions is closed under convolution.

Distributions

Under some circumstances, it is possible to define the convolution of a function with a distribution, or of two distributions. If ƒ is a compactly supported function and g is a distribution, then ƒ*g is a smooth function defined by a distributional formula analogous to

\int_{\mathbf{R}^d} {f}(y)g(x-y)\,dy.

More generally, it is possible to extend the definition of the convolution in a unique way so that the associative law

f*(g*\phi) = (f*g)*\phi

remains valid in the case where ƒ is a distribution, and g a compactly supported distribution (Hörmander 1983, §4.2).

Properties

Algebraic properties

See also: Convolution algebra

The convolution defines a product on the linear space of integrable functions. This product satisfies the following algebraic properties, which formally mean that the space of integrable functions with the product given by convolution is a commutative algebra without identity (Strichartz 1994, §3.3). Other linear spaces of functions, such as the space of continuous functions of compact support, are closed under the convolution, and so also form commutative algebras.

Commutativity
f * g = g * f  \,
Associativity
f  * (g  * h) = (f  * g)  * h \,
Distributivity
f  * (g + h) = (f  * g) + (f  * h) \,
Associativity with scalar multiplication
a (f  * g) = (a f)  * g = f  * (a g) \,

for any real (or complex) number {a}\,.

Multiplicative identity

No algebra of functions possesses an identity for the convolution. The lack of identity is typically not a major inconvenience, since most collections of functions on which the convolution is performed can be convolved with a delta distribution or, at the very least (as is the case of L1) admit approximations to the identity.

The linear space of compactly supported distributions does, however, admit an identity under the convolution. Specifically,

f*\delta = f

where δ is the delta distribution.

Differentiation

In the one variable case,

\frac{d}{dx}({f} * g) = \frac{df}{dx}  * g = {f} * \frac{dg}{dx} \,

where d/dx is the derivative. More generally, in the case of functions of several variables, an analogous formula holds with the partial derivative:

\frac{\partial}{\partial x_i}({f} * g)(x) = \frac{\partial f}{\partial x_i} * g = {f} * \frac{\partial g}{\partial x_i}.

A particular consequence of this is that the convolution can be viewed as a "smoothing" operation: the convolution of ƒ and g is differentiable as many times as ƒ and g are together.

In the discrete case, the difference operator Dƒ(n) = ƒ(n+1) − ƒ(n) satisfies an analogous relationship:

D(f*g) = (Df)*g = f*(Dg).\,

Convolution theorem

The convolution theorem states that

 \mathcal{F}\{f * g\} = k\cdot \mathcal{F}\{f\}\cdot \mathcal{F}\{g\}

where  \mathcal{F}\{f\}\, denotes the Fourier transform of f, and k is a constant that depends on the specific normalization of the Fourier transform (see Fourier transform#Some Fourier transform properties). Versions of this theorem also hold for the Laplace transform, two-sided Laplace transform, Z-transform and Mellin transform.

See also the less trivial Titchmarsh convolution theorem.

Translation invariance

The convolution commutes with translations, meaning that

\tau_x ({f}*g) = (\tau_x f)*g = {f}*(\tau_x g)\,

where τxƒ is the translation of the function ƒ by x defined by

(\tau_x f)(y) = f(y-x).\,

Furthermore, under certain conditions, convolution is the most general translation invariant operation. Roughly speaking, the following holds

Thus any translation invariant operation can be represented as a convolution. Convolutions play an important role in the study of time-invariant systems, and especially LTI system theory. The representing function gS is the impulse response of the transformation S.

Convolution inverse

Main article: Convolution power#Convolution inverse

Many functions have an inverse element, f(-1), which satisfies the relationship:

f^{(-1)} * f = \delta \,

These functions form an abelian group, with the group operation being convolution.

Convolutions on groups

If G is a suitable group endowed with a measure m (for instance, a locally compact Hausdorff topological group with the Haar measure) and if f and g are real or complex valued m-integrable functions on G, then we can define their convolution by

(f * g)(x) = \int_G f(y)g(xy^{-1})\,dm(y) \,

The circle group T with the Lebesgue measure is an immediate example. For a fixed g in L1(T), we have the following familiar operator acting on the Hilbert space L2(T):

T {f}(x) =  \frac{1}{2 \pi} \int_{\mathbb{T}} {f}(y) g( x - y) dy.

The operator T is compact. A direct calculation shows that its adjoint T* is convolution with

\bar{g}(-y).

By the commutativity property cited above, T is normal, i.e. T*T = TT*. Also, T commutes with the translation operators. Consider the family S of operators consisting of all such convolutions and the translation operators. S is a commuting family of normal operators. According to spectral theory, there exists an orthonormal basis {hk} that simultaneously diagonalizes S. This characterizes convolutions on the circle. Specifically, we have

h_k (x) = e^{ikx},\;

which are precisely the characters of T. Each convolution is a compact multiplication operator in this basis. This can be viewed as a version of the convolution theorem discussed above.

The above example may convince one that convolutions arise naturally in the context of harmonic analysis on groups. For more general groups, it is also possible to give, for instance, a Convolution Theorem, however it is much more difficult to phrase and requires representation theory for these types of groups and the Peter-Weyl theorem. It is very difficult to do these calculations without more structure, and Lie groups turn out to be the setting in which these things are done.

Convolution of measures

If μ and ν are measures on the family of Borel subsets of the real line, then the convolution μ * ν is defined by

(\mu * \nu)(A) = (\mu \times \nu)(\{\, (x,y) \in \mathbb{R}^2 \,:\, x+y \in A \,\}).

In case μ and ν are absolutely continuous with respect to Lebesgue measure, so that each has a density function, then the convolution μ * ν is also absolutely continuous, and its density function is just the convolution of the two separate density functions.

If μ and ν are probability measures, then the convolution μ * ν is the probability distribution of the sum X + Y of two independent random variables X and Y whose respective distributions are μ and ν.

Applications

Convolution and related operations are found in many applications of engineering and mathematics.

See also

References

External links