Window function

From Wikipedia, the free encyclopedia

In signal processing, a window function (or apodization function) is a function that is zero-valued outside of some chosen interval. For instance, a function that is constant inside the interval and zero elsewhere is called a rectangular window, which describes the shape of its graphical representation. When another function or a signal (data) is multiplied by a window function, the product is also zero-valued outside the interval: all that is left is the "view" through the window. Applications of window functions include spectral analysis and filter design.

Contents

[edit] Spectral analysis

The Fourier transform of the function: \cos(\omega t)\, is zero, except at frequency \pm \omega\,. However, many other functions and data (that is, waveforms) do not have convenient closed form transforms. Alternatively, one might be interested in their spectral content only during a certain time period.

In either case, the Fourier transform (or something similar) can be applied on one or more finite intervals of the waveform. In general, the transform is applied to the product of the waveform and a window function. Any window (including rectangular) affects the spectral estimate computed by this method.

Figure 1:  Zoom view of spectral leakage
Figure 1: Zoom view of spectral leakage

[edit] Windowing

Windowing of a simple waveform, like \cos(\omega t)\,. causes its Fourier transform to have non-zero values (commonly called leakage) at frequencies other than \omega\,. It tends to be worst (highest) near \omega\, and least at frequencies farthest from \omega\,.

If there are two sinusoids, with different frequencies, leakage can interfere with the ability to distinguish them spectrally. If their frequencies are dissimilar, then the leakage interferes when one sinusoid is much smaller in amplitude than the other. That is, its spectral component can be hidden by the leakage from the larger component. But when the frequencies are near each other, the leakage can be sufficient to interfere even when the sinusoids are equal strength; that is, they become unresolvable.

The rectangular window has excellent resolution characteristics for signals of comparable strength, but it is a poor choice for signals of disparate amplitudes. This characteristic is sometimes described as low-dynamic-range.

At the other extreme of dynamic range are the windows with the poorest resolution. These high-dynamic-range low-resolution windows are also poorest in terms of sensitivity; this is, if the input waveform contains random noise close to the signal frequency, the response to noise, compared to the sinusoid, will be higher than with a higher-resolution window. In other words, the ability to find weak sinusoids amidst the noise is diminished by a high-dynamic-range window. High-dynamic-range windows are probably most often justified in wideband applications, where the spectrum being analyzed is expected to contain many different signals of various strengths.

In between the extremes are moderate windows, such as Hamming and Hann. They are commonly used in narrowband applications, such as the spectrum of a telephone channel. In summary, spectral analysis involves a tradeoff between resolving comparable strength signals with similar frequencies and resolving disparate strength signals with dissimilar frequencies. That tradeoff occurs when the window function is chosen.

[edit] Discrete-time signals

When the input waveform is time-sampled, instead of continuous, the analysis is usually done by applying a window function and then a discrete Fourier transform (DFT). But the DFT provides only a coarse sampling of the actual DTFT spectrum. Figure 1 shows a portion of the DTFT for a rectangularly-windowed sinusoid. The actual frequency of the sinusoid is indicated as "0" on the horizontal axis. Everything else is leakage. The unit of frequency is "DFT bins"; that is, the integer values are the frequencies sampled by the DFT. So the figure depicts a case where the actual frequency of the sinusoid happens to coincide with a DFT sample, and the maximum value of the spectrum is accurately measured by that sample. When it misses the maximum value by some amount [up to 1/2 bin], the measurement error is referred to as scalloping loss (inspired by the shape of the peak). But the most interesting thing about this case is that all the other samples coincide with nulls in the true spectrum. (The nulls are actually zero-crossings, which cannot be shown on a logarithmic scale such as this.) So in this case, the DFT creates the illusion of no leakage. Despite the unlikely conditions of this example, it is a popular misconception that visible leakage is some sort of artifact of the DFT. But since any window function causes leakage, its apparent absence (in this contrived example) is actually the DFT artifact.

[edit] Total leakage

The concepts of resolution and dynamic range tend to be somewhat subjective, depending on what the user is actually trying to do. But they also tend to be highly correlated with the total leakage, which is quantifiable. It is usually expressed as an equivalent bandwidth, B. Think of it as redistributing the DTFT into a rectangular shape with height equal to the spectral maximum and width B. The more leakage, the greater the bandwidth. It is sometimes called noise equivalent bandwidth or equivalent noise bandwidth.

Although we have depicted leakage as interference that one component imposes on other frequencies, the effect is reciprocal. Thus, if frequency f_1\, leaks a percentage of its energy into frequency f_2\,, then a frequency component at f_2\, returns the favor in the same proportion, as does every other frequency component of the input signal in varying amounts. So the spectral measurement at frequency f_1\, is perturbed by all the other components. The greater the noise bandwidth (B), the greater the effect.

[edit] Processing gain

In signal processing, operations are chosen to improve some aspect of quality of a signal by exploiting the differences between the signal and the corrupting influences. When the signal is a sinusoid corrupted by additive random noise, spectral analysis distributes the signal and noise components differently, often making it easier to detect the signal's presence or measure certain characteristics, such as amplitude and frequency. Effectively, the signal to noise ratio (SNR) is improved by distributing the noise uniformly, while concentrating most of the sinusoid's energy around one frequency. Processing gain is a term often used to describe an SNR improvement. The processing gain of spectral analysis depends on the window function, both its noise bandwidth (B) and its potential scalloping loss. These effects partially offset, because windows with the least scalloping naturally have the most leakage.

For example, the worst possible scalloping loss from a Blackman–Harris window (below) is 0.83 dB, compared to 1.42 dB for a Hann window. But the noise bandwidth is larger by a factor of 2.01/1.5, which can be expressed in decibels as:   10 log10(2.01 / 1.5) = 1.27. Therefore, even at maximum scalloping, the net processing gain of a Hann window exceeds that of a Blackman–Harris window by:  1.27 +0.83 -1.42 = 0.68 dB. And when we happen to incur no scalloping (due to a fortuitous signal frequency), the Hann window is 1.27 dB more sensitive than Blackman–Harris. In general (as mentioned earlier), this is a deterrent to using high-dynamic-range windows in low-dynamic-range applications.

[edit] Window examples

Terminology:

  • N\, represents the width, in samples, of a discrete-time window function. Typically it is an integer power-of-2, such as 210 = 1024.
  • n\, is an integer, with values 0\le \; n\le \; N-1\,. So these are the time-shifted forms of the windows:  w(n-\begin{matrix} \frac{N-1}{2}\end{matrix})\,, where w(n)\, is maximum at n=0\,.
    • Some of these forms have an overall width of N−1, which makes them zero-valued at n=0 and n=N−1. That sacrifices two data samples for no apparent gain, if the DFT size is N. When that happens, an alternative approach is to replace N−1 with N in the formula.
  • Each figure label includes the corresponding noise equivalent bandwidth metric (B), in units of DFT bins. As a guideline, windows are divided into two groups on the basis of B. One group comprises 1 \le B \le 1.8, and the other group comprises B \ge 1.98. The Gauss and Kaiser windows are families that span both groups, though only one or two examples of each are shown.

[edit] High- and moderate-resolution windows

[edit] Rectangular window

Rectangular window; B=1.00
Rectangular window; B=1.00

w(n) = 1\,

[edit] Gauss windows

Gauss window, σ=0.4; B=1.45
Gauss window, σ=0.4; B=1.45

w(n)=e^{-\frac{1}{2} \left ( \frac{n-(N-1)/2}{\sigma (N-1)/2} \right)^{2}}
\sigma \le \;0.5\,

[edit] Hamming window

Hamming window; B=1.37
Hamming window; B=1.37

w(n)=0.53836 - 0.46164\; \cos \left ( \frac{2\pi n}{N-1} \right)

[edit] Hann window

Hann window; B = 1.50
Hann window; B = 1.50

w(n)= 0.5\; \left(1 - \cos \left ( \frac{2 \pi n}{N-1} \right) \right)

The Hann window is sometimes called the "Hanning" window, in analogy to the Hamming window. However, this is incorrect, because the windows were named after Julius von Hann and Richard Hamming, respectively. Another common name is the "raised cosine window".

[edit] Bartlett window (zero valued end-points)

Bartlett window; B=1.33
Bartlett window; B=1.33

w(n)=\frac{2}{N-1}\cdot\left(\frac{N-1}{2}-\left |n-\frac{N-1}{2}\right |\right)\,

[edit] Triangular window (non-zero end-points)

Triangular window; B=1.33
Triangular window; B=1.33

w(n)=\frac{2}{N}\cdot\left(\frac{N}{2}-\left |n-\frac{N-1}{2}\right |\right)\,

[edit] Bartlett–Hann window

Bartlett-Hann window; B=1.46
Bartlett-Hann window; B=1.46

w(n)=a_0 - a_1 \left |\frac{n}{N-1}-\frac{1}{2} \right| - a_2 \cos \left (\frac{2 \pi n}{N-1}\right )
a_0=0.62;\quad a_1=0.48;\quad a_2=0.38\,

[edit] Blackman window

Blackman window; B=1.73
Blackman window; B=1.73

w(n)=a_0 - a_1 \cos \left ( \frac{2 \pi n}{N-1} \right) + a_2 \cos \left ( \frac{4 \pi n}{N-1} \right)
a_0=0.42;\quad a_1=0.5;\quad a_2=0.08\,

[edit] Kaiser windows

Kaiser window, α =2; B=1.50
Kaiser window, α =2; B=1.50
Kaiser window, α =3; B=1.80
Kaiser window, α =3; B=1.80

w(n)=\frac{I_0\Bigg (\pi\alpha \sqrt{1 - (\begin{matrix} \frac{2 n}{N-1} \end{matrix}-1)^2}\Bigg )} {I_0(\pi\alpha)}

See Kaiser window.


[edit] Low-resolution (high-dynamic-range) windows

[edit] Nuttall window, continuous first derivative

Nuttall window, continuous first derivative; B=2.02
Nuttall window, continuous first derivative; B=2.02

w(n)=a_0 - a_1 \cos \left ( \frac{2 \pi n}{N-1} \right)+ a_2 \cos \left ( \frac{4 \pi n}{N-1} \right)- a_3 \cos \left ( \frac{6 \pi n}{N-1} \right)
a_0=0.355768;\quad a_1=0.487396;\quad a_2=0.144232;\quad a_3=0.012604\,

[edit] Blackman–Harris window

Blackman–Harris window; B=2.01
Blackman–Harris window; B=2.01

w(n)=a_0 - a_1 \cos \left ( \frac{2 \pi n}{N-1} \right)+ a_2 \cos \left ( \frac{4 \pi n}{N-1} \right)- a_3 \cos \left ( \frac{6 \pi n}{N-1} \right)
a_0=0.35875;\quad a_1=0.48829;\quad a_2=0.14128;\quad a_3=0.01168\,

[edit] Blackman–Nuttall window

Blackman–Nuttall window; B=1.98
Blackman–Nuttall window; B=1.98

w(n)=a_0 - a_1 \cos \left ( \frac{2 \pi n}{N-1} \right)+ a_2 \cos \left ( \frac{4 \pi n}{N-1} \right)- a_3 \cos \left ( \frac{6 \pi n}{N-1} \right)
a_0=0.3635819; \quad a_1=0.4891775; \quad a_2=0.1365995; \quad a_3=0.0106411\,

[edit] Flat top window

Flat top window; B=3.77
Flat top window; B=3.77

w(n)=a_0 - a_1 \cos \left ( \frac{2 \pi n}{N-1} \right)+ a_2 \cos \left ( \frac{4 \pi n}{N-1} \right)- a_3 \cos \left ( \frac{6 \pi n}{N-1} \right)+a_4 \cos \left ( \frac{8 \pi n}{N-1} \right)
a_0=1;\quad a_1=1.93;\quad a_2=1.29;\quad a_3=0.388;\quad a_4=0.032\,

[edit] Other windows

[edit] Bessel window

[edit] Sine window

[edit] Exponential window

[edit] Comparison of windows

Stopband attenuation of different windows
Stopband attenuation of different windows

When selecting an appropriate window function for an application, the comparsion graph may be useful. The most important parameter is usually the stopband attenuation close to the main lobe. However, some applications are more sensitive to the stopband attenuation far away from the cut-off frequency.

[edit] Overlapping windows

When the length of a data set to be transformed is larger than necessary to provide the desired frequency resolution, a common practice is to subdivide it into smaller sets and window them individually. To mitigate the "loss" at the edges of the window, the individual sets may overlap in time. See Welch method of power spectral analysis.

[edit] See also

[edit] References

  • Oppenheim, A.V., and R.W. Schafer, Discrete-Time Signal Processing, Upper Saddle River, NJ: Prentice-Hall, 1999, pp 468–471.
  • Albert H. Nuttall, Some Windows with Very Good Sidelobe Behavior, IEEE Transactions on Acoustics, Speech, and Signal Processing, Vol.ASSP-29, No.1, February 1981, pp 84–91.
  • Frederic J. Harris, On the use of Windows for Harmonic Analysis with the Discrete Fourier Transform, Proceedings of the IEEE, Vol.66, No.1, January 1978, pp 51–83.
  • S.W.A. Bergen and A. Antoniou, Design of Ultraspherical Window Functions with Prescribed Spectral Characteristics, EURASIP Journal on Applied Signal Processing, vol. 2004, no. 13, pp. 2053–2065, 2004.
  • S.W.A. Bergen and A. Antoniou, Design of Nonrecursive Digital Filters Using the Ultraspherical Window Function, EURASIP Journal on Applied Signal Processing, vol. 2005, no. 12, pp. 1910–1922, 2005.