Transform coding

From Wikipedia, the free encyclopedia

Transform coding is a type of data compression for "natural" data like audio signals or photographic images. The transformation is typically "lossy", resulting in a lower quality copy of the original input.

In transform coding, knowledge of the application is used to choose information to discard, thereby lowering its bandwidth. The remaining information can then be compressed via a variety of methods. When the output is decoded, the result may not be identical to the original input, but is expected to be close enough for the purpose of the application.

One of the most successful transform encoding systems is typically not referred to as such -- the example being NTSC color television. After an extensive series of studies in the 1950s, Alda Bedford showed that the human eye has high resolution only for black and white, somewhat less for "mid-range" colors like yellows and greens, and much less for colors on the end of the spectrum, reds and blues. Using this knowledge allowed RCA to develop a system in which they discarded most of the blue signal after it comes from the camera, keeping most of the green and only some of the red. The result is a signal with considerably less content, one that would fit within existing 6 MHz black-and-white signals. The average TV displays the equivalent of 350 pixels on a line, but the TV signal contains enough information for only about 50 pixels of blue and perhaps 150 of red. This is not apparent to the viewer in most cases, as the eye has sophisticated systems for "re-building" a sharp image based on clues from contrast and edges.

The term is much more commonly used in digital media. The common JPEG image format is an example of a transform coding, one that examines small blocks of the image and "averages out" the color using a discrete cosine transform to form an image with far fewer colors in total. MPEG modifies this across frames in a motion image, further reducing the size compared to a series of JPEGs. MPEG audio compression analyzes the transformed data according to a psychoacoustic model that describes the human ear's sensitivity to parts of the signal, similar to the TV model.

The basic process of digitizing an analog signal is a kind of transform coding that uses sampling in one or more domains as its transform.

[edit] See also

In other languages