Calgary Corpus

From Wikipedia, the free encyclopedia

The Calgary Corpus is a collection of text and binary data files, commonly used for comparing data compression algorithms. It was created by Ian Witten and Tim Bell in the 1980s and was commonly used in the 1990s. In 1997 it was replaced by the Canterbury Corpus, but the Calgary Corpus still exists for comparison and is still useful for its original intended purpose.

[edit] See also

Comparison of file archivers

[edit] External links

v • d • e

Data compression methods

Lossless compression methods

Theory	Entropy · Complexity · Redundancy

Entropy encoding	Huffman · Adaptive Huffman · Arithmetic (Shannon-Fano · Range) · Golomb · Exp-Golomb · Universal (Elias · Fibonacci) · Asymmetric binary

Dictionary	RLE · LZ77/78 · LZW · LZWL · LZO · DEFLATE · LZMA · LZX · LZJB

Others	CTW · BWT · PPM · DMC

Audio compression methods

Theory	Convolution · Sampling · Nyquist–Shannon theorem

Audio codec parts	LPC (LAR · LSP) · WLPC · CELP · ACELP · A-law · μ-law · MDCT · Fourier transform · Psychoacoustic model

Others	Dynamic range compression · Speech compression · Sub-band coding

Image compression methods

Terms	Color space · Pixel · Chroma subsampling · Compression artifact

Methods	RLE · Fractal · Wavelet · SPIHT · DCT · KLT

Others	Bit rate · Test images · PSNR quality measure · Quantization

Video compression

Terms	Video Characteristics · Frame · Frame types · Video quality

Video codec parts	Motion compensation · DCT · Quantization

Others	Video codecs · Rate distortion theory (CBR · ABR · VBR)

Timeline of information theory, data compression, and error-correcting codes

See Compression Formats and Standards for formats and Compression Software Implementations for codecs

Categories: Data compression

Calgary Corpus

From Wikipedia, the free encyclopedia

[edit] See also

[edit] External links

Views

Navigation

Interaction

Search

Languages