Block Truncation Coding

From Wikipedia, the free encyclopedia

1 Introduction
2 Compression procedure
3 Example
- 3.1 Encoder
- 3.2 Decoder
4 References

[edit] Introduction

Block Truncation Coding, or BTC, is a type of lossy image compression technique for grayscale images. It divides the original images into blocks and then uses a quantiser to reduce the number of gray levels in each block whilst maintaining the same mean and standard deviation. It is an early predecessor of the popular hardware DXTC technique, although BTC compression method was first adapted to color long before DXTC using a very similar approach called Color Cell Compression ^[1]. BTC has also been adapted to video compression ^[2]

BTC was first proposed by Professors Mitchell and Delp at Purdue University. Another variation of BTC is Absolute Moment Block Truncation Coding or AMBTC, in which instead of using the standard deviation the first absolute moment is preserved along with the mean. AMBTC is computationally simpler than BTC. AMBTC was proposed by Lema and Mitchell.

Sub blocks of 4x4 pixels allow compression of about 25% assuming 8-bit integer values are used during transmission or storage. Larger blocks allow greater compression ("a" and "b" values spread over more pixels) however quality also reduces with the increase in block size due to the nature of the algorithm.

[edit] Compression procedure

A 256x256 pixel image is divided into blocks of typically 4x4 pixels. For each block the Mean and Standard Deviation are calculated, these values change from block to block. These two values define what values the reconstructed or new block will have, in other words the blocks of the BTC compressed image will all have the same mean and standard deviation of the original image. A two level quantization on the block is where we gain the compression, it is performed as follows:

$y(i,j) = \begin{cases} 1, & x(i,j) > \bar x \\ 0, & x(i,j) \le \bar x \end{cases}$

Where $x (i, j)$ are pixel elements of the original block and $y (i, j)$ are elements of the compressed block. In words this can be explained as: If a pixel value is greater than the mean it is assigned the value "1", otherwise "0". Values equal to the mean can have either a "1" or a "0" depending on the preference of the person or organisation implementing the algorithm.

This 16 bit block is stored or transmitted along with the values of Mean and Standard Deviation. Reconstruction is made with two values "a" and "b" which preserve the mean and the standard deviation. The values of "a" and "b" can be computed as follows:

$a=\bar x - \sigma \sqrt{\cfrac {q}{m-q}}$

$b=\bar x + \sigma \sqrt{\cfrac {m-q}{q}}$

Where $σ$ is the standard deviation, m is the total number of pixels in the block and q is the number of pixels greater than the mean ( $\bar x$ )

To reconstruct the image, or create its approximation, elements assigned a 0 are replaced with the "a" value and elements assigned a 1 are replaced with the "b" value. This demonstrates that the algorithm is asymmetric in that the encoder has much more work to do than the decoder. This is because the decoder is simply replacing 1's and 0's with the estimated value whereas the encoder is also required to calculate the mean, standard deviation and the two values to use.

$x(i,j) = \begin{cases} a, & y(i,j) = 0 \\ b, & y(i,j) = 1 \end{cases}$ ^[3].

[edit] Example

[edit] Encoder

Take a 4x4 block from an image, in this case the mountain test image^[4]:

$\begin{matrix} 245 & 239 & 249 & 239 \\ 245 & 245 & 239 & 235 \\ 245 & 245 & 245 & 245 \\ 245 & 235 & 235 & 239 \end{matrix}$

Like any small block from an image this appears rather boring to work with as the numbers are all quite similar, this is the nature of lossy compression and how it can work so well for images. Now we need to calculate two values from this data, that is the mean and standard deviation. The mean can be computed to 241, this is a simple calcuation which should require no further explanation. The standard deviation is easily calculated at 4. Keep in mind we are using integer values as this is what will work the fastest in a computer system. From this the values of "a" and "b" can be calculated using the previous equations. They come out to be 236 and 245 respectively. The last calculation that needs to be done on the encoding side is to set the matrix to transmit to 1's and 0's so that each pixel can be transmitted as a single bit.

$\begin{matrix} 1 & 0 & 1 & 0 \\ 1 & 1 & 0 & 0 \\ 1 & 1 & 1 & 1 \\ 1 & 0 & 0 & 0 \end{matrix}$

[edit] Decoder

Now at the decoder side all we need to do is reassign the "a" and "b" values to the 1 and 0 pixels. This will give us the following block:

$\begin{matrix} 245 & 236 & 245 & 236 \\ 245 & 245 & 236 & 236 \\ 245 & 245 & 245 & 245 \\ 245 & 236 & 236 & 236 \end{matrix}$

As can be seen, the block has been reconstructed with the two values of "a" and "b". When working through the theory, this is a good point to calculate the mean and standard deviation of the reconstructed block. They should equal the original mean and standard deviation. Remember to use integers, otherwise much quantization error will become involved, as we previously quantized everything to integers in the encoder.

[edit] References

^ 1990 IEEE Color Cell Compression Paper[1]
^ 1981 IEEE Paper "Digital Video Bandwidth Compression Using Block Truncation Coding" [2]
^ Leis, J 2008, ELE4607 Advance Digital Communications, Module 3: Image & Video Coding. Lecture Slides, University of Southern Queensland, 2008.
^ http://links.uwaterloo.ca/bragzone.base.html

Categories: Image processing | Lossy compression algorithms

Block Truncation Coding

From Wikipedia, the free encyclopedia

Contents

[edit] Introduction

[edit] Compression procedure

[edit] Example

[edit] Encoder

[edit] Decoder

[edit] References

Views

Navigation

Interaction

Search