Kernel (image processing)

From Wikipedia, the free encyclopedia

In image processing, a kernel, convolution matrix, or mask is a small matrix useful for blurring, sharpening, embossing, edge-detection, and more. This is accomplished by means of convolution between a kernel and an image.

Details

Depending on the element values, a kernel can cause a wide range of effects.

Original {\begin{bmatrix}0&0&0\\0&1&0\\0&0&0\end{bmatrix}}
Edge-Detect {\begin{bmatrix}1&0&-1\\0&0&0\\-1&0&1\end{bmatrix}}
{\begin{bmatrix}0&1&0\\1&-4&1\\0&1&0\end{bmatrix}}
{\begin{bmatrix}-1&-1&-1\\-1&8&-1\\-1&-1&-1\end{bmatrix}}
Sharpen {\begin{bmatrix}0&-1&0\\-1&5&-1\\0&-1&0\end{bmatrix}}
Blur* {\begin{bmatrix}1&2&1\\2&4&2\\1&2&1\end{bmatrix}}
{\begin{bmatrix}1&1&1\\1&1&1\\1&1&1\end{bmatrix}}
*must be normalized

The above are just a few examples of effects achievable by convolving kernels and images.

Origin

The origin is the position of the kernel which is above (conceptually) the current output pixel. This could be outside of the actual kernel, though usually it corresponds to one of the kernel elements. For a symmetric kernel, the origin is usually the center element.

Convolution

This is a form of mathematical convolution.

The values of a given pixel in the output image are calculated by multiplying each kernel value by the corresponding input image pixel values. This can be described algorithmically with the following pseudo-code:

for each image row in output image:
   for each pixel in image row:

      set running total to zero

      for each kernel row in kernel:
         for each element in kernel row:

            multiply element value by corresponding* pixel value
            add result to running total

      set output image pixel to value of running total
*corresponding input image pixels are found relative to the kernel's origin.

If the kernel is not symmetric, it has to be flipped both around its horizontal and vertical axis before calculating the convolution as above.[1]

Edge Handling

Extend Edge-Handling

Kernel convolution usually requires values from pixels outside of the image boundaries. There are a variety of methods for handling image edges.

Extend
The nearest border pixels are conceptually extended as far as necessary to provide values for the convolution. Corner pixels are extended in 90° wedges. Other edge pixels are extended in lines.
Wrap
The image is conceptually wrapped (or tiled) and values are taken from the opposite edge or corner.
Crop
Any pixel in the output image which would require values from beyond the edge is skipped. This method can result in the output image being slightly smaller, with the edges having been cropped.

Normalization

Dividing each element in the kernel by the sum of all the elements in the kernel. Normalization ensures that the pixel values in the output image are of the same relative magnitude as those in the input image.

External links

References

This article is issued from Wikipedia. The text is available under the Creative Commons Attribution/Share Alike; additional terms may apply for the media files.