Thresholding (image processing)
From Wikipedia, the free encyclopedia
Thresholding is the simplest method of image segmentation. Individual pixels in a grayscale image are marked as 'object' pixels if their value is greater than some threshold value (assuming an object to be brighter than the background) and as 'background' pixels otherwise. Typically, an object pixel is given a value of '1' while a background pixel is given a value of '0'.
The key parameter in thresholding is obviously the choice of the threshold. Several different methods for choosing a threshold exist. The simplest method would be to choose the mean or median value, the rationale being that if the object pixels are brighter than the background, they should also be brighter than the average. In a noiseless image with uniform background and object values, the mean or median will work beautifully as the threshold, however generally speaking, this will not be the case. A more sophisticated approach might be to create a histogram of the image pixel intensities and use the valley point as the threshold. The histogram approach assumes that there is some average value for the background and object pixels, but that the actual pixel values have some variation around these average values. However, computationally this is not as simple as we'd like, and many image histograms do not have clearly defined valley points. Ideally we're looking for a method for choosing the threshold which is simple, does not require too much prior knowledge of the image, and works well for noisy images. A good such approach is an iterative method, as follows:
- An initial threshold (T) is chosen, this can be done randomly or according to any other method desired.
- The image is segmented into object and background pixels as described above, creating two sets:
- G1 = {f(m,n):f(m,n)>T} (object pixels)
- G2 = {f(m,n):f(m,n)T} (background pixels) (note, f(m,n) is the value of the pixel located in the mth column, nth row)
- The average of each set is computed.
- m1 = average value of G1
- m2 = average value of G2
- A new threshold is created that is the average of m1 and m2
- T' = (m1 + m2)/2
- Go back to step two, now using the new threshold computed in step 4, keep repeating until the new threshold matches the one before it (i.e. until convergence has been reached).
Another approach is to calculate the new threshold in step 4 using the weighted average of m1 and m2: T' = (||G1||*m1 + ||G2||*m2)/(||G1||+||G2||), where ||Gn|| is the number of pixels in Gn. This approach often gives a more accurate result.
This iterative algorithm is a special one-dimensional case of the k-means clustering algorithm, which has been proven to converge at a local minimum - meaning that a different initial threshold may result in a different final result.
[edit] List of Methods
- Entropy based methods.
- Cluster based methods.
- A priori assumptions about the background population distribution (e.g. 'it is a Gaussian').
- Local methods.
- Otsu's method.
[edit] References
- Gonzalez, Rafael C. & Woods, Richard E. (2002). Thresholding. In Digital Image Processing, pp. 595-611. Pearson Education. ISBN 81-7808-629-8
- Mehmet Sezgin and Bulent Sankur, Survey over image thresholding techniques and quantitative performance evaluation, Journal of Electronic Imaging 13(1), 146– 165 (January 2004). DOI:10.1117/1.1631315