Point Distribution Model

From Wikipedia, the free encyclopedia

The Point Distribution Model is a model for representing the mean geometry of a shape and some statistical modes of geometric variation inferred from a training set of shapes. It has been developed by Cootes,[1] Taylor et al[2] and became a standard in computer vision for statistical study of shape[3] and for segmentation of medical images[2] where shape priors really help interpretation of noisy and low-contrasted pixels/voxels. The latter point leads to Active shape models (ASM) and Active Appearance Models (AAM).

Point Distribution Models rely on Landmark points. A landmark is an annotating point posed by an anatomist onto a given locus for every shape instance across the training set population. For instance, the same landmark will designate the tip of the index in a training set of 2D hands outlines. Principal Component Analysis (PCA), for instance, is a relevant tool for studying correlations of movement between groups of landmarks among the training set population. Typically, it might detect that all the landmarks located along the same finger move exactly together across the training set examples showing different finger spacing for a flat-posed hands collection.

The implementation of the procedure is roughly the following:

  1. Annotate the training set outlines with enough corresponding landmarks to sufficiently approximate the geometry of the original shapes
  2. Align the clouds of landmark using the Generalized procrustes analysis (minimization of overall distance between landmarks of same label). The big idea is that shape information is not related to affine pose parameters, which need to be removed before any shape study. A mean shape can now be computed in averaging the aligned landmark positions.
  3. Now the shape outlines are reduced to sequences of n landmarks, we can see the training set as a 2n or 3n (2D/3D) space where any shape instance is a single dot. Assuming the scattering is gaussian in this space, PCA is supposedly the most straightforward tool to analyse the training set in this space
  4. PCA computes normalized eigenvectors and eigenvalues of the training set covariance matrix. Each eigenvector describe a principal mode of variation along the set, the corresponding eigenvalue indicating the importance of this mode in the shape space scattering. Since correlation was found between landmarks, the total variation of the space is concentrated on the very first eigenvectors, showing a very fast descent. Otherwise correlation was not found, suggesting the training set shows no variation or the landmarks are not properly posed.

An eigenvector, interpreted in euclidean space, can be seen as a sequence of n euclidean vectors associated to corresponding landmark and designating a compound move for the whole shape. Global nonlinear variation is usually well handled provided nonlinear variation is kept to a reasonable level. Typically, a twisting nematode worm is used as an example in the teaching of kernel PCA-based methods.

Due to the PCA properties: eigenvectors are mutually orthogonal, form a basis of the training set cloud in the shape space, and cross at the 0 in this space, which represents the mean shape. Also, PCA is a traditional way of fitting a closed ellipsoid to a Gaussian cloud of points (whatever their dimension): this suggests the concept of bounded variation.

The very big idea of PDM is that eigenvectors can be linearly combined to create an infinity of new shape instances that will 'look like' the one in the training set. The coefficients are bounded alike the values of the corresponding eigenvalues, so as to ensure the generated 2n/3n-dimensional dot will remain into the hyper-ellipsoïdal allowed domain—Allowable Shape Domain (ASD).[2]

[edit] References

  1. ^ T. F. Cootes (2004), Statistical models of appearance for computer vision, <http://www.isbe.man.ac.uk/~bim/Models/app_models.pdf> 
  2. ^ a b c D.H. Cooper, T.F. Cootes, C.J. Taylor and J. Graham (1995), “Active shape models—their training and application”, Computer Vision and Image Understanding (no. 61): 38–59 
  3. ^ Rhodri H. Davies and Carole J. Twining and P. Daniel Allen and Tim F. Cootes and Chris J. Taylor (2003), Shape discrimination in the Hippocampus using an MDL Model, <http://www2.wiau.man.ac.uk/caws/Conferences/10/proceedings/8/papers/133/rhhd_ipmi03%2Epdf> 

[edit] See also

[edit] External links