Centimorgan

In genetics, a centimorgan (abbreviated cM) or map unit (m.u.) is a unit for measuring genetic linkage. It is defined as the distance between chromosome positions (also termed, loci or markers) for which the expected average number of intervening chromosomal crossovers in a single generation is 0.01. It is often used to infer distance along a chromosome. It is not a true physical distance however.

Relation to physical distance

The number of base-pairs to which it corresponds varies widely across the genome (different regions of a chromosome have different propensities towards crossover).

One centimorgan corresponds to about 1 million base pairs in humans on average.[1][2] Plasmodium falciparum has an average recombination distance of ~15 kb per centimorgan: markers separated by 15 kb of DNA (15,000 nucleotides) have an expected rate of chromosomal crossovers of 0.01 per generation. Note that non-syntenic genes (genes residing on different chromosomes) are inherently unlinked, and cM distances have no meaning between them.

Relation to the probability of recombination

Because genetic recombination between two markers is detected only if there are an odd number of chromosomal crossovers between the two markers, the distance in centimorgans does not correspond exactly to the probability of genetic recombination. Assuming Haldane's map function, where the number of chromosomal crossovers is according to a Poisson distribution,[3] a genetic distance of d centimorgans will lead to an odd number of chromosomal crossovers, and hence a detectable genetic recombination, with probability

\Pr[\text{recombination}|\text{linkage of }d\text{ cM}] = \sum_{k=0}^{\infty} \Pr[2k + 1 \text{ crossovers}|\text{linkage of }d\text{ cM}]
{} = \sum_{k=0}^{\infty} e^{-d/100} \frac{(d/100)^{2\,k+1}}{(2\,k+1)!} = e^{-d/100} \sinh(d/100) = \frac{1 - e^{-2d/100}}{2}\,,

where sinh is the hyperbolic sine function. The probability of recombination is approximately d/100 for small values of d and approaches 50% as d goes to infinity.

The formula can be inverted, giving the distance in centimorgans as a function of the recombination probability:

d=50 \ln\left({\frac{1}{1 - 2 \Pr[\text{recombination}]}}\right)\,.

Etymology

The centimorgan was named in honor of geneticist Thomas Hunt Morgan by his student Alfred Henry Sturtevant. Note that the parent unit of the centimorgan, the Morgan, is rarely used today.

External links

Demonstration of crossing over, from the home page of the laboratory of Paul O. Lewis in the Department of Ecology and Evolutionary Biology at the University of Connecticut.

References

  1. NIH ORDR - Glossary - C
  2. Harvey Lodish, Arnold Berk, Paul Matsudaira, Chris A. Kaiser, Monty Krieger, Matthew P. Scott, Lawrence Zipursky, and James Darnell (2004). Molecular Cell Biology, Fifth Edition. San Francisco: W. H. Freeman. p. 396. ISBN 0-7167-4366-3. ...in humans 1 centimorgan on average represents a distance of about 7.5x10E5 base pairs
  3. "Department of Plant Sciences".

Further reading