Watterson estimator
From Wikipedia, the free encyclopedia
The introduction to this article provides insufficient context for those unfamiliar with the subject. Please help improve the article with a good introductory style. |
In population genetics, the Watterson estimator is a method for estimating the population mutation rate, θ = 4Neμ, where Ne is the effective population size and μ is the per-generation mutation rate of the population of interest (Watterson (1975)). The assumptions made are that there is a sample of n haploid individuals from the population of interest, that there are an infinitely many alleles possible, and that .
The estimate of θ, often denoted as , is
where K is the number of segregating sites in the sample and
is the (n − 1)th harmonic number.
This estimate is based on coalescent theory. Watterson's estimator is commonly used for its simplicity. The estimator is unbiased and the variance of the estimator decreases with increasing sample size and/or recombination rate. However, the estimator can be biased by population structure. For example, is downwardly biased in an exponentially growing population.
[edit] See also
- Tajima estimator
- Fu estimator
- Coupon collector's problem
- Ewens sampling formula
[edit] References
- Watterson, G.A. (1975), “On the number of segregation sites.”, Theoretical Population Biology 7: 256-276