Beta-binomial model
From Wikipedia, the free encyclopedia
In empirical Bayes methods, the Beta-binomial model is an analytic model where the likelihood function L(x | θ) is specifed by a binomial distribution
and the conjugate prior π(θ | α,β) is a Beta distribution
The Beta-binomial is a two-dimensional Multivariate Polya distribution, as the binomial and Beta distributions are special cases of the multinomial and Dirichlet distributions, respectively.
Contents |
[edit] Derivation of the posterior and the marginal
It is convenient to reparameterize the distributions so that the expected mean of the prior is a single parameter: Let
where
-
-
- and
-
so that
The posterior distribution ρ(θ | x) is also a beta distribution
while the marginal distribution m(x | μ,M) is given by
-
- =
-
- =
[edit] Moment estimates
Because the marginal is a complex, non-linear function of Gamma and Digamma functions, it is quite difficult to obtain a marginal maximum likelihood estimate (MMLE) for the mean and variance. Instead, we use the method of iterated expectations to find the expected value of the marginal moments.
Let us write our model as a two-stage compund sampling model (for each event i out of ni possible events):
We can find iterated moment estimates for the mean and variance using the moments for the distributions in the two-stage model:
We now seek a point estimate as a weighted average of the sample estimate and an estimate for . The sample estimate is given by
Therefore we need point estimates for μ and M. The estimated mean is given as a weighted average
The hyperparameter M is obtained using the moment estimates for the variance of the two-stage model:
We can now solve for :
Given our point estimates for the prior, we may now plug in these values to find a point estimate for the posterior
[edit] Shrinkage factors
We may write the posterior estimate as a weighted average:
where is called the Shrinkage Factor.
[edit] Example
[edit] Maximum likelihood estimation
Maximum likelihood estimates from empirical data can be computed using general methods for fitting multinomial Polya distributions, methods for which are described in (Minka 2003).
[edit] Improved estimates: James-Stein estimator
[edit] External links
- Fastfit contains Matlab code for fitting Beta-Binomial distributions (in the form of two-dimensional Polya distributions) to data.
[edit] References
* Minka, Thomas P. (2003). Estimating a Dirichlet distribution. Microsoft Technical Report.