Multinomial distribution
From Wikipedia, the free encyclopedia
In probability theory, the multinomial distribution is a generalization of the binomial distribution. The binomial distribution is the probability distribution of the number of "successes" in n independent Bernoulli trials, with the same probability of "success" on each trial. Instead of each trial resulting in "success" or "failure", imagine that each trial results in one of some fixed finite number k of possible outcomes, with probabilities p1, ..., pk, and there are n independent trials. We can use a random variable Xi to indicate the number of times outcome number i was observed over the n trials. Then, the multinomial distribution can be defined as the distribution of the vector
The probabilities are given by
for non-negative integers x1, ..., xk.
Each of the k components separately has a binomial distribution with parameters n and pi, for the appropriate value of the subscript i, and, because of the constraint that the sum of the components is n, they are negatively correlated.
The expected value is
The covariance matrix is as follows. Each diagonal entry is the variance of a binomially distributed random variable, and is therefore
The off-diagonal entries are the covariances. These are
for i, j distinct. This is a k × k nonnegative-definite matrix of rank k − 1.
The off-diagonal entries of the corresponding correlation matrix are
Note that the sample size drops out of this expression. All off-diagonal entries are negatively correlated because for fixed N, an increase in one component of a multinomial vector requires a decrease in another component.
The Dirichlet distribution is the conjugate prior of the multinomial in Bayesian statistics.