In statistics, an exchangeable sequence of random variables (also sometimes interchangeable) is a sequence such that future samples behave like earlier samples, meaning formally that any order (of a finite number of samples) is equally likely. This formalizes the notion of "the future being predictable on the basis of past experience."
A sequence of independent and identically-distributed random variables (i.i.d.) is exchangeable, but so is sampling without replacement, which is not independent.
The notion is central to Bruno de Finetti's development of predictive inference and to Bayesian statistics — where frequentist statistics uses i.i.d. variables (samples from a population), Bayesian statistics more frequently uses exchangeable sequences. They are a key way in which Bayesian inference is "data-centric" (based on past and future observations), rather than "model-centric", as exchangeable sequences that are not i.i.d. cannot be modeled as "sampling from a fixed population".
Contents |
Formally, an exchangeable sequence of random variables is a finite or infinite sequence X1, X2, X3, ... of random variables such that for any finite permutation σ of the indices 1, 2, 3, ..., i.e. any permutation σ that leaves all but finitely many indices fixed, the joint probability distribution of the permuted sequence
is the same as the joint probability distribution of the original sequence.
A sequence E1, E2, E3, ... of events is said to be exchangeable precisely if the sequence of its indicator functions is exchangeable.
Independent and identically distributed random variables are exchangeable.
The distribution function FX1,...,Xn(x1, ... ,xn) of a finite sequence of exchangeable random variables is symmetric in its arguments x1, ... ,xn.
The von Neumann extractor is a randomness extractor that depends on exchangeability: it gives a method to take an exchangeable sequence of 0s and 1s (Bernoulli trials), with some probability p of 0 and of 1, and produce a (shorter) exchangeable sequence of 0s and 1s with probability 1/2.
Partition the sequence into non-overlapping pairs: if the two elements of the pair are equal (00 or 11), discard it; if the two elements of the pair are unequal (01 or 10), keep the first. This yields a sequence of Bernoulli trials with as, by exchangeability, the odds of a given pair being 01 or 10 are equal.