Geometric mean

From Wikipedia, the free encyclopedia

The geometric mean of a set of positive data is defined as the nth root of the product of all the members of the set, where n is the number of members.

Contents

[edit] Calculation

The geometric mean of a set {a1, a2, ..., an} is:

\bigg(\prod_{i=1}^n a_i \bigg)^{1/n} = \sqrt[n]{a_1 \cdot a_2 \dotsb a_n}.

The geometric mean of a data set is always smaller than or equal to the set's arithmetic mean (the two means are equal if and only if all members of the data set are equal). This allows the definition of the arithmetic-geometric mean, a mixture of the two which always lies in between.

The geometric mean is also the arithmetic-harmonic mean in the sense that if two sequences (an) and (hn) are defined:

a_{n+1} = \frac{a_n + h_n}{2}, \quad a_1=\frac{x + y}{2}

and

h_{n+1} = \frac{2}{\frac{1}{a_n} + \frac{1}{h_n}}, \quad h_1=\frac{2}{\frac{1}{x} + \frac{1}{y}}

then an and hn will converge to the geometric mean of x and y.

[edit] Relationship with arithmetic mean of logarithms

By using logarithmic identities to transform the formula, we can express the multiplications as a sum and the power as a multiplication.

\bigg(\prod_{i=1}^nx_i \bigg)^{1/n} = \exp\left[\frac1n\sum_{i=1}^n\ln x_i\right].

This is simply computing the arithmetic mean of the logarithm transformed values of xi (i.e. the arithmetic mean on the log scale) and then using the exponentiation to return the computation to the original scale. I.e., it is the generalised f-mean with f(x) = ln x.

Therefore the geometric mean is related to the log-normal distribution. The log-normal distribution is a distribution which is normal for the logarithm transformed values. We see that the geometric mean is the exponentiated value of the mean of the log transformed values, i.e. emean(ln(X)).

[edit] When to use the geometric mean

The geometric mean is useful to determine "average factors". For example, if a stock rose 10% in the first year, 20% in the second year and fell 15% in the third year, then we compute the geometric mean of the factors 1.10, 1.20 and 0.85 as (1.10 × 1.20 × 0.85)1/3 = 1.0391-1... and we conclude that the stock rose 3.91 percent per year, on average.

Put another way...

The arithmetic mean is relevant any time several quantities add together to produce a total. The arithmetic mean answers the question, "if all the quantities had the same value, what would that value have to be in order to achieve the same total?"

In the same way, the geometric mean is relevant any time several quantities multiply together to produce a product. The geometric mean answers the question, "if all the quantities had the same value, what would that value have to be in order to achieve the same product?"

For example, suppose you have an investment which earns 10% the first year, 50% the second year, and 30% the third year. What is its average rate of return? It is not the arithmetic mean, because what these numbers signify is that on the first year your investment was multiplied (not added to) by 1.10, on the second year it was multiplied by 1.50, and the third year it was multiplied by 1.30. The relevant quantity is the geometric mean of these three numbers, which is about 1.28966 or about 29% annual interest. Source.

In the scientific community, when reporting experimental results, it is also important to know whether arithmetic mean or geometric mean should be used. If, for example, you are averaging ratios (i.e. ratio = new_method/old_method) over many experiments, geometric mean should be used. This becomes evident when considering the two extremes. If one experiment yields a ratio of 10,000 and the next yields a ratio of 0.0001, an arithmetic mean would misleadingly report that the average ratio was near 5000. Taking a geometric mean will more honestly represent the fact that the average ratio was 1.

[edit] See also

[edit] External links