Talk:Correlation ratio

From Wikipedia, the free encyclopedia

[This article is] poorly explained. --Eequor 03:39, 22 Aug 2004 (UTC)

[edit] Cleanup request

This article is incomprehensible to anyone unfamiliar with advanced statistical notation. It could be repaired with a concrete example and an intuitive, rather than symbolic or mathematical explanation. A graph or two would also help a lot. -- Beland 01:54, 23 August 2005 (UTC)

This is really part of a wider problem: currently large areas of the topic of statistics are very deficiently treated on Wikipedia. A few statistics articles are really good, some others are quite competent, but many are skimpy stubs, and this is one of those. I may return here. Michael Hardy 02:27, 25 August 2005 (UTC)


Please include more information

I have attempted an example --Rumping 00:35, 30 September 2007 (UTC)

[edit] Anyone understands this thoroughly?

It is worth noting that if the relationship between values of x \;\ and values of \overline{y}_x is linear (which is certainly true when there are only two possibilities for x) this will give the same result as the square of the correlation coefficient, otherwise the correlation ratio will be larger in magnitude. It can therefore be used for judging non-linear relationships.

If you understand this thoroughly, please help improving this article and put it more clearly. I am very unhappy with this sentence, since I am not understanding it. I do not want to remove it, since there needs to be a reference made to the correlation coefficient and it seems that the original author knew about it. It would just be nice to convey this message comprehensively. Tomeasy (talk) 11:32, 29 March 2008 (UTC)

[edit] Standard deviation

I have taken the later of the two following statements out, since I think it is wrong.

\eta^2 = \frac{\sum_x n_x (\overline{y}_x-\overline{y})^2}{\sum_{xi} (y_{xi}-\overline{y})^2}

which might be written as

\frac{{\sigma_{\overline{y}}}^2}{{\sigma_{y}}^2}.

The use of the sigma's implies that we should be dealing with standard deviations here. However, substituting their definition does not yield the given equation. Also it is not very clear to me, what standard deviation is precisely meant in the numerator. I tried interpreting it as the standard deviation of the category means, but it failed to yield the stated equation. Correct me, if I am wrong. Tomeasy (talk) 11:41, 29 March 2008 (UTC)