Proxy (statistics)
In statistics, a proxy or proxy variable is a variable that is not in itself directly relevant, but that serves in place of an unobservable or immeasurable variable.[1] In order for a variable to be a good proxy, it must have a close correlation, not necessarily linear or positive, with the variable of interest.
Examples
Per-capita GDP is often used as a proxy for measures of standard of living or quality of life.
Likewise, country of origin or birthplace might be used as a proxy for race, or vice versa.
See also
References
- ↑ Upton, G., Cook,I. (2002) Oxford Dictionary of Statistics. OUP ISBN 978-0-19-954145-4
- Toutenburg, Helge; Götz Trenkler (1992). "Proxy variables and mean square error dominance in linear regression". Journal of Quantitative Economics 8: 433–442.
- Stahlecker, Peter; Götz Trenkler (1993). "Some further results on the use of proxy variables in prediction". The Review of Economics and Statistics (The MIT Press) 75 (4): 707–711. doi:10.2307/2110026. JSTOR 2110026.
- Trenkler, Götz; Peter Stahlecker (1996). "Dropping variables versus use of proxy variables in linear regression". Journal of Statistical Planning and Inference (NORTH-HOLLAND) 50 (1): 65–75. doi:10.1016/0378-3758(95)00045-3.