Classic data sets
From Wikipedia, the free encyclopedia
Several classic data sets have been used extensively in the statistical literature.
- Iris flower data set - multivariate data set introduced by Fisher (1936).
- Categorical data analysis - Data sets used in the book, An Introduction to Categorical Data Analysis, by Agresti are provided on-line by StatLib.
- Robust statistics - Data sets used in Robust Regression and Outlier Detection (Rousseeuw and Leroy, 1986). Provided on-line at the University of Cologne.
- Time series - Data used in Chatfield's book, The Analysis of Time Series, are provided on-line by StatLib.
- Extreme values - Data used in the book, An Introduction to the Statistical Modeling of Extreme Values are provided on-line by Stuart Coles, the book's author.
- Bayesian Data Analysis - Data used in the book, Bayesian Data Analysis, are provided on-line by Andrew Gelman, one of the book's authors.
- The Bupa liver data, used in several papers in the machine learning (data mining) literature.