Data analysis

From Wikipedia, the free encyclopedia

Data analysis is the act of transforming data with the aim of extracting useful information and facilitating conclusions. Depending on the type of data and the question, this might include the application of statistical methods, curve fitting, selecting or discarding certain subsets based on specific criteria, or other techniques. In contrast to data mining, data analysis is usually understood more narrowly: it aims not at discovering unforeseen patterns hidden in the data, but at verifying or disproving an existing model, or at extracting the parameters needed to adapt a theoretical model to (experimental) reality.
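
As an illustration of extracting model parameters by curve fitting, the following is a minimal sketch of an ordinary least-squares fit of a straight line. The function name fit_line and the sample values are hypothetical and chosen only for this example; real analyses typically use more general models and also estimate parameter uncertainties.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of the model y = slope * x + intercept.

    Hypothetical helper for illustration; returns (slope, intercept).
    """
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need at least two (x, y) pairs of equal length")

    mean_x = sum(xs) / n
    mean_y = sum(ys) / n

    # Sum of squared deviations in x, and of cross deviations in x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")

    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


if __name__ == "__main__":
    # Made-up measurements that lie exactly on y = 2x + 1.
    slope, intercept = fit_line([0, 1, 2, 3], [1, 3, 5, 7])
    print(slope, intercept)
```

The fitted slope and intercept play the role of the parameters that adapt the theoretical model (here, a straight line) to the measured points.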


Applications in various fields

Data analysis takes different forms, and sometimes goes by different names, in different fields.

Nuclear and particle physics

In nuclear and particle physics the data usually originate from the experimental apparatus via a data acquisition system. They are then processed, in a step usually called data reduction, to apply calibrations and to extract physically significant information. Data reduction is most often, especially in large particle physics experiments, an automatic, batch-mode operation carried out by purpose-written software. The resulting data n-tuples are then scrutinized by the physicists, using specialized software tools such as ROOT or PAW, to compare the results of the experiment with theory.
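
A data reduction step of this kind can be sketched as follows. This is a toy example under stated assumptions: a linear calibration (gain and offset) converting raw detector counts to an energy, and a simple energy threshold as the selection criterion. The names Event and reduce_events, the units, and the numerical values are hypothetical and do not correspond to any particular experiment or to the ROOT or PAW interfaces.

```python
from typing import List, NamedTuple


class Event(NamedTuple):
    """One reduced record (an n-tuple row): event index and calibrated energy."""
    event_id: int
    energy_kev: float


def reduce_events(raw_counts, gain, offset, threshold_kev=0.0) -> List[Event]:
    """Apply an assumed linear calibration and keep events above a threshold.

    energy = gain * counts + offset is an illustrative calibration only.
    """
    reduced = []
    for event_id, counts in enumerate(raw_counts):
        energy = gain * counts + offset
        # Discard events whose calibrated energy falls below the threshold.
        if energy >= threshold_kev:
            reduced.append(Event(event_id, energy))
    return reduced


if __name__ == "__main__":
    # Made-up raw readings; the second one is rejected by the threshold.
    for event in reduce_events([100, 5, 250], gain=2.0, offset=1.0,
                               threshold_kev=50.0):
        print(event)
```

In a real experiment this step would run unattended over the full data set, and the resulting records would be written to a file for later interactive analysis.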

Theoretical models are often difficult to compare directly with the results of experiments, so they are used instead as input for Monte Carlo simulation software such as Geant4, which predicts the response of the detector to a given theoretical event, producing simulated events that are then compared with experimental data.
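
The idea of comparing simulated and measured events can be illustrated with a toy Monte Carlo. This sketch does not use Geant4 and is far simpler than a real detector simulation: it assumes the detector response is a Gaussian smearing of a single true energy, fills histograms, and compares them with a Pearson chi-square. All function names, parameters, and numbers are hypothetical, and the "measured" sample is itself generated here only as a stand-in for real data.

```python
import random


def simulate_detector(true_energy, resolution, n_events, seed=0):
    """Toy detector model: smear the true energy with Gaussian noise."""
    rng = random.Random(seed)
    return [rng.gauss(true_energy, resolution) for _ in range(n_events)]


def histogram(values, low, high, n_bins):
    """Count values in n_bins equal-width bins over the range [low, high)."""
    counts = [0] * n_bins
    width = (high - low) / n_bins
    for v in values:
        if low <= v < high:
            # min() guards against rounding at the upper edge of the range.
            index = min(int((v - low) / width), n_bins - 1)
            counts[index] += 1
    return counts


def chi_square(observed, expected):
    """Pearson chi-square, skipping bins where the expectation is zero."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)


if __name__ == "__main__":
    # Simulated events predicted by the (toy) model.
    simulated = simulate_detector(100.0, 5.0, 10000, seed=1)
    # Stand-in for experimental data: an independent sample from the same model.
    measured = simulate_detector(100.0, 5.0, 10000, seed=2)

    sim_hist = histogram(simulated, 80.0, 120.0, 20)
    data_hist = histogram(measured, 80.0, 120.0, 20)
    print("chi-square:", chi_square(data_hist, sim_hist))
```

A small chi-square relative to the number of bins suggests that the simulated response describes the data; a large one points to a mismatch between model and measurement.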

Software tools

See also

Social sciences

Qualitative data analysis (QDA), or qualitative research, is the analysis of non-numerical data, for example words, photographs, and observations.

Information technology

A special case is data analysis in information technology audits.

Business

See also

Further reading

  • Lewis-Beck, Michael S., Data Analysis: An Introduction, Sage Publications Inc, 1995, ISBN 0803957726
  • Pyzdek, T., Quality Engineering Handbook, 2003, ISBN 0824746147
  • Godfrey, A. B., Juran's Quality Handbook, 1999, ISBN 007034003
  • NIST/SEMATECH, Engineering Statistics Handbook