Dirty data


Dirty data is inaccurate, incomplete or erroneous data, especially in a computer system or database.[1]

In the context of databases, dirty data is data that contains errors. Such data may include spelling or punctuation errors, incorrect values associated with a field, incomplete or outdated records, or records that have been duplicated in the database.
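As a rough illustration of three of the error types listed above (incomplete data, incorrect data associated with a field, and duplicated data), the following minimal sketch scans a small list of records. The record layout, the sample values, and the function name find_dirty are hypothetical, and treating two records with the same case-insensitive email as duplicates is an assumption made only for this example. Spelling errors and outdated data are not detected here, since they generally require reference data or timestamps.

```python
# Hypothetical sample records; field names and values are illustrative only.
RECORDS = [
    {"id": 1, "name": "Ada Lovelace", "email": "ada@example.com", "age": "36"},
    {"id": 2, "name": "Alan Turing", "email": "", "age": "41"},                     # incomplete
    {"id": 3, "name": "Grace Hopper", "email": "grace@example.com", "age": "abc"},  # incorrect field value
    {"id": 4, "name": "ada lovelace ", "email": "ADA@example.com", "age": "36"},    # duplicate of id 1
]


def find_dirty(records):
    """Return a list of (record id, problem description) pairs."""
    issues = []
    first_seen = {}  # normalized email -> id of the first record that used it

    for rec in records:
        # Incomplete data: any field that is None or blank.
        for field, value in rec.items():
            if value is None or str(value).strip() == "":
                issues.append((rec["id"], f"missing {field}"))

        # Incorrect data for a field: age is expected to be a whole number.
        age = str(rec.get("age", "")).strip()
        if age and not age.isdigit():
            issues.append((rec["id"], "non-numeric age"))

        # Duplicated data: same email after trimming and lowercasing (assumed key).
        email = str(rec.get("email", "")).strip().lower()
        if email:
            if email in first_seen:
                issues.append((rec["id"], f"duplicate of id {first_seen[email]}"))
            else:
                first_seen[email] = rec["id"]

    return issues


if __name__ == "__main__":
    for rec_id, problem in find_dirty(RECORDS):
        print(rec_id, problem)
```

In practice the checks would depend on the schema and on what counts as a duplicate in the given database; the email-based key here is only one possible choice.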

See also

  • Signal noise

References

  1. Margaret Chu (2004), "What Are Dirty Data?", Blissful Data, p. 71 et seq., ISBN 9780814407806 
This article is issued from Wikipedia. The text is available under the Creative Commons Attribution/Share Alike; additional terms may apply for the media files.