Delimiter-separated values

From Wikipedia, the free encyclopedia

Delimited data uses specific characters (delimiters) to separate its values. Most database and spreadsheet programs are able to read or save data in a delimited format.

Any character may be used to separate the values, but the most common characters used are the comma, tab, vertical bar (also referred to as pipe) and space. For example, in a comma-separated values (CSV) file the data items are delimited using commas. Delimited data is often divided into columns and rows. Column headers are defined in the first line of data, and each subsequent line is a row of data. The lines are separated by newline and/or carriage return characters.

Typically a delimited file format is indicated by a specification. Some specifications provide conventions for avoiding delimiter collision, others do not. A comma separated file avoids delimiter collision by using double quotes.

Due to their widespread use, comma- and tab-delimited text files can be opened by most spreadsheet programs without the user designating which delimiter has been used.

For example, the following data is delimited by vertical bars and newline characters:

Date|Pupil|Grade
25 May|Fred Bloggs|C
25 May|Jane Doe|B
15 July|Fred Bloggs|D

ASCII includes several control characters that are intended to be used as delimiters. They are: 28 file separator, 29 group separator, 30 record separator, 31 unit separator. Use of these characters has not achieved wide-spread adoption, some systems have replaced their control properties with more accepted controls such as CR/LF and TAB.


[edit] See also

Delimiter

In other languages