Index (computer science)

In computer science, an index can be:

an integer that identifies an array element
a data structure that enables sublinear-time lookup (an associative array)

1 Array element identifier
2 Support for fast lookup
3 References
4 See also

Array element identifier

When data objects are stored in an array, individual objects are selected by an index that is usually a non-negative scalar integer. Indices are also called subscripts. An index maps the array value to a stored object.

There are three ways in which the elements of an array can be indexed:

0 (zero-based indexing): The first element of the array is indexed by subscript of 0.^[1]
1 (one-based indexing): The first element of the array is indexed by subscript of 1.^[2]
n (n-based indexing): The base index of an array can be freely chosen. Usually programming languages allowing n-based indexing also allow negative index values and other scalar data types like enumerations, or characters may be used as an array index.

Arrays can have multiple dimensions, thus it is not uncommon to access an array using multiple indices. For example a two dimensional array A with three rows and four columns might provide access to the element at the 2nd row and 4th column by the expression: A[1, 3] (in a row major language) and A[3, 1] (in a column major language) in the case of a zero-based indexing system. Thus two indices are used for a two dimensional array, three for a three dimensional array, and n for an n dimensional array.

Support for fast lookup

Suppose a data store contains N data objects, and it is desired to retrieve one of them based on the value of one of the object's fields. A naive implementation would retrieve and examine each object until a match was found. A successful lookup would retrieve half the objects on average; an unsuccessful lookup all of them for each attempt. This means that the number of operations in the worst case is Ω(N) or linear time. Since data stores commonly contain millions of objects and since lookup is a common operation, it is often desirable to improve on this performance.

An index is any data structure that improves the performance of lookup. There are many different data structures used for this purpose, and in fact a substantial proportion of the field of Computer Science is devoted to the design and analysis of index data structures. There are complex design trade-offs involving lookup performance, index size, and index update performance. Many index designs exhibit logarithmic (O(log(N)) lookup performance and in some applications it is possible to achieve flat (O(1)) performance.

All database software includes indexing technology in the interests of improving performance. See Index (database).

One specific and very common application is in the domain of information retrieval, where the application of a full-text index enables rapid identification of documents based on their textual content.

References

^ "Array Code Examples - PHP Array Functions - PHP code". http://www.configure-all.com/: Computer Programming Web programming Tips. http://www.configure-all.com/arrays.php. Retrieved 2011-04-08. "In most computer languages array index (counting) starts from 0, not from 1. Index of the first element of the array is 0, index of the second element of the array is 1, and so on. In array of names below you can see indexes and values."
^ | accessdate = 2011-04-08 | location = http://www.modula2.org/ | publisher = Module-2 | title = Chapter 6 - Arrays, Types, and Constants | quote = The names of the twelve variables are given by Automobiles[1], Automobiles[2], ... Automobiles[12]. The variable name is "Automobiles" and the array subscripts are the numbers 1 through 12. [i.e. in Modula-2, the index starts by one!]

Index (computer science)

Contents

Array element identifier

Support for fast lookup

References

See also