Heap (data structure)

From Wikipedia, the free encyclopedia

This article is about heap data structures. For “the heap” as a large pool of unused memory, see Dynamic memory allocation.

In computer science, a heap is a specialized tree-based data structure that satisfies the heap property. Its base datatype (used for node keys) must be an ordered set.

If B is a child node of A, then the heap property is that:

key(A) ≥ key(B)
Example of a complete binary max heap
Enlarge
Example of a complete binary max heap

This implies that the greatest element is always in the root node, and such a heap is sometimes called a max heap. (Alternatively, if the comparison is reversed, the smallest element is always in the root node, which results in a min heap.) This is why heaps are used to implement priority queues. The efficiency of heap operations is crucial in several graph algorithms.

The operations commonly performed with a heap are

  • delete-max or delete-min: removing the root node of a max- or min-heap, respectively
  • decrease-key: updating a key within the heap
  • insert: adding a new key to the heap
  • merge: joining two heaps to form a valid new heap containing all the elements of both.

Heaps are used in the sorting algorithm called heapsort.

Contents

[edit] Variants

[edit] Comparison of theoretic bounds for variants

Function names assume a min-heap:

Operation Binary Binomial Fibonacci Pairing Leftist 2-3 Treap Beap
worst-case worst-case amortized worst-case amortized worst-case worst-case
find-min O(1) O(logn) O(1) O(1) O(1) O(1) O(1)
delete-min O(logn) O(logn) O(logn) O(n) O(logn) O(n) O(logn)
insert O(logn) O(logn) O(1) O(1) O(1) or O(logn) O(1) O(logn)
decrease-key O(logn) O(logn) O(1) O(1) O(loglogn) or O(logn) O(1) O(logn)
merge O(n) O(logn) O(1) O(1) O(1) or O(logn) O(1) O(logn)

For pairing heaps the insert, decreaseKey and merge operations are conjectured to be O(1) amortized complexity but this has not yet been proven.

[edit] Heap applications

Heaps are favorite data structures for many applications.

Interestingly, heaps may be represented using an array alone. The first (or last) element will contain the root. The next two elements of the array contain its children. The next four contain the four children of the two child nodes, etc. Thus the children of the node at position n would be at positions 2n and 2n+1 in a one-based array, or 2n+1 and 2n+2 in a zero-based array. Balancing a heap is done by swapping elements which are out of order. As we can build a heap from an array without requiring extra memory (for the nodes, for example), heapsort can be used to sort an array in-place.

One more advantage of heaps over trees in some applications is that construction of heaps can be done in linear time using Tarjan's algorithm.

[edit] Heap implementations

  • The C++ Standard Template Library provides the make_heap, push_heap and pop_heap algorithms for binary heaps, which operate on arbitrary random access iterators. It treats the iterators as a reference to an array, and uses the array-to-heap conversion detailed above.

[edit] Build Heap

A procedure that makes a heap of an array, that is, rearranges items so the array has the heap property. The time of this algorithm is O(n) on an array-based heap implementation, where n is the number of nodes in the heap.

It works by heapifying the elements starting from the middle of the array.

[edit] See also

[edit] External links