Quicksort

From Wikipedia, the free encyclopedia

Quicksort in action on a list of random numbers. The horizontal lines are pivot values.
Quicksort in action on a list of random numbers. The horizontal lines are pivot values.

Quicksort is a well-known sorting algorithm developed by C. A. R. Hoare that, on average, makes Θ(n log n) comparisons to sort n items. However, in the worst case, it makes Θ(n2) comparisons. Typically, quicksort is significantly faster in practice than other Θ(n log n) algorithms, because its inner loop can be efficiently implemented on most architectures, and in most real-world data it is possible to make design choices which minimize the possibility of requiring quadratic time. Quicksort is a comparison sort and, in efficient implementations, is not a stable sort.

Contents

[edit] The algorithm

Quicksort sorts by employing a divide and conquer strategy to divide a list into two sub-lists.

The steps are:

  1. Pick an element, called a pivot, from the list.
  2. Reorder the list so that all elements which are less than the pivot come before the pivot and so that all elements greater than the pivot come after it (equal values can go either way). After this partitioning, the pivot is in its final position. This is called the partition operation.
  3. Recursively sort the sub-list of lesser elements and the sub-list of greater elements.

The base case of the recursion are lists of size zero or one, which are always sorted. This sort is a variation of the lesser known Hill Sort[verification needed]. The algorithm always terminates because it puts at least one element in its final place on each iteration (the loop invariant).

In simple pseudocode, the algorithm might be expressed as:

 function quicksort(q)
     var list less, pivotList, greater
     if length(q) ≤ 1  
         return q  
     select a pivot value pivot from q
     for each x in q
         if x < pivot then add x to less
         if x = pivot then add x to pivotList
         if x > pivot then add x to greater
     return concatenate(quicksort(less), pivotList, quicksort(greater))

Notice that we only examine elements by comparing them to other elements. This makes quicksort a comparison sort.

[edit] Version with in-place partition

In-place partition in action on a small list. The boxed element is the pivot element, blue elements are less or equal, and red elements are larger.
In-place partition in action on a small list. The boxed element is the pivot element, blue elements are less or equal, and red elements are larger.

The disadvantage of the simple version above is that it requires Ω(n) extra storage space, which is as bad as mergesort (see big-O notation for the meaning of Ω). The additional memory allocations required can also drastically impact speed and cache performance in practical implementations. There is a more complicated version which uses an in-place partition algorithm and can achieve O(log n) space use on average for good pivot choices:

 function partition(array, left, right, pivotIndex)
     pivotValue := array[pivotIndex]
     swap(array[pivotIndex], array[right]) // Move pivot to end
     storeIndex := left
     for i  from  left to right-1
         if array[i] < pivotValue
             swap(array[storeIndex], array[i])
             storeIndex := storeIndex + 1
     swap(array[right], array[storeIndex]) // Move pivot to its final place
     return storeIndex

This form of the partition algorithm is not the original form; multiple variations can be found in various textbooks, such as versions not having the storeIndex. However, this form is probably the easiest to understand.

This is the in-place partition algorithm. It partitions the portion of the array between indexes left and right, inclusively, by moving all elements less than or equal to a[pivotIndex] to the beginning of the subarray, leaving all the greater elements following them. In the process it also finds the final position for the pivot element, which it returns. It temporarily moves the pivot element to the end of the subarray, so that it doesn't get in the way. Because it only uses exchanges, the final list has the same elements as the original list. Notice that an element may be exchanged multiple times before reaching its final place.

Another version of the partition algorithm in Java-like pseudocode:

 public static int partition(int low, int high, keytype[] S) {
     int j = low; 
     keytype pivot = S[low];
     for (int i = low + 1; i <= high; i++) {
          if (S[i] < pivot) {
                j++;
                swap( S[i] , S[j]);
          }
     }
     swap (S[low] , S[j]);
     return j;
 }

Once we have this, writing quicksort itself is easy:

 function quicksort(array, left, right)
     if right > left
         select a pivot index (e.g. pivotIndex = left)
         pivotNewIndex := partition(array, left, right, pivotIndex)
         quicksort(array, left, pivotNewIndex-1)
         quicksort(array, pivotNewIndex+1, right)

This version is the one more typically used in imperative languages such as C.

Below is the basic quick sort algorithm written in C.

void q_sort(int numbers[], int left, int right)
{
  int l_hold = left;
  int r_hold = right;
  int pivot = numbers[left];
  int t;
  while (left < right)
  {
    while ((numbers[right] >= pivot) && (left < right))
      right--;
    if (left != right)
    {
      t = numbers[left]; numbers[left] = numbers[right]; numbers[right] = t;
      left++;
    }
    while ((numbers[left] <= pivot) && (left < right))
      left++;
    if (left != right)
    {
      t = numbers[right]; numbers[right] = numbers[left]; numbers[left] = t;
      right--;
    }
  }
  numbers[left] = pivot;
  pivot = left;
  left = l_hold;
  right = r_hold;
  if (left < pivot)
    q_sort(numbers, left, pivot-1);
  if (right > pivot)
    q_sort(numbers, pivot+1, right);
}
void quickSort(int numbers[], int array_size)
{
  q_sort(numbers, 0, array_size - 1);
}

Below is a javascript implementation of the same code above that works (tested in IE7) with options in an html select element.

// call this using: SelectSorter.execute('<id of a select element>');
var SelectSorter = {
        execute: function(id) {
        var select = document.getElementById(id);
        var items = SelectSorter._toArray(select.options);
        SelectSorter._clear(select);
        SelectSorter._sort(items, 0, items.length - 1);
        SelectSorter._append(select, items);
    },
        _clear: function(select) {
        for (var i = select.options.length - 1; i >= 0; i--) {
        select.remove(i);
       }
    },
        _append: function(select, items) {
        var item;
        for (var i = 0; i < items.length; i++) {
        item = document.createElement('option');
        item.value = items[i].value;
        item.text = items[i].text;
        try {
                select.add(item, null);
          }
        catch (exception) {
                select.add(item);
          }
       }
    },
        _sort: function(items, start, finish) {
        var left = start;
      var right = finish;
      var pivot = items[left].text;
      while (left < right) {
        while ((items[right].text >= pivot) && (left < right)){
          right--;
        }
        if (left != right) {
                SelectSorter._swap(items, left, right);
          left++;
        }
        while ((items[left].text <= pivot) && (left < right)) {
          left++;
        }
        if (left != right) {
                SelectSorter._swap(items, left, right);
          right--;
        }
      }
      items[left].text = pivot;
      pivot = left;
      if (start < pivot) {
        SelectSorter._sort(items, start, pivot - 1);
      }
      if (finish > pivot) {
        SelectSorter._sort(items, pivot + 1, finish);
      }
    },
        _swap: function(items, index1, index2) {
        var x = items[index1];
        items[index1] = items[index2];
        items[index2] = x;
    },
        _toArray: function(options) {
        var a = new Array();
        for (var i = 0; i < options.length; i++) {
        a[i] = {value: options[i].value, text: options[i].text};
      }
        return a;
   }
}

[edit] Version with pivot choosing

Algorithms with pivot choosing are generally more efficient than algorithms without pivot choosing only when the pivot choosing section of the algorithm isn't too complex and is chosen appropriately for expected data being sorted. The following code, written in C, is quite efficient when the arithmetic mean of the array is approximately equal to the arithmetic mean of the lowest and the greatest elements of the array. It works by setting the lowest element of the array at the beginning, and the greatest element at the end of the array. The pivot value is then chosen as the value of the element whose value is closest to the arithmetic mean of the first and last elements of the array (being the lowest and the greatest elements of the array, respectively). The quicksort is then implemented to the elements inbetween the first and last elements, for they are already in correct positions.

#include <stdio.h>
#include <stdlib.h>

#define ELTS 10

void swap(int *, int *);
void quicksort(int [], int, int);
void printnums(int [], int);

int main(void)
{
  int nums[ELTS] = { 7, 210, 903, 57, 292, 792, 37, 103, 819, 649 };

  puts("Before:");
  printnums(nums, ELTS);

  quicksort(nums, 0, ELTS - 1);
  puts("After:");
  printnums(nums, ELTS);

  return 0;
}

/* function used for swapping values of two variables */
void swap(int *a, int *b)
{
  int c = *a;

  *a = *b;
  *b = c;
}

/* nums[] - the array which elements are to be sorted
 * first - index of the first element to be sorted
 * last - index of the last element to be sorted */
void quicksort(int nums[], int first, int last)
{
  int i,
      pivot,
      left,
      right,
      passes = last - first,
      am;

  /* Do not run if we're on our last pass */
  if (passes > 0)
  {
    /* The following if check and the next loop
     * set the lowest element at the beginning
     * and the greatest at the end of the array. */
    if (nums[first] > nums[last])
      swap(&nums[first], &nums[last]);

    if (passes > 1)
    {
      for (i = first + 1; i < last; i++)
      {
        if (nums[i] < nums[first])
          swap(&nums[i], &nums[first]);
        else if (nums[i] > nums[last])
          swap(&nums[i], &nums[last]);
      }

      if (passes > 2)
      {
        am = (nums[first] + nums[last]) / 2;
        pivot = nums[first + 1];

        /* The following loop sets the value of pivot element. */
        for (i = first + 2; i < last; i++)
        {
          if (abs(nums[i] - am) < abs(pivot - am))
            pivot = nums[i];
        }

        left = first + 1;
        right = last - 1;

        /* After following loop (one quicksort pass),
         * r shows the location of pivot element,... */
        while (left < right)
        {
          while (nums[left] < pivot)
            left++;

          while (nums[right] > pivot)
            right--;

          if (left < right)
            swap(&nums[left], &nums[right]);

          if (nums[left] == pivot && nums[right] == pivot)
            left++;
        }

        /* ...and that's why the quicksort is used recursively
         * on array nums from point first + 1 to r - 1 and from
         * point r + 1 to last - 1. */
        quicksort(nums, first + 1, right - 1);
        quicksort(nums, right + 1, last - 1);
      }
    }
  }
}

/* Prints the values from the nums array to stdout */
void printnums(int nums[], int array_size)
{
  int x;

  for (x = 0; x < array_size; x++)
  {
    printf("%d%s", nums[x], x + 1 != array_size ? ", " : "\n");
  }
}

[edit] Iterative version

Explicit recursion can be avoided using an iterative form of quicksort that replaces the call stack by an explicit stack data structure. Although this does not make the algorithm use asymptotically less time or space, the practical advantage lies in avoiding the time and space overhead of subroutine entry and exit that occurs during general recursive calls. The disadvantage is of considerably greater code complexity. A simpler in-place sort of competitive speed is heapsort.

 function QuickSort(Array, Left, Right)
 var
     L2, R2, PivotValue
 begin
     Stack.Push(Left, Right);       // pushes Left, and then Right, on to a stack
     while not Stack.Empty do
     begin
         Stack.Pop(Left, Right);    // pops 2 values, storing them in Right and then Left
         repeat
             PivotValue := Array[(Left + Right) div 2];
             L2 := Left;
             R2 := Right;
             repeat
                 while Array[L2] < PivotValue do // scan left partition
                     L2 := L2 + 1;
                 while Array[R2] > PivotValue do // scan right partition
                     R2 := R2 - 1;
                 if L2 <= R2 then
                 begin
                     if L2 != R2 then
                         Swap(Array[L2], Array[R2]);  // swaps the data at L2 and R2
                     L2 := L2 + 1;
                     R2 := R2 - 1;
                 end;
             until L2 >= R2;
             if R2 - Left > Right - L2 then // is left side piece larger?
             begin
                 if Left < R2 then
                     Stack.Push(Left, R2);
                 Left := L2;
             else
             begin
                 if L2 < Right then // if left side isn't, right side is larger
                     Stack.Push(L2, Right);
                 Right := R2;
             end;
         until Left >= Right;
     end;
 end;

Which is indeed more complicated. Extensions might include a median-of-three selection of the pivot (with re-ordering during that selection), and the use of InsertionSort for any partition less than (say) eight elements in length.

[edit] Parallelization

Like mergesort, quicksort can also be easily parallelized due to its divide-and-conquer nature. Individual in-place partition operations are difficult to parallelize, but once divided, different sections of the list can be sorted in parallel. If we have p processors, we can divide a list of n elements into p sublists in Θ(n) average time, then sort each of these in O((n/p) log(n/p)) average time. Ignoring the O(n) preprocessing, this is linear speedup. Given Θ(n) processors, only O(n) time is required overall.

One advantage of parallel quicksort over other parallel sort algorithms is that no synchronization is required. A new thread is started as soon as a sublist is available for it to work on and it does not communicate with other threads. When all threads complete, the sort is done.

Other more sophisticated parallel sorting algorithms can achieve even better time bounds. For example, in 1991 David Powers described a parallelized quicksort that can operate in O(log n) time given enough processors by performing partitioning implicitly.1

[edit] Competitive sorting algorithms

Quicksort is a space-optimized version of the binary tree sort. Instead of inserting items sequentially into an explicit tree, quicksort organizes them concurrently into a tree that is implied by the recursive calls. The algorithms make exactly the same comparisons, but in a different order.

The most direct competitor of quicksort is heapsort. Heapsort is typically somewhat slower than quicksort, but the worst-case running time is always O(n log n). Quicksort is usually faster, though there remains the chance of worst case performance except in the introsort variant. If it's known in advance that heapsort is going to be necessary, using it directly will be faster than waiting for introsort to switch to it. Heapsort also has the important advantage of using only constant additional space (heapsort is in-place), whereas even the best variant of quicksort uses Θ(log n) space. However, heapsort requires efficient random access to be practical.

Quicksort also competes with mergesort, another recursive sort algorithm but with the benefit of worst-case O(n log n) running time. Mergesort is a stable sort, unlike quicksort and heapsort, and can be easily adapted to operate on linked lists and very large lists stored on slow-to-access media such as disk storage or network attached storage. Although quicksort can be written to operate on linked lists, it will often suffer from poor pivot choices without random access. The main disadvantage of mergesort is that, when operating on arrays, it requires Ω(n) auxiliary space in the best case, whereas the variant of quicksort with in-place partitioning and tail recursion uses only O(log n) space. (Note that when operating on linked lists, mergesort only requires a small, constant amount of auxiliary storage.)

[edit] Formal analysis

From the initial description it's not obvious that quicksort takes O(n log n) time on average. It's not hard to see that the partition operation, which simply loops over the elements of the array once, uses Θ(n) time. In versions that perform concatenation, this operation is also Θ(n).

In the best case, each time we perform a partition we divide the list into two nearly equal pieces. This means each recursive call processes a list of half the size. Consequently, we can make only log n nested calls before we reach a list of size 1. This means that the depth of the call tree is O(log n). But no two calls at the same level of the call tree process the same part of the original list; thus, each level of calls needs only O(n) time all together (each call has some constant overhead, but since there are only O(n) calls at each level, this is subsumed in the O(n) factor). The result is that the algorithm uses only O(n log n) time.

An alternate approach is to set up a recurrence relation for T(n), the time needed to sort a list of size n. Because a single quicksort call involves O(n) work plus two recursive calls on lists of size n/2 in the best case, the relation would be:

T(n) = O(n) + 2T(n/2)

The master theorem tells us that T(n) = Θ(n log n).

In fact, it's not necessary to divide the list this precisely; even if each pivot splits the elements with 99% on one side and 1% on the other (or any other fixed fraction), the call depth is still limited to 100log n, so the total running time is still O(n log n).

In the worst case, however, the two sublists have size 1 and n-1, and the call tree becomes a linear chain of n nested calls. The ith call does O(n-i) work, and \sum_{i=0}^n (n-i) = O(n^2). The recurrence relation is:

T(n) = O(n) + T(1) + T(n - 1) = O(n) + T(n - 1)

This is the same relation as for insertion sort and selection sort, and it solves to T(n) = Θ(n2).

[edit] Randomized quicksort expected complexity

Randomized quicksort has the desirable property that it requires only O(n log n) expected time, regardless of the input. But what makes random pivots a good choice?

Suppose we sort the list and then divide it into four parts. The two parts in the middle will contain the best pivots; each of them is larger than at least 25% of the elements and smaller than at least 25% of the elements. If we could consistently choose an element from these two middle parts, we would only have to split the list at most 2log2 n times before reaching lists of size 1, yielding an O(n log n) algorithm.

Unfortunately, a random choice will only choose from these middle parts half the time. The surprising fact is that this is good enough. Imagine that you are flipping a coin over and over until you get k heads. Although this could take a long time, on average only 2k flips are required, and the chance that you won't get k heads after 100k flips is infinitesimally small. By the same argument, quicksort's recursion will terminate on average at a call depth of only 2(2log2 n). But if its average call depth is O(log n), and each level of the call tree processes at most n elements, the total amount of work done on average is the product, O(n log n).

[edit] Average complexity

Even if we aren't able to choose pivots randomly, quicksort still requires only O(n log n) time over all possible permutations of its input. Because this average is simply the sum of the times over all permutations of the input divided by n factorial, it's equivalent to choosing a random permutation of the input. When we do this, the pivot choices are essentially random, leading to an algorithm with the same running time as randomized quicksort.

More precisely, the average number of comparisons over all permutations of the input sequence can be estimated accurately by solving the recurrence relation:

C(n) = n - 1 + \frac{1}{n} \sum_{i=0}^{n-1} (C(i)+C(n-i-1)) = 2n \ln n = 1.39n \log_2 n.

Here, n - 1 is the number of comparisons the partition uses. Since the pivot is equally likely to fall anywhere in the sorted list order, the sum is averaging over all possible splits.

This means that, on average, quicksort performs only about 39% worse than the ideal number of comparisons, which is its best case. In this sense it is closer to the best case than the worst case. This fast average runtime is another reason for quicksort's practical dominance over other sorting algorithms.

C(n)= (n-1) + C(n/2) + C(n/2)
    = (n-1) + 2C(n/2)
    = (n-1) + 2((n/2) - 1 + C(n/4))
    = n + n + 4C(n/4) - 1 - 2
    = n + n + n + 8C(n/8) - 1 - 2- 4
    = kn + 2^kC(n/(2^k)) - (1 + 2 + 3 + . . . . . + 2^(k-1))
  
                 where n>k>0
    =

[edit] Space complexity

The space used by quicksort depends on the version used.

Quicksort has a space complexity of O(log n), even in the worst case, when it is carefully implemented such that

  • in-place partitioning is used. This requires O(1).
  • After partitioning, the partition with the fewest elements is (recursively) sorted first, requiring at most O(log n) space. Then the other partition is sorted using tail-recursion or iteration. (This idea is commonly attributed to R.M.Sedgewick [1][2][3])

The version of quicksort with in-place partitioning uses only constant additional space before making any recursive call. However, if it has made O(log n) nested recursive calls, it needs to store a constant amount of information from each of them. Since the best case makes at most O(log n) nested recursive calls, it uses O(log n) space. The worst case makes O(n) nested recursive calls, and so needs O(n) space.

We are eliding a small detail here, however. If we consider sorting arbitrarily large lists, we have to keep in mind that our variables like left and right can no longer be considered to occupy constant space; it takes O(log n) bits to index into a list of n items. Because we have variables like this in every stack frame, in reality quicksort requires O(log2 n) bits of space in the best and average case and O(n log n) space in the worst case. This isn't too terrible, though, since if the list contains mostly distinct elements, the list itself will also occupy O(n log n) bits of space.

The not-in-place version of quicksort uses O(n) space before it even makes any recursive calls. In the best case its space is still limited to O(n), because each level of the recursion uses half as much space as the last, and

\sum_{i=0}^{\infty} \frac{n}{2^i} = 2n.

Its worst case is dismal, requiring

\sum_{i=0}^n (n-i+1) = \Theta (n^2)

space, far more than the list itself. If the list elements are not themselves constant size, the problem grows even larger; for example, if most of the list elements are distinct, each would require about O(log n) bits, leading to a best-case O(n log n) and worst-case O(n2 log n) space requirement.

[edit] Relationship to selection

A selection algorithm chooses the kth smallest of a list of numbers; this is an easier problem in general than sorting. One simple but effective selection algorithm works nearly in the same manner as quicksort, except that instead of making recursive calls on both sublists, it only makes a single tail-recursive call on the sublist which contains the desired element. This small change lowers the average complexity to linear or Θ(n) time, and makes it an in-place algorithm. A variation on this algorithm brings the worst-case time down to O(n) (see selection algorithm for more information).

Conversely, once we know a worst-case O(n) selection algorithm is available, we can use it to find the ideal pivot (the median) at every step of quicksort, producing a variant with worst-case O(n log n) running time. In practical implementations, however, this variant is considerably slower on average.

[edit] External links

Wikibooks
Wikibooks Algorithm implementation has a page on the topic of

[edit] References

  • Brian C. Dean, "A Simple Expected Running Time Analysis for Randomized 'Divide and Conquer' Algorithms." Discrete Applied Mathematics 154(1): 1-5. 2006.
  • Hoare, C. A. R. "Partition: Algorithm 63," "Quicksort: Algorithm 64," and "Find: Algorithm 65." Comm. ACM 4(7), 321-322, 1961
  • R. Sedgewick. Implementing quicksort programs, Comm. ACM, 21(10):847-857, 1978.
  • David Musser. Introspective Sorting and Selection Algorithms, Software Practice and Experience vol 27, number 8, pages 983-993, 1997
  • Donald Knuth. The Art of Computer Programming, Volume 3: Sorting and Searching, Third Edition. Addison-Wesley, 1997. ISBN 0-201-89685-0. Pages 113–122 of section 5.2.2: Sorting by Exchanging.
  • Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, and Clifford Stein. Introduction to Algorithms, Second Edition. MIT Press and McGraw-Hill, 2001. ISBN 0-262-03293-7. Chapter 7: Quicksort, pp.145–164.
  • A. LaMarca and R. E. Ladner. "The Influence of Caches on the Performance of Sorting." Proceedings of the Eighth Annual ACM-SIAM Symposium on Discrete Algorithms, 1997. pp. 370-379.
  • Faron Moller. Analysis of Quicksort. CS 332: Designing Algorithms. Department of Computer Science, University of Wales Swansea.
  • Steven Skiena. Lecture 5 - quicksort. CSE 373/548 - Analysis of Algorithms. Department of Computer Science. State University of New York at Stony Brook.
  • Conrado Martínez and Salvador Roura, Optimal sampling strategies in quicksort and quickselect. SIAM J. Computing 31(3):683-705, 2001.

[edit] Footnotes

1. David M. W. Powers. Parallelized Quicksort with Optimal Speedup. Proceedings of International Conference on Parallel Computing Technologies. Novosibirsk. 1991.