Swap (computer programming)
In computer programming, the act of swapping two variables refers to mutually exchanging the values of the variables. Usually, this is done with the data in memory. For example, in a program, two variables may be defined thus (in pseudocode):
data_item x := 1 data_item y := 0 swap (x, y);
(In many programming languages where the swap function is built-in; in C++, overloads are provided allowing std::swap to swap some large structures in O(1) time.) After swap() is performed, x will contain the value 0 and y will contain 1; their values have been exchanged. Of course, this operation may be generalized to other types of values, such as strings, aggregated data types and comparison sorts, utilize swaps to change the positions of data.
Using a temporary variable
The simplest and probably most widely used method to swap two variables is to use a third temporary variable:
define swap (x, y) temp := x x := y y := temp
While this is conceptually simple and in many cases the only convenient way to swap two variables, it uses extra memory. Although this should not be a problem in most applications, the sizes of the values being swapped may be huge (which means the temporary variable may occupy a lot of memory as well), or the swap operation may need to be performed many times, as in sorting algorithms.
In addition, swapping two variables in object-oriented languages such as C++ may involve one call to the class constructor and destructor for the temporary variable, and three calls to the copy constructor. Some classes may allocate memory in the constructor and deallocate it in the destructor, thus creating expensive calls to the system. Copy constructors for classes containing a lot of data, e.g. in an array, may even need to copy the data manually.
XOR swap
XOR swap uses the XOR operation to swap two numeric variables. It is generally touted to be faster than the naive method mentioned above; however it does have disadvantages. XOR swap is generally used to swap low-level data types, like integers. However, it is, in theory, capable of swapping any two values which can be represented by fixed-length bitstrings.
Swap through addition and subtraction
This method swaps two variables by adding and subtracting their values. This is rarely used in practical applications, mainly because:
- It can only swap numeric variables; it may not be possible or logical to add or subtract complex data types, like containers.
- When swapping variables of a fixed size, arithmetic overflow becomes an issue.
- It does not work generally for floating-point values, because floating-point arithmetic is non-associative.
Swapping containers
Containers which allocate memory from the heap using pointers may be swapped in a single operation, by swapping the pointers alone. This is usually found in programming languages supporting pointers, like C or C++. For example, the STL overloads its built-in swap function to exchange containers efficiently this way.[1]
As pointer variables are usually of a fixed size (e.g., most desktop computers have pointers 64 bits long), and they are numeric, they can be swapped quickly using XOR swap.
Parallel assignment
Some languages, like Ruby or Python support parallel assignments, which simplifies the notation for swapping two variables:
a, b := b, a
This is shorthand for an operation involving an intermediate data structure: in Python, a tuple; in Ruby, an array.
Other languages, like javascript (>= ES6) supports destructuring operators which just do the same thing:
[a, b] = [b, a];
Facilitation of swapping in modern computers
Dedicated instructions
Because of the many applications of swapping data in computers, most processors now provide the ability to swap variables directly through built-in instructions. x86 processors, for example, include an XCHG instruction to swap two registers directly without requiring that a third temporary register is used. A CMPXCHG instruction, which compares and conditionally swaps two registers, is even provided in some processor architectures.
XCHG may not be as efficient as one may think. For example, in x86 processors, XCHG will implicitly lock access to any operands in memory to keep the operation atomic, and so may not be efficient when swapping memory. Such locking is important when it is used to implement thread-safe synchronization, as in mutexes. However, an XCHG is usually the fastest way to swap two machine-size words residing in registers. Register renaming may also be used to swap registers efficiently.
Parallel execution
With the advent of instruction pipelining in modern computers and multi-core processors facilitating parallel computing, two or more operations can be performed at once. This can speed up the lowly temporary-variable swap algorithm and give it an edge over other algorithms. For example, the XOR swap algorithm requires sequential execution of three instructions. However, using two temporary registers, two processors executing in parallel can swap two variables in two clock cycles:
Step 1 Processor 1: temp_1 := X Processor 2: temp_2 := Y Step 2 Processor 1: X := temp_2 Processor 2: Y := temp_1
This uses fewer instructions; but other temporary registers may be in use, and four instructions are needed instead of three. In any case, in practice this could not be implemented in separate processors, as it violates Bernstein's conditions for parallel computing. It would be infeasible to keep the processors sufficiently in sync with one another for this swap to have any significant advantage over traditional versions. However, it can be used to optimize swapping for a single processor with multiple load/store units.