Not to be confused with the related concept of the Wave equation
Quantum mechanics |
---|
|
Introduction Glossary · History |
Background
|
Fundamental concepts
|
Formulations
|
Equations
|
Advanced topics
|
Scientists
Bell · Bohm · Bohr · Born · Bose
de Broglie · Dirac · Ehrenfest Everett · Feynman · Heisenberg Jordan · Kramers · von Neumann Pauli · Planck · Schrödinger Sommerfeld · Wien · Wigner |
A wave function or wavefunction is a probability amplitude in quantum mechanics describing the quantum state of a particle and how it behaves. Typically, its values are complex numbers and, for a single particle, it is a function of space and time. The laws of quantum mechanics (the Schrödinger equation) describe how the wave function evolves over time. The wave function behaves qualitatively like other waves, like water waves or waves on a string, because the Schrödinger equation is mathematically a type of wave equation. This explains the name "wave function", and gives rise to wave-particle duality.
The most common symbols for a wave function are ψ or Ψ (lower-case and capital psi).
Although ψ is a complex number, |ψ|2 is real, and corresponds to the probability density of finding a particle in a given place at a given time, if the particle's position is measured.
The SI units for ψ depend on the system. For one particle in three dimensions, its units are m–3/2. These unusual units are required so that an integral of |ψ|2 over a region of three-dimensional space is a unitless probability (i.e., the probability that the particle is in that region). For different numbers of particles and/or dimensions, the units may be different (though can be determined by dimensional analysis).
The wave function is absolutely central to quantum mechanics: it makes the subject what it is. Also; it is the source of the mysterious consequences and philosophical difficulties in what quantum mechanics means in nature, and even how nature itself behaves at the atomic scale and beyond - which continue in debate to this day.
In the 1920s and 1930s, there were two divisions (so to speak) of theoretical physicists who simultaneously founded quantum mechanics: one for calculus and one for linear algebra. Those who used the techniques of calculus included Louis de Broglie, Erwin Schrödinger, Paul Dirac, Hermann Weyl, Oskar Klein, Walter Gordon, Douglas Hartree and Vladimir Fock. This hand of quantum mechanics became known as "wave mechanics". Those who applied the methods of linear algebra included Werner Heisenberg, Max Born, Wolfgang Pauli and John Slater. This other hand of quantum mechanics came to be called "matrix mechanics". Schrödinger was one who subsequently showed that the two approaches were equivalent.[1] In each case, the wavefunction was at the centre of attention in two forms, giving quantum mechanics its unity.
De Broglie could be considered the founder of the wave model in 1925, due to his symmetric relation between momentum and wavelength: the De Broglie equation. Schrödinger searched for an equation that would describe these waves, and was the first to construct and publish an equation for which the wave function satisfied in 1926, based on classical energy conservation. Indeed it is now called the Schrödinger equation. However, no-one, even Schrödinger and De Broglie, were clear on how to interpret it. What did this function mean? [2] Around 1924–27, Born, Heisenberg, Bohr and others provided the perspective of probability amplitude.[3] This is the Copenhagen interpretation of quantum mechanics. There are many other interpretations of quantum mechanics, but this is considered the most important - since quantum calculations can be understood.
In 1927, Hartree and Fock made the first step in an attempt to solve the N-body wave function, and developed the self-consistency cycle: an iterative algorithm to approximate the solution. Now it is also known as the Hartree–Fock method.[4] The Slater determinant and permanent (of a matrix) was part of the method, provided by Slater.
Interestingly, Schrödinger did encounter an equation for which the wave function satisfied relativistic energy conservation before he published the non-relativistic one, but it lead to unacceptable consequences for that time so he discarded it.[5] In 1927, Klein, Gorden and Fock also found it, but taking a step further: enmeshed the electromagnetic interaction into it and proved it was Lorentz-invariant. De Broglie also arrived at exactly the same equation in 1928. This wave equation is now known most commonly as the Klein–Gordon equation.[6]
By 1928 Dirac deduced his equation from the first successful unified combination of special relativity and quantum mechanics to the electron - the Dirac equation. He found an unusual character of the wavefunction for this equation: it was not a single complex number, but a spinor.[7] Spin automatically entered into the properties of the wavefunction. Although there were problems, Dirac was capable of resolving them. Around the same time Weyl also found his relativistic equation, which also had spinor solutions. Later other wave equations were developed: see Relativistic wave equations for further information.
Multivariable calculus and analysis (study of functions, change etc.) can be used to represent the wavefunction in a number of situations. Superficially, this formalism is simple to understand for the following reasons.
Although these functions are continuous, they are not deterministic; rather, they are probability distributions. Perhaps oddly, this approach is not the most general way to represent probability amplitudes. The more advanced techniques use linear algebra (the study of vectors, matrices, etc.) and, more generally still, abstract algebra (algebraic structures, generalizations of Euclidean spaces etc.).
The set of all possible wave functions (at any given time) forms an abstract mathematical vector space. Specifically, the entire wave function is treated as a single abstract vector:
where is a column vector written in bra-ket notation. The statement that "wave functions form an abstract vector space" simply means that it is possible to add together different wave functions, and multiply wave functions by complex numbers (see vector space for details). (Technically, because of the normalization condition, wave functions form a projective space rather than an ordinary vector space.) This vector space is infinite-dimensional, because there is no finite set of functions which can be added together in various combinations to create every possible function. Also, it is a Hilbert space, because the inner product of wave functions and can be defined as
where * denotes complex conjugate.
There are several advantages to understanding wave functions as elements of an abstract vector space:
Given an isolated physical system, the allowed states of this system (i.e. the states the system could occupy without violating the laws of physics) are part of a Hilbert space H. Some properties of such a space are
Physically, the nature of the inner product is dependent on the basis in use, because the basis is chosen to reflect the quantum state of the system.
When the basis is a countable set and orthonormal, that is
then an arbitrary vector can be expressed as
where the components are the (complex) numbers This wave function is known as a discrete spectrum, since the bases are discrete.
When the basis is an uncountable set, the orthonormality condition holds similarly,
then an arbitrary vector can be expressed as
where the components are the functions This wave function is known as a continuous spectrum, since the bases are continuous.
Paramount to the analysis is the Kronecker delta, , and the Dirac delta function, , since the bases used are orthonormal. More detailed discussion of wave functions as elements of vector spaces is below, following further definitions.
The wavefunction must satisfy the following constraints for the calculations and physical interpretation to make sense:[8]
If these requirements are not met; it is not possible to interpret the wavefunction as a probability amplitude. The values of the wavefunction and its first order derivatives may not be finite and definite (exactly one value), i.e. probabilities can be infinite and multiple-valued at any one position and time - which is nonsense and doesn't satisfy the probability axioms. Furthermore, when using the wavefunction to calculate a measurable observable of the quantum system, there will not be finite or definite values to calculate from - in this case the observable can take a number of values and can be infinite. This is unphysical and not observed when measuring in an experiment. Hence a wavefunction is meaningful only if these conditions are satisfied.
Although the wavefunction contains information, it is a complex number valued quantity; only its relative phase and relative magnitude can be measured. It does not directly tell anything about the magnitudes or directions of measurable observables. An operator extracts this information by acting on the wavefunction ψ. For details and examples on how quantum mechanical operators act on the wave function, commutation of operators, and expectation values of operators; see Operator (physics).
For now, consider the simple case of a single particle, without spin, in one spatial dimension. (More general cases are discussed below). The state of such a particle is completely described by its wave function:
where x is position and t is time. This function is complex-valued, meaning that is a complex number.
If the particle's position is measured, its location is not deterministic, but is described by a probability distribution. The probability that its position x will be in the interval (meaning a ≤ x ≤ b) is:
where t is the time at which the particle was measured. In other words, is the probability density that the particle is at x, rather than some other location.
This leads to the normalization condition:
because if the particle is measured, there is 100% probability that it will be somewhere.
The particle also has a wave function in momentum space:
where p is the momentum in one dimension, which can be any value from to , and t is time. If the particle's momentum is measured, the result is not deterministic, but is described by a probability distribution:
analogous to the position case. The position-space and momentum-space wave functions are Fourier transforms of each other, therefore both contain the same information, and either one alone is sufficient to calculate any property of the particle. For one-dimension:[9]
Sometimes the wave-vector k is used in place of momentum p, since they are related by the de Broglie relation
and the equivalent space is referred to as k-space. Again it makes no difference which is used since p and k are equivalent - up to a constant. In practice, the position-space wavefunction is used much more often than the momentum-space wavefunction.
The previous wavefunction can be generalized to incorporate N particles in one dimension:
The probability that particle 1 is in an x-interval R1 = [a1,b1] and particle 2 in interval R2 = [a2,b2] etc., up to particle N in interval RN = [aN,bN], all measured simultaneously at time t, is given by:
The normalization condition becomes:
In each case, there are N one-dimensional integrals, one for each particle.
The position-space wave function of a single particle in three spatial dimensions is similar to the case of one spatial dimension above:
where r is the position in three-dimensional space (r is short for (x,y,z)), and t is time. As always is a complex number. If the particle's position is measured at time t, the probability that it is in a region R is:
(a three-dimensional integral over the region R, with differential volume element d3r, also written "dV" or "dx dy dz"). The normalization condition is:
where is all of three-dimensional space.
There is a corresponding momentum space wavefunction for three-dimensions also:
where p is the momentum in 3-dimensional space, and t is time. This time there are three components of momentum which can have values to in each direction, in Cartesian coordinates x, y, z.
The probability of measuring the momentum components px between a and b, py between c and d, and pz between e and f, is given by:
hence the normalization:
analogous to space, d3p = dpxdpydpz. The generalization of the previous Fourier transform is [10]
When there are many particles, in general there is only one wave function, not a separate wave function for each particle. The fact that one wave function describes many particles is what makes quantum entanglement and the EPR paradox possible. The position-space wave function for N particles is written:[11]
where ri is the position of the ith particle in three-dimensional space, and t is time. If the particles' positions are all measured simultaneously at time t, the probability that particle 1 is in region R1 and particle 2 is in region R2 and so on is:
The normalization condition is:
(altogether, this is 3N one-dimensional integrals).
In quantum mechanics there is a fundamental distinction between identical particles and distinguishable particles. For example, any two electrons are fundamentally indistinguishable from each other; the laws of physics make it impossible to "stamp an identification number" on a certain electron to keep track of it.[12] This translates to a requirement on the wavefunction: For example, if particles 1 and 2 are indistinguishable, then:
where the + sign is required if the particles are bosons, and the – sign is required if they are fermions. More exactly stated:
where s = spin quantum number,
The wavefunction is said to be symmetric (no sign change) under boson interchange and antisymmetric (sign changes) under fermion interchange. This feature of the wavefunction is known as the Pauli principle.
For N interacting particles, i.e. particles which interact mutually and constitute a many-body system, the wavefunction is a function of all positions of the particles and time, it can't be separated into the separate wavefunctions of the particles. However, for non-interacting particles, i.e. particles which do not interact mutually and move independently, in a time-independent potential, the wavefunction can be separated into the product of separate wavefunctions for each particle: [13]
For a particle with spin, the wave function can be written in "position-spin-space" as:
where r is a position in three-dimensional space, t is time, and sz is the spin projection quantum number along the z axis. (The z axis is an arbitrary choice; other axes can be used instead if the wave function is transformed appropriately, see below.) The sz parameter, unlike r and t, is a discrete variable. For example, for a spin-1/2 particle, sz can only be +1/2 or -1/2, and not any other value. (In general, for spin s, sz can be s, s–1,...,–s.) If the particle's position and spin is measured simultaneously at time t, the probability that its position is in R1 and its spin projection quantum number is a certain value m is:
The normalization condition is:
where is all of three-dimensional space. Since the spin quantum number has discrete values, it must be written as a sum rather than an integral, taken over all possible values.
Likewise, the wavefunction for N particles each with spin is:
The probability that particle 1 is in region R1 with spin sz1 = m1 and particle 2 is in region R2 with spin sz2 = m2 etc. reads (probability subscripts now removed due to their great length):
The normalization condition is:
Now there are 3N one-dimensional integrals followed by N sums.
Again, for non-interacting particles in a time-independent potential the wavefunction is the product of separate wavefunctions for each particle: [14]
As explained above, quantum states are always vectors in an abstract vector space (technically, a complex projective Hilbert space). For the wave functions above, the Hilbert space usually has not only infinite dimensions, but uncountably infinitely many dimensions. On the hand, linear algebra is much simpler for finite-dimensional vector spaces. Therefore it is helpful to look at an example where the Hilbert space of wave functions is finite dimensional.
A wave function describes the state of a physical system , by expanding it in terms of other possible states of the same system - collectively referred to as a basis or representation . In what follows, all wave functions are assumed to be normalized.
An element of a vector space can be expressed in different bases elements; and so the same applies to wave functions. The components of a wave function describing the same physical state take different complex values depending on the basis being used; however, just like elements of a vector space, the wave function itself is independent on the basis chosen. Choosing a new coordinate system does not change the vector itself, only the representation of the vector with respect to the new coordinate frame, since the components will be different but the linear combination of them still equals the vector.
To start, consider the finite basis representation. A wave function with n components describes how to express the state of the physical system as the linear combination of n basis elements , (i = 1, 2...n). The following is a breakdown of the used formalism.
Conventional vector: Ψ and conventional notation
As a column vector or column matrix:
State vector: Ψ and bra-ket notation
Equivalently in bra-ket notation, the state of a particle with wave function Ψ can be written as a ket;
The corresponding bra is the complex conjugate of the transposed matrix (into a row matrix/row vector):
By "the state of a particle with wavefunction Ψ", written as , this means the variables which characterize the system, with respect to the wavefunction. The wave function associated with a particular state may be seen as an expansion of the state in a basis of . For example, a basis could be for a free particle travelling in one dimension, with momentum eigenstates ψ± corresponding to the ±x direction:
Another example is the superposition of two energy eigenstates for a particle trapped in a 1-d box (these states are stationary state):
The most characteristic example is a particle in a spin up or down configuration:
(see below for details of this frequent case). Notice how kets are not completely analogous to the ordinary notion of vectors - rather they are labels for a state of a wavefunction, which are used in a similar way. In all of the above examples, the particle is not in any one definite or preferred state, but rather in both at the same time - hence the term superposition. The free particle could be have momentum in the +x or –x direction simultaneously, the trapped particle in the 1-d potential well can be in the energy eigenstates corresponding to eigenvalues E1 and E2 at the same time, the particle with spin could be in spin up or down orientation at any instant of time. The relative chance of which state occurs is related to the (moduli squares of the) coefficients.
The choice of basis vectors is important, as two column vectors with the same components can represent two different states of a system if their associated basis states are different.
To illustrate this, let have the bases and let have bases , i.e.
which implies
If for each index i;
it then follows
The orthonormality relation is:
The physical meaning of the components of is given by the wave function collapse postulate:
If the states have distinct, definite values, λi, of some observable (momentum, position, etc.) and a measurement of that variable is performed on a system in the state
then the probability of measuring λi is . If the measurement yields λi, the system remains in the state . That is, the wavefunction collapses from to :
The sum of the probabilities of all possible states must sum to 1 (see normalization using kets below), demanding:
Each row or column matrix entry corresponds to a coefficient (of a ket) in the linear combination. The equality
can be verified using the orthonormality relation,
which allows any component to be found simply by multiplying by . For component q (between 1 and n),
The wavefunction for position and momentum space, respectively, can be written using bra-ket notation in one dimension as:
for three dimensions:
All read: the probability amplitude of a particle in state at position r or momentum p (in the relevant number of dimensions).[15]
Note that (say) is not the same as . The former is the state of the particle, whereas the latter is simply a wave function describing how to express the former as a superposition of states with definite position.
Suppose we have another wavefunction in the same basis:
then the inner product can be defined as:
that is
The outer product of two bra-ket vectors is defined as:
Summation over the inner product of like bases kets leads to the closure relation:
The equality to unity is not due to normalization, but because this is an identity operator (its action on anything leaves it unchanged). This can be used to obtain the ket wavefunction as a superposition of its basis vectors simply multiplying by the state of the wavefunction :
which was a previous statement. Also the inner product can be obtained:
Starting from:
taking the inner product (and recalling orthonormality; ):
This expression means the projection of a complex probability amplitude onto itself is real. Since it is a probability amplitude, normalization requires this product to be unity, because it is equal to the sum of all possible quantum states (probabilities of these states occurring):
Collecting equivalences together:
As a corollary, the normalization constant N can be written in terms of the coefficients of the vector components. Normally N is inserted into the eigenfunctions - then the normalizing sum of the squares of eiegnfunctions is done. Since |ψ|2 is calculated, N2 factors out of the sum rather than N. Even if ψ is left non-normalized then N2 can be inserted in front of the sum, then setting this multiplied sum to 1 will normalize ψ. Either way - N satisfies:
hence
A normalizing constant scalar multiplies an non-normalized wavefunction ψ to obtain the normalized wavefunction ψ':
A simple and important case is a spin-½ particle, but for this instance ignore its spatial degrees of freedom. Using the definition above, the wave function can now be written without position dependence:
where again is the spin quantum number in the z-direction, either +1/2 or -1/2. So at a given time t, is completely characterized by just the two complex numbers Ψ(+1/2,t) and Ψ(–1/2,t). For simplicity these are often written as Ψ(+1/2,t) ≡ Ψ+ ≡ Ψ↑, and Ψ(–1/2,t) ≡ Ψ– ≡ Ψ↓ respectively. This is still called a "wave function", even though in this situation it has no resemblance to familiar waves (like mechanical waves), being only a pair of numbers instead of a continuous function.
Using the above formalism, the two numbers characterizing the wave function can be written as a column vector:
where and . Therefore the set of all possible wave functions is a two dimensional complex vector space. If the particle's spin projection in the z-direction is measured, it will be spin up (+1/2 ≡ ↑z) with probability , and spin down (–1/2 ≡ ↓z) with probability .
In bra-ket notation this can be written:
using the basis vectors (in alternate notations)
The normalization requirement is
which says the probability of the particle in the spin up state (↑z, corresponding to the coefficient c1) plus the probability in the spin down(↓z, corresponding to the coefficient c2) state is 1.
To see this explicitly for this case, expand the ket in terms of the bases:
implying
taking the inner product (and recalling orthonormality) leads to the normalization condition:
The case of a countably infinite vector, with a discrete index, is treated and interpreted in the same manner as a finite vector, except the sum is extended over an infinite number of basis elements.
Conventional vector: Ψ and conventional notation
As a column vector or column matrix, there are infinitely many entries:
State vector: Ψ and bra-ket notation
In bra-ket notation;
The corresponding bra is as before:
Now consider an uncountably infinite number of components of the physical state of the particle, . For this reason the collection of all states is known as a continuum or spectrum of states. Finite or countably infinite basis vectors are summed over a discrete index - for a continuous basis the integral is over the continuous index, replacing the sum.
Continuously indexed vector: Ψ and bra-ket notation
As usual is the physical state of the particle. The sum for a superposition of states now becomes an integral. In what follows, all integrals are with respect to the basis variable ϕ, over the required range. Usually this is just the real line or subset intervals of it. The state is given by:
See below for more on notation of basis and components.
As with the discrete bases, some symbol is used to denotes the basis states, again written in the form , so that is the span of all these basis states. The symbol usually corresponds to some property or observable, but for generality any letter may be used. The general state is written , a particular state may be written as (say) subscripted or primed: or . Alternatively and equivalently, denotes a basis ket for the state of ψ corresponding to the observable ϕ, the meaning is the same. In short the general basis can be written , and a particular basis is .
The basis states are given by:
which can be deduced from the orthonormality and closure relations given below. The orthonormality relation is:
The components of the state are still , that is the projection of the wavefunction onto some basis is a component. This is a function of the basis variable ϕ, sometimes written using another symbol such as or more usually the same as the physical state , since the component of the state ψ corresponds to the basis ϕ. That is: . It is still true that is the probability density of measuring the observable ϕ. In what follows both alternatives are repeated to connect analogous notation with the previous summed countable states, and illustrate equivalence between notations used in the literature.
Given two states in the same basis:
the inner product becomes
that is
The outer product is still:
integrating over the inner product of like bases kets leads to the analogous closure relation:
Multiplying by the state of the wavefunction obtains the ket wavefunction as a superposition of its basis vectors:
Also the inner product can be obtained:
Starting from:
taking the inner product;
Since is the probability density of measuring the observable ϕ in the state ψ, this integral must be 1 as before:
Collecting equivalences together:
The normalization constant N for the continuous case can be manipulated in a similar way to the discrete case. The normalization constant N can be written in terms of the component continuum. Normally N is inserted into the eigenfunctions - then the normalizing integral of the squares of eiegnfunctions is done. Again |ψ|2 is calculated, and N2 factors out of the integrand rather than N. Even if ψ is left non-normalized then N2 can be inserted in front of the integral, then setting this multiplied integral to 1 will normalize ψ. Either way - N satisfies:
hence
An example of this is the spatial wave function of a particle. First consider one-dimension; the x-axis or real line (as in the one-dimensional cases above). Then can be expanded in terms of a continuum of states with definite position, , in the following way.
The basis states are the one-dimensional position states: . Consider the region that the particle may occupy, given by the interval R = [a, b], then this implies the basis vectors are all the possible positions between x = a and x = b.
The components are: .
Therefore the state for the wavefunction is:
In this case a base state can be expressed in terms of all possible basis states as:
The spatial state of the wave function associated with the position state is by orthogonality
We have the identity as another corollary of the closure relation, relating ψ(x) to a basis position x0;
Taking the inner product of with itself leads to the inner product stated at the beginning of this article (in that case and ):
For the inclusion of time dependence, simply attach the time coordinate to all basis states by the replacements
However no integration with respect to t should be done because t is a constant; the instant x is measured to take some value in the interval a to b. For the momentum analogue, simply make the replacement .
The generalization of the previous result is straightforward. In three dimensions, can be expanded in terms of a continuum of states with definite position, as follows.
The basis states are the three-dimensional position states: . Let the 3-dimensional region that the particle may occupy be R.
The components are: .
Therefore the state for the wavefunction is:
In this case a base state can be expressed in terms of all possible basis states as:
where the 3d Dirac-δ function is generalized to:
The spatial state of the wave function associated with the position state is by orthogonally
Again we have the identity as another corollary of the closure relation, relating ψ(r) to a basis position r0;
Taking the inner product of with itself leads to the normalization conditions in the three-dimensional definitions above:
In short, the above expressions take the same form for any number of spatial dimensions.
For a particle, with spin, in all three spatial dimensions, the wavefunction is
in which the basis states are a combination of the descrete variable sz(z-component of spin) and the continuous variable r (position) [16]:
Since the particle has some position and a value of spin, the wavefunction can be written as a product of states, the probability amplitude that the particle is at position r with spin sz:[17]
i.e. we can write:
Applying what we have above, the identity operators are:
where mi are all possible values of sz, leading to:
we can write so the closure relation is:
which implies
and the inner product:
Again this also leads directly to the normalization condition by setting the inner product to unity.
Whether the wave function really exists, and what it represents, are major questions in the interpretation of quantum mechanics. Many famous physicists of a previous generation puzzled over this problem, such as Schrödinger, Einstein and Bohr. Some advocate formulations or variants of the Copenhagen interpretation (e.g. Bohr, Wigner and von Neuman) while others, such as Wheeler or Jaynes, take the more classical approach[18] and regard the wave function as representing information in the mind of the observer, i.e. a measure of our knowledge of reality. Some, ranging from Schrödinger, Einstein, Bohm and Everett and others, argued that the wave function must have an objective, physical existence. The later argument was recently supported by the demonstration (not peer reviewed) of a theorem stating the physical reality of the quantum state[19] . For more on this topic, see Interpretations of quantum mechanics.
Here are examples of wavefunctions for specific applications:
2.Quantum Mechanics(Non-Relativistic Theory), L.D. Landau and E.M. Lifshitz, ISBN 0-08-020940-8