Reference (C++)
From Wikipedia, the free encyclopedia
In the C++ programming language, a reference is a simple reference datatype that is less powerful but safer than the pointer type inherited from C. A pointer is also a reference in the general sense of the word, but not in the sense used by C++.
Syntax and terminology
A declaration of the form
<Type> & <Name>...
where <Type> is a type and <Name> is an identifier, defines an identifier whose type is reference to <Type>.
Examples:
int A = 5;
int& rA = A;
extern int& rB;
int& foo ();
void bar (int& rP);
class MyClass { int& m_b; /* ... */ };
int funcX() { return 42; }
int (&xFunc)() = funcX;
Here, rA and rB are of type "reference to int", foo() is a function that returns a reference to int, bar() is a function with a reference parameter, which is a reference to int, MyClass is a class with a member which is a reference to int, funcX() is a function that returns an int, and xFunc is a reference to (that is, an alias for) funcX.
Types which are of kind "reference to <Type>" are sometimes called reference types. Identifiers which are of reference type are called reference variables. To call them variables, however, is in fact a misnomer, as we will see.
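The aliasing behaviour of the declarations above can be illustrated with a minimal sketch. The function name reference_is_alias is hypothetical and chosen only for this illustration; funcX mirrors the example above.

```cpp
#include <cassert>

// Mirrors funcX from the examples above.
int funcX() { return 42; }

// Hypothetical helper: returns true if the references below behave as
// aliases for the entities they were bound to.
bool reference_is_alias() {
    int A = 5;
    int& rA = A;             // rA is another name for A
    rA = 7;                  // writes through to A
    assert(A == 7);

    int (&xFunc)() = funcX;  // reference to a function

    // Taking the address of rA yields the address of A itself.
    return &rA == &A && xFunc() == 42;
}
```

Because rA has no identity separate from A, there is nothing "variable" about it once it is bound, which is why the term reference variable is loose.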
Relationship to pointers
C++ references differ from pointers in several essential ways:
- It is not possible to refer to a reference object directly after it is defined; any occurrence of its name refers directly to the object it references.
- As a consequence of the first point, neither arithmetic, casts, nor any other operation can be performed on references except copying their binding into other references.
- Once a reference is created, it cannot later be made to reference another object; we say it cannot be reseated. Pointers, by contrast, are often reseated.
- References cannot be null, whereas pointers can; every reference refers to some object, although it may or may not be valid.
- References cannot be uninitialized. Because it is impossible to reinitialize a reference, it must be initialized as soon as it is created. In particular, local and global references must be initialized where they are defined, and references which are data members of class instances must be initialized in the initializer list of the class's constructor.
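Two of the points above, that a reference cannot be reseated and that a reference member must be bound in the constructor's initializer list, can be sketched as follows. The class name Holder and both function names are hypothetical and exist only for this illustration.

```cpp
#include <cassert>

// Hypothetical class with a reference member, in the spirit of MyClass above.
class Holder {
public:
    // The reference member must be bound in the initializer list;
    // it cannot be assigned a binding in the constructor body.
    explicit Holder(int& target) : m_b(target) {}
    void set(int v) { m_b = v; }  // writes to the referenced object
private:
    int& m_b;
};

// Assigning to a reference copies a value; it does not reseat the reference.
int assign_is_not_reseat() {
    int a = 1;
    int b = 2;
    int& r = a;
    r = b;             // a becomes 2; r still refers to a
    assert(&r == &a);  // the binding is unchanged
    b = 99;            // has no effect on a
    return a;
}

// Writing through a reference member changes the object it was bound to.
int write_through_member() {
    int x = 0;
    Holder h(x);
    h.set(10);
    return x;
}
```

With a pointer, the equivalent of r = b would be written p = &b, after which changes to b would be visible through p.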
There is a simple conversion between pointers and references: the address-of operator (&) will yield a pointer referring to the same object when applied to a reference, and a reference which is initialized from the dereference (*) of a pointer value will refer to the same object as that pointer, where this is possible without invoking undefined behavior. This equivalence is a reflection of the typical implementation, which effectively compiles references into pointers which are implicitly dereferenced at each use.
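The conversion in both directions can be sketched as below. The function name pointer_reference_round_trip is hypothetical; the pointer used is non-null and points to a live object, so the dereference is well defined.

```cpp
#include <cassert>

// Hypothetical helper: converts reference -> pointer -> reference and
// checks that all three names designate the same object.
bool pointer_reference_round_trip() {
    int value = 3;
    int& ref = value;
    int* ptr = &ref;    // address-of on a reference yields a pointer to value
    int& again = *ptr;  // a reference initialized from a dereferenced pointer
    again = 4;          // writes to value
    assert(value == 4);
    return ptr == &value && &again == &value;
}
```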
A consequence of this is that in many implementations, operating on a variable with automatic or static lifetime through a reference, although syntactically similar to accessing it directly, can involve hidden dereference operations that are costly.
Because the operations on references are so limited, they are much easier to reason about formally than pointers, and it is harder to cause errors with them. While pointers can be made invalid through a variety of mechanisms, ranging from carrying a null value to out-of-bounds arithmetic to illegal casts to producing them from random integers, a reference only becomes invalid in two cases:
- If it refers to an object with automatic allocation which goes out of scope,
- If it refers to an object inside a block of dynamic memory which has been freed.
The first is easy to detect automatically due to static scoping of variables; the second is more difficult to assure, but it is the only concern with references, and one suitably addressed by a reasonable allocation policy.
Uses of references
Besides serving as a convenient replacement for pointers, references are useful in function parameter lists, where they allow passing of parameters used for output with no explicit address-taking by the caller. For example:
void square(int x, int& result) { result = x*x; }
Then, given an int variable y, the following call would place 9 in y:
square(3, y);
However, the following call would give a compiler error, since reference parameters not qualified with const can only be bound to addressable values (lvalues):
square(3, 6);
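A compilable sketch of the two calls follows. The wrapper call_square and the function twice are hypothetical additions for illustration; twice shows the const-qualified case mentioned above, in which a literal is accepted.

```cpp
#include <cassert>

void square(int x, int& result) { result = x * x; }

// Hypothetical contrast case: a const reference parameter may bind to a
// temporary, so a literal argument is accepted.
int twice(const int& x) { return 2 * x; }

// Hypothetical wrapper around the calls discussed above.
int call_square() {
    int y = 0;
    square(3, y);     // result is bound to y, so y receives 9
    assert(y == 9);
    // square(3, 6);  // would not compile: 6 is not an lvalue
    return y;
}
```

Calling twice(6) compiles, whereas square(3, 6) does not, because only the former takes its reference parameter as const.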
Returning a reference also allows a surprising syntax in which function calls can be assigned to:
int& preinc(int& x) { ++x; return x; }
preinc(y) = 5; // same as ++y, y = 5
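The assignment syntax can be tried out with a short sketch. The wrappers assign_to_call and chained_calls are hypothetical; the second relies on the fact that the returned reference is itself an lvalue and can be passed on to another call.

```cpp
#include <cassert>

int& preinc(int& x) { ++x; return x; }

// Hypothetical wrapper: assigns to the result of a function call.
int assign_to_call() {
    int y = 1;
    preinc(y) = 5;  // y is incremented to 2, then overwritten with 5
    return y;
}

// Hypothetical wrapper: the returned reference feeds a second call.
int chained_calls() {
    int y = 1;
    int& r = preinc(preinc(y));  // y is incremented twice
    assert(&r == &y);            // the result still names y
    return y;
}
```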
Passing a parameter by value generally implies a copy operation, which can be expensive for large objects. References qualified with const are a useful way of passing large objects between functions that avoids this overhead:
void f_slow(BigObject x) { /* ... */ }
void f_fast(const BigObject& x) { /* ... */ }
BigObject y;
f_slow(y); // slow, copies y to parameter x
f_fast(y); // fast, gives direct read-only access to y
If f_fast() actually requires its own copy of x that it can modify, it must create a copy explicitly. While the same technique could be applied using pointers, this would involve modifying every call site of the function to add cumbersome address-of (&) operators to the argument, and would be equally difficult to undo, if the object became smaller later on.
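The difference in copying can be made visible with a sketch that counts copy-constructor calls. The definition of BigObject here, its copies counter, and the helper copies_during are all assumptions made for this illustration; the source leaves BigObject unspecified.

```cpp
#include <cassert>
#include <vector>

// Hypothetical stand-in for BigObject that counts how often it is copied.
struct BigObject {
    std::vector<int> data;
    static int copies;
    BigObject() : data(1000, 0) {}
    BigObject(const BigObject& other) : data(other.data) { ++copies; }
};
int BigObject::copies = 0;

// Pass by value: the argument is copied into x on every call.
void f_slow(BigObject x) { x.data[0] = 1; }

// Pass by const reference: no copy, read-only access to the caller's object.
int f_fast(const BigObject& x) { return x.data[0]; }

// Pass by const reference, but with an explicit local copy to modify.
void f_fast_with_copy(const BigObject& x) {
    BigObject local(x);  // explicit copy, made only because we modify it
    local.data[0] = 1;
}

// Hypothetical helper: number of copies made while running one of the calls.
// which: 0 = f_slow, 1 = f_fast, 2 = f_fast_with_copy
int copies_during(int which) {
    BigObject y;
    int before = BigObject::copies;
    if (which == 0) f_slow(y);
    else if (which == 1) f_fast(y);
    else f_fast_with_copy(y);
    assert(y.data[0] == 0);  // the caller's object is never modified
    return BigObject::copies - before;
}
```

Under these assumptions, f_slow and f_fast_with_copy each make one copy per call, while f_fast makes none.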
Quotes
References are defined by ISO/IEC 14882:1998(E), the ISO C++ standard, in section 8.3.2 [dcl.ref], as follows (excluding the example section):
- In a declaration T D where D has the form
& D1
and the type of the identifier in the declaration T D1 is “derived-declarator-type-list T,” then the type of the identifier of D is “derived-declarator-type-list reference to T.” Cv-qualified references are ill-formed except when the cv-qualifiers (const and volatile) are introduced through the use of a typedef (7.1.3) or of a template type argument (14.3), in which case the cv-qualifiers are ignored. [Example: in
typedef int& A;
const A aref = 3; // ill-formed; non-const reference initialized with rvalue
the type of aref is “reference to int”, not “const reference to int”. ] [Note: a reference can be thought of as a name of an object. ] A declarator that specifies the type “reference to cv void” is ill-formed.
- It is unspecified whether or not a reference requires storage (3.7).
- There shall be no references to references, no arrays of references, and no pointers to references. The declaration of a reference shall contain an initializer (8.5.3) except when the declaration contains an explicit extern specifier (7.1.1), is a class member (9.2) declaration within a class declaration, or is the declaration of a parameter or a return type (8.3.5); see 3.1. A reference shall be initialized to refer to a valid object or function. [Note: in particular, a null reference cannot exist in a well-defined program, because the only way to create such a reference would be to bind it to the “object” obtained by dereferencing a null pointer, which causes undefined behavior. As described in 9.6, a reference cannot be bound directly to a bitfield. ]
References
- International Standard ISO/IEC 14882:1998(E). Programming Languages – C++.