Virtual method table
A virtual method table, virtual function table, virtual call table, dispatch table, vtable, or vftable is a mechanism used in a programming language to support dynamic dispatch (or run-time method binding).
Whenever a class defines a virtual function (or method), most compilers add a hidden member variable to the class which points to an array of pointers to (virtual) functions called the virtual method table (VMT or Vtable). At runtime these pointers will be set to point to the right function, because at compile time, it is not yet known if the base function is to be called or a derived one implemented by a class that inherits from the base class.
Suppose a program contains several classes in an inheritance hierarchy: a superclass, Cat
, and two subclasses, HouseCat
and Lion
. Class Cat
defines a virtual function named speak
, so its subclasses may provide an appropriate implementation (e.g. either meow
or roar
).
When the program calls the speak
method on a Cat
pointer (which can point to a Cat
class, or any subclass of Cat
), the calling code must be able to determine which implementation to call, depending on the actual type of object that is pointed to. Because the type of object pointed to by the Cat
pointer is not determined at compile-time, the decision as to which branch to take cannot be decided at compile-time.
There are a variety of different ways to implement such dynamic dispatch, but the vtable (virtual table) solution is especially common among C++ and related languages (such as D and C#). Languages which separate the programmatic interface of objects from the implementation, like Visual Basic and Delphi, also tend to use the vtable approach, because it allows objects to use a different implementation simply by using a different set of method pointers.
Implementation
An object's dispatch table will contain the addresses of the object's dynamically bound methods. Method calls are performed by fetching the method's address from the object's dispatch table. The dispatch table is the same for all objects belonging to the same class, and is therefore typically shared between them. Objects belonging to type-compatible classes (for example siblings in an inheritance hierarchy) will have dispatch tables with the same layout: the address of a given method will appear at the same offset for all type-compatible classes. Thus, fetching the method's address from a given dispatch table offset will get the method corresponding to the object's actual class.[1]
The C++ standards do not mandate exactly how dynamic dispatch must be implemented, but compilers generally use minor variations on the same basic model.
Typically, the compiler creates a separate vtable for each class. When an object is created, a pointer to this vtable, called the virtual table pointer, vpointer or VPTR, is added as a hidden member of this object. As such, the compiler must also generate "hidden" code in the constructor of each class to initialize a new objects' vpointer to the address of its class's vtable.
Many compilers place the vpointer as the last member of the object; other compilers place the vpointer as the first member of the object; portable source code works either way.[2] For example, g++ previously placed the vpointer at the end of the object.[3]
Example
Consider the following class declarations in C++ syntax:
class B1 {
public:
void f0() {}
virtual void f1() {}
int int_in_b1;
};
class B2 {
public:
virtual void f2() {}
int int_in_b2;
};
used to derive the following class:
class D : public B1, public B2 {
public:
void d() {}
void f2() {} // override B2::f2()
int int_in_d;
};
and the following piece of C++ code:
B2 *b2 = new B2();
D *d = new D();
g++ 3.4.6 from GCC produces the following 32-bit memory layout for the object b2
:[nb 1]
b2: +0: pointer to virtual method table of B2 +4: value of int_in_b2 virtual method table of B2: +0: B2::f2()
and the following memory layout for the object d
:
d: +0: pointer to virtual method table of D (for B1) +4: value of int_in_b1 +8: pointer to virtual method table of D (for B2) +12: value of int_in_b2 +16: value of int_in_d Total size: 20 Bytes. virtual method table of D (for B1): +0: B1::f1() // B1::f1() is not overridden virtual method table of D (for B2): +0: D::f2() // B2::f2() is overridden by D::f2()
Note that those functions not carrying the keyword virtual
in their declaration (such as f0()
and d()
) do not generally appear in the vtable. There are exceptions for special cases as posed by the default constructor.
Overriding of the method f2()
in class D
is implemented by duplicating the virtual method table of B2
and replacing the pointer to B2::f2()
with a pointer to D::f2()
.
Multiple inheritance and thunks
The g++ compiler implements the multiple inheritance of the classes B1
and B2
in class D
using two virtual method tables, one for each base class. (There are other ways to implement multiple inheritance, but this is the most common.) This leads to the necessity for "pointer fixups", also called thunks, when casting.
Consider the following C++ code:
D *d = new D();
B1 *b1 = static_cast<B1*>(d);
B2 *b2 = static_cast<B2*>(d);
While d
and b1
will point to the same memory location after execution of this code, b2
will point to the location d+8
(eight bytes beyond the memory location of d
). Thus, b2
points to the region within d
which "looks like" an instance of B2
, i.e., has the same memory layout as an instance of B2
.
Invocation
A call to d->f1()
is handled by dereferencing d
's D::B1
vpointer, looking up the f1
entry in the vtable, and then dereferencing that pointer to call the code.
In the case of single inheritance (or in a language with only single inheritance), if the vpointer is always the first element in d
(as it is with many compilers), this reduces to the following pseudo-C++:
(*((*d)[0]))(d)
Where *d refers to the virtual method table of D and [0] refers to the first method in the vtable. The parameter d becomes the "this" pointer to the object.
In the more general case, calling B1::f1()
or D::f2()
is more complicated:
(*(*(d[+0]/*pointer to virtual method table of D (for B1)*/)[0]))(d) /* Call d->f1() */
(*(*(d[+8]/*pointer to virtual method table of D (for B2)*/)[0]))(d+8) /* Call d->f2() */
The call to d->f1() passes a B1 pointer as a parameter. The call to d->f2() passes a B2 pointer as a parameter. This second call requires a fixup to produce the correct pointer. It is impossible to call B2::f2 since it has been overridden in D's implementation. The location of B2::f2 is not in the vtable for D.
By comparison, a call to d->f0()
is much simpler:
(*B1::f0)(d)
Efficiency
A virtual call requires at least an extra indexed dereference, and sometimes a "fixup" addition, compared to a non-virtual call, which is simply a jump to a compiled-in pointer. Therefore, calling virtual functions is inherently slower than calling non-virtual functions. An experiment done in 1996 indicates that approximately 6–13% of execution time is spent simply dispatching to the correct function, though the overhead can be as high as 50%.[4] The cost of virtual functions may not be so high on modern CPU architectures due to much larger caches and better branch prediction.
Furthermore, in environments where JIT compilation is not in use, virtual function calls usually cannot be inlined. In certain cases it may be possible for the compiler to perform a process known as devirtualization in which, for instance, the lookup and indirect call are replaced with a conditional execution of each inlined body, but such optimizations are not common.
To avoid this overhead, compilers usually avoid using vtables whenever the call can be resolved at compile time.
Thus, the call to f1
above may not require a vtable lookup because the compiler may be able to tell that d
can only hold a D
at this point, and D
does not override f1
. Or the compiler (or optimizer) may be able to detect that there are no subclasses of B1
anywhere in the program that override f1
. The call to B1::f1
or B2::f2
will probably not require a vtable lookup because the implementation is specified explicitly (although it does still require the 'this'-pointer fixup).
Something that should be noted is however that a virtual call is a conditional expression. If the virtual call is removed by removing the inheritance structure then the decision about which function to call (the base or the derived) still needs to be evaluated in code. This will require evaluation code at each position of the code where the class is used so dynamic dispatch isn't really a performance overhead and actually reduces code size compared to not using it.
Comparison with alternatives
The vtable is generally a good performance trade-off to achieve dynamic dispatch, but there are alternatives, such as binary tree dispatch, with higher performance but different costs.[5]
However, vtables only allow for single dispatch on the special "this" parameter, in contrast to multiple dispatch (as in CLOS or Dylan), where the types of all parameters can be taken into account in dispatching.
Vtables also only work if dispatching is constrained to a known set of methods, so they can be placed in a simple array built at compile time, in contrast to duck typing languages (such as Smalltalk, Python or JavaScript).
Languages that provide either or both of these features often dispatch by looking up a string in a hash table, or some other equivalent method. There are a variety of techniques to make this faster (e.g., interning/tokenizing method names, caching lookups, just-in-time compilation).
See also
Notes
- ↑ G++'s
-fdump-class-hierarchy
argument can be used to dump virtual method tables for manual inspection. For AIX VisualAge XlC compiler, use-qdump_class_hierarchy
to dump class hierarchy and virtual function table layout.
References
- Margaret A. Ellis and Bjarne Stroustrup (1990) The Annotated C++ Reference Manual. Reading, MA: Addison-Wesley. (ISBN 0-201-51459-1)
- ↑ Ellis & Stroustrup 1990, pp. 227–232
- ↑ Danny Kalev. "C++ Reference Guide: The Object Model II". 2003. Heading "Inheritance and Polymorphism" and "Multiple Inheritance".
- ↑ C++ ABI Closed Issues at the Wayback Machine (archived July 25, 2011)
- ↑ Driesen, Karel and Hölzle, Urs, "The Direct Cost of Virtual Function Calls in C++", OOPSLA 1996
- ↑ Zendra, Olivier and Driesen, Karel, "Stress-testing Control Structures for Dynamic Dispatch in Java", Pp. 105–118, Proceedings of the USENIX 2nd Java Virtual Machine Research and Technology Symposium, 2002 (JVM '02)
|