Dynamic loading is a mechanism by which a computer program can, at run time, load a library (or other binary) into memory, retrieve the addresses of functions and variables contained in the library, execute those functions or access those variables, and unload the library from memory. Unlike static linking and load-time linking, this mechanism allows a computer program to start up in the absence of these libraries, to discover available libraries, and potentially to gain additional functionality.[1][2]
Dynamic loading was a common technique for IBM System/360 operating systems (from the 1960s through to the still-extant z/Architecture), particularly for I/O subroutines and for COBOL and PL/I runtime libraries. As far as the application programmer is concerned, the loading is largely transparent, since it is mostly handled by the operating system (or its I/O subsystem). The main advantages are:
IBM's strategic transaction processing system, CICS (1970s onwards), uses dynamic loading extensively, both for its kernel and for normal application program loading. Corrections to application programs can be made offline, and new copies of the changed programs loaded dynamically, without needing to restart CICS (which can, and frequently does, run 24/7).[3][4]
Dynamic loading is most frequently used in implementing software plugins.[1] For example, the Apache Web Server's *.dso "dynamic shared object" plugin files are libraries which are loaded at runtime with dynamic loading.[5] Dynamic loading is also used in implementing computer programs where multiple different libraries may supply the requisite functionality and where the user has the option to select which library or libraries to provide.
Not all systems support dynamic loading. UNIX-like operating systems such as Mac OS X, Linux, and Solaris provide dynamic loading with the C programming language "dl" library. The Windows operating system provides dynamic loading through the Windows API.
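Where several libraries can supply the same functionality, a program can try the candidates in order of preference and keep the first one that loads. The following is a minimal sketch for a POSIX system, not a definitive implementation; the function name load_first_available and its parameters are hypothetical and do not come from any library.

```c
#include <dlfcn.h>
#include <stddef.h>

/*
 * Try each candidate library name in order and return the handle of the
 * first one that loads, or NULL if none does. On success, *chosen (if it
 * is not NULL) receives the index of the candidate that was loaded.
 * (Hypothetical helper, for illustration only.)
 */
void *load_first_available(const char *const candidates[], size_t count,
                           size_t *chosen)
{
    for (size_t i = 0; i < count; i++) {
        void *handle = dlopen(candidates[i], RTLD_NOW | RTLD_LOCAL);
        if (handle != NULL) {
            if (chosen != NULL) {
                *chosen = i;
            }
            return handle;
        }
    }
    return NULL;
}
```

The candidate list could come from a configuration file or a command-line option, which is how the user's choice of library would typically reach the program. The caller remains responsible for eventually passing the returned handle to dlclose.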
Name | Standard POSIX/UNIX API | Microsoft Windows API |
---|---|---|
Header file inclusion | #include <dlfcn.h> | #include <windows.h> |
Definitions for header | dl | Kernel32.dll |
Loading the library | dlopen | LoadLibrary, LoadLibraryEx |
Extracting contents | dlsym | GetProcAddress |
Unloading the library | dlclose | FreeLibrary |
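The POSIX calls in the table fit together roughly as shown below. This is a hedged sketch rather than a reference implementation: call_from_library and the unary_math_fn typedef are hypothetical names, and the code assumes the requested symbol really is a function taking and returning a double, which dlsym cannot check.

```c
#include <dlfcn.h>
#include <stdio.h>

/* Assumed signature of the function we expect to find in the library. */
typedef double (*unary_math_fn)(double);

/*
 * Load a library, look up a function, call it, and unload the library.
 * Returns 0 on success and stores the result in *result; returns -1 on
 * failure. (Hypothetical helper, for illustration only.)
 */
int call_from_library(const char *library, const char *symbol,
                      double argument, double *result)
{
    void *handle = dlopen(library, RTLD_LAZY);
    if (handle == NULL) {
        const char *msg = dlerror();
        fprintf(stderr, "dlopen failed: %s\n", msg ? msg : "unknown error");
        return -1;
    }

    /* This void* to function-pointer cast is what POSIX expects to work,
       although ISO C itself does not guarantee it. */
    unary_math_fn fn = (unary_math_fn)dlsym(handle, symbol);
    if (fn == NULL) {
        const char *msg = dlerror();
        fprintf(stderr, "dlsym failed: %s\n", msg ? msg : "symbol is NULL");
        dlclose(handle);
        return -1;
    }

    *result = fn(argument);   /* call while the library is still loaded */
    dlclose(handle);
    return 0;
}
```

On a Linux system using glibc, for instance, call_from_library("libm.so.6", "cos", 0.0, &r) would load the math library and call its cos function; the library file name is specific to that platform and is an assumption here.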
Loading the library is accomplished with LoadLibrary or LoadLibraryEx on Windows and with dlopen on UNIX-like operating systems. Examples follow:
On a UNIX-like operating system that uses .so shared objects (such as Linux or Solaris):

    void* sdl_library = dlopen("libSDL.so", RTLD_LAZY);
    if (sdl_library == NULL) {
        // report error ...
    } else {
        // use the result in a call to dlsym
    }
On Mac OS X, as a UNIX library:

    void* sdl_library = dlopen("libsdl.dylib", RTLD_LAZY);
    if (sdl_library == NULL) {
        // report error ...
    } else {
        // use the result in a call to dlsym
    }
As an OS X Framework:

    void* sdl_library = dlopen("/Library/Frameworks/SDL.framework/SDL", RTLD_LAZY);
    if (sdl_library == NULL) {
        // report error ...
    } else {
        // use the result in a call to dlsym
    }
On Windows:

    // TEXT() keeps the call valid whether or not UNICODE is defined
    HMODULE sdl_library = LoadLibrary(TEXT("SDL.dll"));
    if (sdl_library == NULL) {
        // report error ...
    } else {
        // use the result in a call to GetProcAddress
    }
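The "report error" branch in the dlopen examples is usually filled in with dlerror, which returns a human-readable description of the most recent failure. The sketch below shows one way to capture that message on a POSIX system; open_library and its buffer parameters are hypothetical names chosen for illustration.

```c
#include <dlfcn.h>
#include <stdio.h>

/*
 * Open a library and, on failure, copy the loader's diagnostic into
 * errbuf. Returns the handle, or NULL on failure.
 * (Hypothetical helper, for illustration only.)
 */
void *open_library(const char *path, char *errbuf, size_t errbuf_size)
{
    /* RTLD_NOW resolves all undefined symbols before dlopen returns, so
       unresolved references show up here rather than at first use. */
    void *handle = dlopen(path, RTLD_NOW);
    if (handle == NULL && errbuf != NULL && errbuf_size > 0) {
        const char *message = dlerror();   /* also clears the error state */
        snprintf(errbuf, errbuf_size, "%s",
                 message != NULL ? message : "unknown dlopen error");
    }
    return handle;
}
```

The message is copied immediately because the string returned by dlerror may be overwritten by a later call into the dl functions.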
Extracting the contents of a dynamically loaded library is achieved with GetProcAddress on Windows and with dlsym on UNIX-like operating systems.
On UNIX-like operating systems:

    void* initializer = dlsym(sdl_library, "SDL_Init");
    if (initializer == NULL) {
        // report error ...
    } else {
        // cast initializer to its proper type and use
    }
On Windows:

    FARPROC initializer = GetProcAddress(sdl_library, "SDL_Init");
    if (initializer == NULL) {
        // report error ...
    } else {
        // cast initializer to its proper type and use
    }
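With dlsym, a NULL return value does not always mean failure, because a symbol's value can itself be NULL. The idiom described in the POSIX documentation is to clear the error state with dlerror, call dlsym, and then check dlerror again. A minimal sketch follows; find_symbol is a hypothetical wrapper name, not part of the dl API.

```c
#include <dlfcn.h>
#include <stddef.h>

/*
 * Look up `name` in `handle`. Returns 0 and stores the address in
 * *address on success; returns -1 if the symbol could not be found.
 * (Hypothetical helper, for illustration only.)
 */
int find_symbol(void *handle, const char *name, void **address)
{
    dlerror();                          /* clear any stale error state */
    void *result = dlsym(handle, name);
    const char *error = dlerror();      /* non-NULL only if dlsym failed */
    if (error != NULL) {
        return -1;
    }
    *address = result;                  /* may legitimately be NULL */
    return 0;
}
```

For function lookups, where a NULL address would be useless anyway, the simpler comparison against NULL is usually sufficient.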
The result of dlsym() or GetProcAddress() has to be converted to a pointer of the appropriate type before it can be used.
In the Windows case, the conversion is straightforward, since FARPROC is essentially already a function pointer:
    typedef INT_PTR (*FARPROC)(void);
This can be problematic when the address of an object is to be retrieved rather than a function. However, usually one wants to extract functions anyway, so this is normally not a problem.
    typedef void (*sdl_init_function_type)(void);
    sdl_init_function_type init_func = (sdl_init_function_type) initializer;
According to the POSIX specification, the result of dlsym() is a void pointer (void*). This is arguably a defect in the specification, since neither ISO C nor ISO C++ guarantees that object pointers and function pointers can be converted into one another (in fact, a function pointer is not even required to have the same size as an object pointer). Strictly speaking, therefore, those language standards offer no conversion between type void* and a pointer to a function that is guaranteed to be valid.
On most systems in use today, function and object pointers are de facto convertible. The following code snippet demonstrates one workaround that makes it possible to perform the conversion anyway on many systems:
    typedef void (*sdl_init_function_type)(void);
    sdl_init_function_type init_func;
    // store the void* through a void** view of the function pointer
    *(void**)(&init_func) = initializer;
The above snippet will give a warning on some compilers: "warning: dereferencing type-punned pointer will break strict-aliasing rules". Another workaround is:
    typedef void (*sdl_init_function_type)(void);
    union {
        sdl_init_function_type func;
        void * obj;
    } alias;
    alias.obj = initializer;
    sdl_init_function_type init_func = alias.func;
This avoids the warning even if strict aliasing is in effect. It makes use of the fact that reading from a different union member than the one most recently written to (called "type punning") is common, and explicitly allowed even if strict aliasing is in force, provided the memory is accessed through the union type directly.[6] However, this is not strictly the case here, since the function pointer is copied to be used outside the union.
The fact remains that any conversion between function and object pointers has to be regarded as an (inherently non-portable) implementation extension, and that no "correct" way for a direct conversion exists, since in this regard the POSIX and ISO standards contradict each other.
Because of this problem, the POSIX documentation on dlsym() (Issue 6) stated that "a future version may either add a new function to return function pointers, or the current interface may be deprecated in favor of two new functions: one that returns data pointers and the other that returns function pointers".[7] However, the later version of the standard (Issue 7, 2008) simply states that function pointers have to be convertible to void* for POSIX compliance,[8] leaving compiler makers to choose which standard they adhere to.
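Another way to express the same implementation assumption, one not described in the text above, is to copy the pointer's bytes with memcpy and let the compiler reject platforms where the two pointer kinds differ in size. This is only a sketch: to_unary_math_fn and unary_math_fn are hypothetical names, and the result is still meaningful only on systems where object and function pointers share a representation.

```c
#include <dlfcn.h>
#include <string.h>

/* Assumed signature of the function being looked up. */
typedef double (*unary_math_fn)(double);

/* Refuse to compile where the two pointer kinds differ in size. */
_Static_assert(sizeof(void *) == sizeof(unary_math_fn),
               "object and function pointers differ in size");

/*
 * Copy the bytes of an object pointer into a function pointer.
 * This avoids a cast, but it still assumes that both pointer kinds use
 * the same representation. (Hypothetical helper, for illustration only.)
 */
unary_math_fn to_unary_math_fn(void *address)
{
    unary_math_fn fn;
    memcpy(&fn, &address, sizeof fn);
    return fn;
}
```

Like the cast and the union, this does not make the conversion portable under ISO C; it merely makes the size assumption explicit and checked at compile time.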
If the contents of the library can be changed (i.e. in the case of a custom library), a pointer to the function can be exported in addition to the function itself. Since a pointer to a function pointer is itself an object pointer, this pointer can always be legally retrieved by a call to dlsym() and subsequent conversion. However, this approach requires maintaining separate pointers to all functions that are to be used externally, and the benefits are usually small.
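The exported-pointer approach might look like the sketch below. All names (plugin_add, plugin_add_ptr, call_exported_add) are hypothetical. In practice the "library side" would be compiled into the custom shared library; both sides are shown in one file here only for brevity.

```c
#include <dlfcn.h>
#include <stddef.h>

typedef int (*add_fn)(int, int);

/* --- Library side (hypothetical custom library) ------------------- */
int plugin_add(int a, int b)
{
    return a + b;
}

/* An ordinary data object that holds the function's address. */
add_fn plugin_add_ptr = plugin_add;

/* --- Application side --------------------------------------------- */
/*
 * Fetch the exported pointer object and call through it.
 * Returns 0 on success, -1 if the symbol is not visible via `handle`.
 */
int call_exported_add(void *handle, int a, int b, int *sum)
{
    /* dlsym returns the address of the pointer object, so this is a
       conversion from one object pointer type to another. */
    add_fn *slot = (add_fn *)dlsym(handle, "plugin_add_ptr");
    if (slot == NULL || *slot == NULL) {
        return -1;
    }
    *sum = (*slot)(a, b);
    return 0;
}
```

If this file is built as a single executable and the handle comes from dlopen(NULL, ...), the lookup succeeds only when the executable's own symbols are exported to the dynamic symbol table (with the GNU toolchain, by linking with -rdynamic); otherwise call_exported_add returns -1.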
Loading a library causes memory to be allocated; the library must be unloaded in order to avoid a memory leak. Additionally, failure to unload a library can prevent filesystem operations on the file which contains the library. Unloading the library is accomplished with FreeLibrary on Windows and with dlclose on UNIX-like operating systems. However, unloading a DLL can lead to program crashes if objects in the main application refer to memory allocated within the DLL. For example, if a DLL introduces a new class and the DLL is closed, further operations on instances of that class from the main application will likely cause a memory access violation. Likewise, if the DLL introduces a factory function for instantiating dynamically-loaded classes, calling or dereferencing that function after the DLL is closed leads to undefined behaviour.
dlclose(sdl_library);
FreeLibrary(sdl_library);
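One way to reduce the risk of calling into an unloaded library is to keep the handle and every pointer obtained from it in a single structure, and to clear them all when the library is closed. The sketch below assumes a POSIX system; struct plugin, plugin_open and plugin_close are hypothetical names introduced for illustration.

```c
#include <dlfcn.h>
#include <stddef.h>

/* Assumed signature of the function taken from the library. */
typedef double (*unary_math_fn)(double);

/* A loaded library together with the pointers obtained from it. */
struct plugin {
    void *handle;
    unary_math_fn compute;
};

/* Returns 0 on success, -1 on failure (the struct is left cleared). */
int plugin_open(struct plugin *p, const char *library, const char *symbol)
{
    p->handle = NULL;
    p->compute = NULL;

    void *handle = dlopen(library, RTLD_NOW);
    if (handle == NULL) {
        return -1;
    }
    unary_math_fn fn = (unary_math_fn)dlsym(handle, symbol);
    if (fn == NULL) {
        dlclose(handle);
        return -1;
    }
    p->handle = handle;
    p->compute = fn;
    return 0;
}

/* Unload the library and clear every pointer that referred into it. */
void plugin_close(struct plugin *p)
{
    if (p->handle != NULL) {
        dlclose(p->handle);
    }
    p->handle = NULL;
    p->compute = NULL;
}
```

After plugin_close, an accidental call through p->compute dereferences a null pointer instead of jumping to an address that may no longer be mapped. This does not help with copies of the pointer kept elsewhere, which still have to be tracked by the application.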
Both Windows and UNIX implementations of dynamic loading allow programmers to extract symbols from the currently executing process. In both of these APIs, the currently executing process can be "loaded" such that the result can be used in the same manner as the result from dynamically loading a library with LoadLibrary or dlopen.
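On UNIX-like operating systems:

    // the mode argument must include either RTLD_LAZY or RTLD_NOW
    void* this_process = dlopen(NULL, RTLD_LAZY);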
On Windows:

    HMODULE this_process;
    GetModuleHandleEx(0, NULL, &this_process);
    HMODULE this_process_again = GetModuleHandle(NULL);
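A handle for the running process can then be passed to dlsym like any other. The sketch below looks up the C library's atoi that way. It assumes a dynamically linked program on a system (such as Linux) where the handle returned by dlopen(NULL, ...) also searches the shared objects loaded at program startup; atoi_via_self and atoi_fn are hypothetical names.

```c
#include <dlfcn.h>
#include <stddef.h>

typedef int (*atoi_fn)(const char *);

/*
 * Look up atoi through a handle for the running program and call it.
 * Returns 0 on success, -1 on failure.
 * (Hypothetical helper, for illustration only.)
 */
int atoi_via_self(const char *text, int *value)
{
    void *self = dlopen(NULL, RTLD_LAZY);   /* handle for the main program */
    if (self == NULL) {
        return -1;
    }

    atoi_fn fn = (atoi_fn)dlsym(self, "atoi");
    if (fn == NULL) {
        dlclose(self);
        return -1;
    }

    *value = fn(text);
    dlclose(self);
    return 0;
}
```

Symbols defined in the executable itself are found this way only if they appear in its dynamic symbol table, which is why the example looks up a function from a shared library instead.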
In the Java programming language, classes can be dynamically loaded using the ClassLoader object. For example:
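    Class<?> type = ClassLoader.getSystemClassLoader().loadClass(name);
    // Class.newInstance() is deprecated since Java 9; go through the constructor instead
    Object obj = type.getDeclaredConstructor().newInstance();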