Boehm garbage collector

Boehm-Demers-Weiser Garbage Collector
Written in C and C++
Type garbage collector
License similar to X11 (free software)
Website http://www.hpl.hp.com/personal/Hans_Boehm/gc/

In computer science, the Boehm-Demers-Weiser garbage collector, often simply known as Boehm GC, is a conservative garbage collector for C and C++.

Boehm GC is free software distributed under a permissive free software licence similar to the X11 license.

Contents

Design

The developer describes the operation of the collector as follows:[1]

The collector uses a mark-sweep algorithm. It provides incremental and generational collection under operating systems which provide the right kind of virtual memory support. (Currently this includes SunOS[45], IRIX, OSF/1, Linux, and Windows, with varying restrictions.) It allows finalization code to be invoked when an object is collected. It can take advantage of type information to locate pointers if such information is provided, but it is usually used without such information.

Boehm GC can also run in test mode in which memory management is still done manually, but the Boehm GC can check if it is done properly. In this way a programmer can find memory leaks and double deallocations.

Boehm GC is also distributed with a C string handling library called cords. This is similar to ropes in C++ (strings are trees of small arrays, and they never change), but instead of using reference counting for proper deallocation, it relies on garbage collection to free objects. Cords are good at handling very large texts, modifications to them in the middle, slicing, concatenating, and keeping history of changes (undo/redo functionality).

Operation

The garbage collector works with most unmodified C programs, simply by replacing malloc() with GC_MALLOC() calls, replacing realloc() with GC_REALLOC() calls, and removing free() calls. The code piece below shows how one can use Boehm instead of traditional malloc and free in C.[2] An annotated version passing Splint memory leak checks is available at GitHub.[3]

#include <assert.h>
#include <stdio.h>
#include <gc.h>
 
int main(void)
{
    int i;
 
    GC_INIT();
    for (i = 0; i < 10000000; ++i)
    {
        int **p = (int**)GC_MALLOC(sizeof(int *));
        int *q = (int*)GC_MALLOC_ATOMIC(sizeof(int));
 
        assert(*p == 0);
        *p = (int*)GC_REALLOC(q, 2 * sizeof(int));
        if (i % 100000 == 0)
            printf("Heap size = %zu\n", GC_get_heap_size());
    }
 
    return 0;
}

Uses and ports

The Boehm GC is used by many projects that are implemented in C or C++, as well as by runtime environments for a number of other languages, including the GNU Compiler for Java runtime environment, the Portable.NET project, LLVM, GNU D Compiler, Embeddable Common Lisp and the Mono implementation of the Microsoft .NET platform (the last is also using precise compacting GC since version 2.8). It supports numerous operating systems, including many Unix variants (such as Mac OS X) and Microsoft Windows, and provides a number of advanced features including incremental collection, parallel collection and a variety of finalizer semantics. Boehm GC was ported with small changes to the D programming language and is part of Digital Mars D compiler's standard runtime library called Phobos (this differs to other usage, because other runtimes use an unmodified C version).

References

External links