Intel Threading Building Blocks
From Wikipedia, the free encyclopedia
Intel Threading Building Blocks (also known as TBB) is the name of a C++ template library developed by Intel for writing software programs that take advantage of multi-core processors. The library consists of data structures and algorithms that allow a programmer to avoid some complications arising from the use of native threading packages such as POSIX threads, Windows threads, or the portable Boost Threads in which individual threads of execution are created, synchronized, and terminated manually. Instead the library abstracts access to the multiple processors by allowing the operations to be treated as "tasks," which are allocated to individual cores dynamically by the library's run-time engine, and by automating efficient use of the cache. This approach groups TBB in a family of solutions for parallel programming aiming to decouple the programming from the particulars of the underlying machine.
Contents |
[edit] Implementation
TBB implements "task stealing" to balance a parallel workload across available processing cores in order to increase core utilization and therefore scaling. The TBB task stealing model is similar to the work stealing model applied in Cilk. Initially, the workload is evenly divided among the available processor cores. If one core completes its work while other cores still have a significant amount of work in their queue, TBB reassigns some of the work from one of the busy cores to the idle core. This dynamic capability decouples the programmer from the machine, allowing applications written using the library to scale to utilize the available processing cores with no changes to the source code or the executable program file.
TBB uses templates thereby relying on compile-time polymorphism that can be more temporally efficient than traditional run-time polymorphism since modern C++ compilers are tuned to minimize any abstraction penalty arising from heavy use of templates such as Standard Template Library and TBB.
[edit] Library contents
TBB is a collection of components for parallel programming:
- Basic algorithms:
parallel_for
,parallel_reduce
,parallel_scan
- Advanced algorithms:
parallel_while
,pipeline
,parallel_sort
- Containers:
concurrent_queue
,concurrent_vector
,concurrent_hash_map
- Scalable memory allocation:
scalable_malloc
,scalable_free
,scalable_realloc
,scalable_calloc
,scalable_allocator
,cache_aligned_allocator
- Mutual exclusion:
mutex
,spin_mutex
,queuing_mutex
,spin_rw_mutex
,queuing_rw_mutex
- Atomic operations:
fetch_and_add
,fetch_and_increment
,fetch_and_decrement
,compare_and_swap
,fetch_and_store
- Timing: portable fine grained global time stamp
- Task Scheduler: direct access to control the creation and activation of tasks
[edit] History
Version 1.0 was introduced by Intel on August 29, 2006, the year after the introduction of Intel's first dual-core x86 processor, the Pentium D.
Version 1.1 was introduced on April 10, 2007. This version introduced auto_partitioner which offered an automatic alternative to specifying a grain size parameter to estimate the best granularity for your tasks. This version was added to the Intel C++ Compiler 10.0 with the new Professional Edition later that year on June 5.
Version 2.0 was introduced on July 24, 2007. This version included the release of the source code and the creation of an open source project.[1] The license used for open source is the same as the one used by the GNU Compiler Collection C++ standard library, a GPLv2 with an "runtime exception" (because of being template heavy code that usually becomes part of the executable after compilation). TBB is still available in a commercial version (without source code) with support but with no differences in functionality from the open source version.
Possible future version features were outlined in a posting to the project web site.[2]
Between July 2007 and March 2008, significant development was put into improvements in the TBB container classes (especially concurrent_vector
), and a new algorithm (parallel_do
) was developed. These features were made available in open source TBB development releases during this time period.
The new parallel_do
component is a replacement for parallel_while
, which will eventually be deprecated. The parallel_do
component is structured in a manner that is consistent with the other TBB algorithms (parallel_for
, parallel_reduce
, parallel_scan
), making its application simpler and more intuitive for developers than was the case with parallel_while
.
[edit] Systems supported
The TBB commercial release 2.0 supports Microsoft Windows (XP or newer), Mac OS X (version 10.4.4 or higher) and Linux using compilers Visual C++ (version 7.1 or higher, on Windows OS only), Intel C++ Compiler (version 9.0 or higher) or GNU Compiler Collection (gcc).[3] Additionally, the open source builds of TBB supports Solaris [4] and FreeBSD.
[edit] Open source operating systems
As of March 2008, TBB is available in FreeBSD and has been packaged into the following Linux distributions:
[edit] See also
[edit] Notes
- ^ Thread Building Blocks
- ^ Dave Sekowski (October 10, 2007). Next Major TBB Release Features. Retrieved on 2008-04-07.
- ^ Intel Threading Building Blocks - Release Notes Version 2.0. Retrieved on 2008-04-07.
- ^ Using Intel's Threaded Building Blocks (TBB) With Sun Studio Express. Retrieved on 2008-05-08.
[edit] References
- Reinders, James (2007, July). Intel Threading Building Blocks: Outfitting C++ for Multi-core Processor Parallelism (Paperback) Sebastopol: O'Reilly Media, ISBN 978-0-596-51480-8.
- Voss, M. (2006, October). "Demystify Scalable Parallelism with Intel Threading Building Blocks' Generic Parallel Algorithms."
- Voss, M. (2006, December). "Enable Safe, Scalable Parallelism with Intel Threading Building Blocks' Concurrent Containers."
- Hudson, R. L., B. Saha, et al. (2006, June). "McRT-Malloc: a scalable transactional memory allocator." Proceedings of the 2006 International Symposium on Memory Management. New York: ACM Press, pp. 74-83.
[edit] External links
|