2007
DOI: 10.1535/itj.1104.05

The Foundations for Scalable Multicore Software in Intel Threading Building Blocks

Abstract: This paper describes two features of Intel® Threading Building Blocks (Intel® TBB) [1] that provide the foundation for its robust performance: a work-stealing task scheduler and a scalable memory allocator. Work-stealing task schedulers efficiently balance load while maintaining the natural data locality found in many applications. The Intel TBB task scheduler is available to users directly through an API and is also used in the implementation of the algorithms included in the library. In this paper, we prov…

Citations: cited by 99 publications (63 citation statements)
References: 8 publications
“…In addition to OpenMP 3.0, there are currently several other task parallel languages and libraries available to developers, Microsoft Task Parallel Library [19] for Windows, Intel Thread Building Blocks (TBB) [5], and Intel Cilk++ [4], a younger sibling of Cilk based on C++ rather than C. We will use TBB and Cilk++, along with Cilk, as comparison points for our performance evaluation with OpenMP 3.0.…”
Section: Background and Related Work
mentioning
confidence: 99%
“…In this paper, we compare the performance and scalability of the UTS benchmark on several different OpenMP 3.0 compiler and run time implementations (Intel icc 11, gcc 4.4, Mercurium 1.2, SunStudio 12). For comparison we also examine the performance of the UTS benchmark using Cilk [3], Intel Cilk++ [4], Intel Thread Building Blocks [5]. We also compare with an OpenMP implementation without tasks that performs all scheduling, load balancing, and termination detection explicitly [6].…”
mentioning
confidence: 99%
“…This shows that $x^t_j \le 1$. Since $x^t_j$ is increasing in $t$, it converges to some $x^\infty_j$ that satisfies (11)-(12).…”
Section: Steady State Limit
mentioning
confidence: 99%
“…There are many different libraries that address parallelism in different ways [18,26,21]. For the purposes of this evaluation, we choose STAPL, the Structured Template Adaptive Parallel Library [4], to serve as an exemplar.…”
Section: Evaluation Of Parallelization Via Libraries: STAPL
mentioning
confidence: 99%