2005
DOI: 10.1145/1080695.1069982
|View full text |Cite
|
Sign up to set email alerts
|

An Integrated Memory Array Processor Architecture for Embedded Image Recognition Systems

Abstract: Embedded processors for video image recognition require to address both the cost (

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2008
2008
2014
2014

Publication Types

Select...
3
3
1

Relationship

1
6

Authors

Journals

citations
Cited by 22 publications
(16 citation statements)
references
References 28 publications
0
16
0
Order By: Relevance
“…Its type and number of operators -from 4 to 128 -can be customized at the time of chip design. For the automotive market, NEC has devised the ImapCar processor (Kyo, 2005) containing 128 SIMD -Single Instruction Multiple Data means that every processor executes the same instruction on different data, for example each processor do the same job on each pixel of an image -parallel arithmetic and logic units with a power consumption of more than one Watt. Xetal also proposes a programmable, massively-parallel processor integrating 320 computing units (Abbo, 2008).…”
Section: Programmable Architecturesmentioning
confidence: 99%
“…Its type and number of operators -from 4 to 128 -can be customized at the time of chip design. For the automotive market, NEC has devised the ImapCar processor (Kyo, 2005) containing 128 SIMD -Single Instruction Multiple Data means that every processor executes the same instruction on different data, for example each processor do the same job on each pixel of an image -parallel arithmetic and logic units with a power consumption of more than one Watt. Xetal also proposes a programmable, massively-parallel processor integrating 320 computing units (Abbo, 2008).…”
Section: Programmable Architecturesmentioning
confidence: 99%
“…Parallelizing methods are designed based on the idea of sweeping PUL in various ways across the 2-D memory plane [7]. As described in the previous subsection, the one RAM/block per PE configuration enables each PE to access a different memory address at the same time, thus enabling sweeping of the PUL in various ways across the 2-D memory plane.…”
Section: Programming Modelmentioning
confidence: 99%
“…The performance of major image processing tasks and vision applications implemented on IMAPCAR by using several standard parallelizing techniques [7] are evaluated by comparing 1DC code running on a 100 MHz IMAPCAR with C code running on a 2.4 GHz Intel P4, a GPP (general purpose processor). As shown in Figure 11, the first benchmark, which consists of various image processing tasks, IMAPCAR achieves a performance up to 8 times better with a speedup approximately proportional to the amount of inherent parallelism of each task.…”
Section: Performance and Power Consumption Evaluationmentioning
confidence: 99%
“…By avoiding cache coherence overheads as well as cache indeterminacy, the shared L2 TCDM can be used as a frame buffer for video processing which should deal with a large amount of data within a tightly bounded time [7,8]. The fully combinational Mesh-of-Trees (MoT) interconnection network proposed in [5] is suitable for this shared multi-banked L2 TCDM with high throughput and low memory access latency.…”
Section: Introductionmentioning
confidence: 99%