2021
DOI: 10.1007/s10766-020-00690-y
|View full text |Cite
|
Sign up to set email alerts
|

iDocChip: A Configurable Hardware Architecture for Historical Document Image Processing

Abstract: In recent years, $$\hbox {optical character recognition (OCR)}$$ optical character recognition (OCR) systems have been used to digitally preserve historical archives. To transcribe historical archives into a machine-readable form, first, the documents are scanned, then an $$\hbox {OCR}$$ OCR is applied. In order to digitize documents without the need to remove them from where they are archived, it is valuable to h… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(10 citation statements)
references
References 32 publications
(47 reference statements)
0
10
0
Order By: Relevance
“…4. Resource utilization of the hardware implementation for the end-to-end OCR iDocChip system (this work) compared to the total resource utilization of the previous separately implemented pipeline steps [26][27][28][29] The original anyOCR is a Python-based software that uses the multi-dimensional image processing library [84] and runs on a multi-threaded Intel ® Core™ i7-4790T with Turbo Boost up to 3.9 GHz for one active core and 2.7 GHz for four active cores. For further analysis, the runtime and energy efficiency of the optimized software implementations are examined on different platforms; see Table 5.…”
Section: Experimental Setup and Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…4. Resource utilization of the hardware implementation for the end-to-end OCR iDocChip system (this work) compared to the total resource utilization of the previous separately implemented pipeline steps [26][27][28][29] The original anyOCR is a Python-based software that uses the multi-dimensional image processing library [84] and runs on a multi-threaded Intel ® Core™ i7-4790T with Turbo Boost up to 3.9 GHz for one active core and 2.7 GHz for four active cores. For further analysis, the runtime and energy efficiency of the optimized software implementations are examined on different platforms; see Table 5.…”
Section: Experimental Setup and Resultsmentioning
confidence: 99%
“…However, the software part of this hybrid accelerator is very inefficient due to the sequential and time-consuming union-find algorithm. Hence, in [27], we have presented a new optimized heterogeneous architecture based on an improved text and image segmentation algorithm. The resulting accelerator has reduced the runtime and improved the energy efficiency of the previous accelerator stated in [30] by 40% and 46%, respectively.…”
Section: Text and Image Segmentationmentioning
confidence: 99%
See 3 more Smart Citations