A Real-World Approach on the Problem of Chart Recognition Using Classification, Detection and Perspective Correction

Araújo, Tiago; Chagas, Paulo; Alves, João; Santos, Carlos Gustavo Resque dos; Santos, Beatriz Sousa; Meiguins, Bianchi Serique

doi:10.3390/s20164370

Cited by 13 publications

(6 citation statements)

References 43 publications

(82 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…CNNs can also be used out-of-the-box; some are available as pre-trained models. Pre-trained models are trained on large datasets (e.g., ImageNet [ 32 ]), and to use them, only the final layers need to be changed and retrained as in Reverse-Engineering Visualizations [ 4 ], FigureSeer [ 33 ], Chart-Text [ 34 ], VizByWiki [ 24 ], Visualizing for the Non-Visual [ 25 ], DocFigure [ 26 ], by Huang [ 35 ] and by Arujo et al [ 36 ]. Common to both architectures is the average classification accuracy, in the range from 90% to 100%.…”

Section: Related Workmentioning

confidence: 99%

A Multi-Purpose Shallow Convolutional Neural Network for Chart Images

Bajić

Orel

Habijan

2022

Sensors

View full text Add to dashboard Cite

Charts are often used for the graphical representation of tabular data. Due to their vast expansion in various fields, it is necessary to develop computer algorithms that can easily retrieve and process information from chart images in a helpful way. Convolutional neural networks (CNNs) have succeeded in various image processing and classification tasks. Nevertheless, the success of training neural networks in terms of result accuracy and computational requirements requires careful construction of the network layers’ and networks’ parameters. We propose a novel Shallow Convolutional Neural Network (SCNN) architecture for chart-type classification and image generation. We validate the proposed novel network by using it in three different models. The first use case is a traditional SCNN classifier where the model achieves average classification accuracy of 97.14%. The second use case consists of two previously introduced SCNN-based models in parallel, with the same configuration, shared weights, and parameters mirrored and updated in both models. The model achieves average classification accuracy of 100%. The third proposed use case consists of two distinct models, a generator and a discriminator, which are both trained simultaneously using an adversarial process. The generated chart images are plausible to the originals. Extensive experimental analysis end evaluation is provided for the classification task of seven chart classes. The results show that the proposed SCNN is a powerful tool for chart image classification and generation, comparable with Deep Convolutional Neural Networks (DCNNs) but with higher efficiency, reduced computational time, and space complexity.

show abstract

Section: Related Workmentioning

confidence: 99%

A Multi-Purpose Shallow Convolutional Neural Network for Chart Images

Bajić

Orel

Habijan

2022

Sensors

View full text Add to dashboard Cite

show abstract

“…There is a lack of works in the literature linking real-world photos with the task of labeling charts before labeling. There are many issues to solve, such as locating charts in images and removing camera distortions [20].…”

Section: The Modern State Of Ocr Processing For Technical Documentsmentioning

confidence: 99%

Огляд Інструментів Ocr Для Завдання Розпізнавання Таблиць І Графіків У Документах

Ярошенко

2022

ІО&ІС

View full text Add to dashboard Cite

У цьому дослідженні представлено огляд інструментів OCR для розпізнавання таблиць документів і графіків. Оцифрування паперових документів має багато переваг як для фізичних осіб, так і для компаній. Для оцифрування потрібно використовувати програмне забезпечення OCR (оптичне розпізнавання символів). Таке програмне забезпечення сканує документи, щоб зробити текст зрозумілим для комп’ютера. Їх можна конвертувати у формати, які підтримуються Microsoft Word або Google Docs. Програмне забезпечення OCR стає радше необхідністю, ніж утилітою для розваг. OCR створює текст із можливістю пошуку та редагування з друкованих документів, а також із відсканованих фотографій або книг і PDF-файлів. Зараз спостерігається активна тенденція до цифровізації документів. Існує великий попит на рішення, які можуть ефективно автоматизувати обробку великого масиву документів з високою точністю. Окремим випадком є обробка PDF-файлів, таких як відскановані документи або створені програмними редакторами. Рішення OCR спрямовані на підвищення ефективності обробки та аналізу цифрових документів за допомогою штучного інтелекту. Цими рішеннями можуть користуватися як державні установи, так і підприємства. Розроблені системи можуть стати цінним доповненням до CRM-систем і можуть бути інтегровані замість існуючих модулів обробки документів або використовуватися як окреме рішення. Хоча існуючі рішення OCR можуть ефективно розпізнавати текст, розпізнавання графічних елементів, таких як діаграми та таблиці, все ще знаходиться на стадії розробки. Рішення, які можуть підвищити точність розпізнавання візуальних даних, можуть бути цінними для обробки технічних документів, таких як наукові, фінансові та інші аналітичні документи. Ключові слова: OCR, файли PDF, FastText, виявлення, розпізнавання, глибоке навчання, технічні документи.

show abstract

“…This section first evaluates and compare the performances of nine popularly used CNN based models in chart classification namely AlexNet [16], VGG-16 [3,10,19], VGG-19 [2], ResNet-50 [5,18], ResNet-101 [8], ResNet-152 [2], Inception-v3 [14], Inception-v4 [5] and Xception [2]. These classifiers are built on our dataset using randomly selected five-fold cross validation.…”

Section: Comparison Of Different Modelsmentioning

confidence: 99%

“…For example, the study in [10] reports to obtain 89% accuracy, whereas study in [3] reports to have obtained only 80% over the same dataset using the same model. Some study uses a real dataset as small as 129 samples [11], and some study uses real dataset as large as 21,099 samples [2]. Motivated by the inconsistencies in observations reported in various studies, this paper aims at identifying potential challenges in building automatic chart classification system.…”

Section: Introductionmentioning

confidence: 99%

Challenges in chart image classification

Thiyam

Singh

Bora

2021

Proceedings of the 21st ACM Symposium on Document Engineering

View full text Add to dashboard Cite

Charts are commonly used forms of visualizing scientific observations from research findings or commercial trends. They provide an abstraction of the underlying information in a more understandable way. Over time, different forms of charts are developed. With the increase in the number of scientific documents present on the internet with different types of charts, automatic chart classification is becoming an important task for various applications. There have been several studies on chart classification with methods ranging from traditional machine learning approaches like SVM, KNN, and HMM to recent deep learning models like VGG, ResNet, and Xception. However, inconsistencies in experimental results are evident. This paper evaluates nine of the recently proposed deep learning-based models on three datasets (one curated and annotated by authors, and two publicly available), and systematically studies their performances over various setups to understand the reason for observing inconsistent results.

show abstract

A Real-World Approach on the Problem of Chart Recognition Using Classification, Detection and Perspective Correction

Cited by 13 publications

References 43 publications

A Multi-Purpose Shallow Convolutional Neural Network for Chart Images

A Multi-Purpose Shallow Convolutional Neural Network for Chart Images

Огляд Інструментів Ocr Для Завдання Розпізнавання Таблиць І Графіків У Документах

Challenges in chart image classification

Contact Info

Product

Resources

About