Reverse-engineering bar charts using neural networks

Zhou, Fangfang; Zhao, Yong; Chen, Wenjiang; Tan, Yijing; Xu, Yanwei; Chen, Yi; Liu, Chao; Zhao, Ying

doi:10.1007/s12650-020-00702-6

Cited by 21 publications

(21 citation statements)

References 42 publications

“…The author uses custom CNN architecture and achieves average classification accuracy of 97%. Bar charts are researched in BarChartAnalyzer [18] and by Zhou et al [19]. BarChartAnalyzer uses CNN that classifies the bar chart into seven subtypes (simple bar, grouped bar, stacked bar, and a combination of different orientations).…”

Section: Related Work (mentioning)

confidence: 99%

Chart Classification Using Siamese CNN

Bajić and Job, 2021, J. Imaging

When recovering information from a chart image, the first step should be chart type classification. Many approaches have been used over time, and some achieve better results than others. The latest articles use a Support Vector Machine (SVM) in combination with a Convolutional Neural Network (CNN), which achieves almost perfect results with datasets of a few thousand images per class. The datasets containing chart images are primarily synthetic and lack real-world examples. To our knowledge, this is the first report of using a Siamese CNN architecture for chart type classification to overcome the problem of small datasets. Multiple network architectures are tested, and the results of different dataset sizes are compared. The network verification is conducted using Few-shot learning (FSL). Many of the described advantages of Siamese CNNs are shown in examples. In the end, we show that the Siamese CNN can work with one image per class, and a 100% average classification accuracy is achieved with 50 images per class, whereas the CNN achieves an average classification accuracy of only 43% for the same dataset.

“…The average classification accuracy is 85%. In [19] authors proposed a new method for extracting textual and numerical information from bar charts. For textual information, extraction Region-based CNN combined with Tesseract Optical Character Recognition (OCR) engine is used.…”

Section: Related Work (mentioning)

confidence: 99%

Chart Classification Using Siamese CNN

Bajić and Job, 2021, J. Imaging

“…ReVision [39] used a combination of feature identification and patch clustering to not only classify figures but also reverse-engineer their data to enable re-visualization to other chart formats, and Jung et al and Dai et al [2,16] similarly trained classifiers to categorize published charts in addition to extracting features and text. Last year, Fangfang Zhou et al [53] took a fully neural network-based approach to interpreting bar charts, using a Faster-RCNN [36] to locate and classify textual chart elements, and an attentional encoder-decoder to extract numerical information. To our knowledge the prior work focuses entirely on 2D charts, leaving the problem of interpreting 3D surface plots like that in Figure 2 unaddressed.…”

Section: Related Work (mentioning)

confidence: 99%

Toward Automatic Interpretation of 3D Plots

Brandt and Freeman, 2021, Preprint

This paper explores the challenge of teaching a machine how to reverse-engineer the grid-marked surfaces used to represent data in 3D surface plots of two-variable functions. These are common in scientific and economic publications; and humans can often interpret them with ease, quickly gleaning general shape and curvature information from the simple collection of curves. While machines have no such visual intuition, they do have the potential to accurately extract the more detailed quantitative data that guided the surface's construction. We approach this problem by synthesizing a new dataset of 3D grid-marked surfaces (SurfaceGrid) and training a deep neural net to estimate their shape. Our algorithm successfully recovers shape information from synthetic 3D surface plots that have had axes and shading information removed, been rendered with a variety of grid types, and viewed from a range of viewpoints.

“…Recent interest in automatic document processing and conversion, such as in summarization and question answering tasks, has increased the importance of the extraction of underlying tabular data from chart images embedded in the converted documents. Chart analysis methods have evolved substantially in recent years from human-in-the-loop platforms relying on manual annotations [8,15], through early data extraction algorithms [2], hybrid neural-algorithmic pipelines [7,13], to end-to-end processing by a neural network [9,12,16].…”

Section: Introduction (mentioning)

confidence: 99%

“…Commonly a two-stage approach is used, first detecting the chart regions in the documents, and then applying some data extraction process to the detected charts. While the scope of the detection stage can be quite wide, including many types of charts [7], current tabular data extraction systems are mostly limited to the bar charts [2,7,9,16], with few exceptions. One of the possible reasons is that standard object detectors, employed in recent works, better cope with (and enable easy inference from) objects like rectangular bars and text elements, less so with pie segments, while elements like line or area plots defy handling by box proposals.…”

Section: Introduction (mentioning)

confidence: 99%

CHARTER: heatmap-based multi-type chart data extraction

Shtok, Harary, Azulai, et al., 2021, Preprint

The digital conversion of information stored in documents is a great source of knowledge. In contrast to document text, the conversion of embedded document graphics, such as charts and plots, has been much less explored. We present a method and a system for end-to-end conversion of document charts into a machine-readable tabular data format, which can be easily stored and analyzed in the digital domain. Our approach extracts and analyzes charts along with their graphical elements and supporting structures such as legends, axes, titles, and captions. Our detection system is based on neural networks, trained solely on synthetic data, eliminating the limiting factor of data collection. As opposed to previous methods, which detect graphical elements using bounding boxes, our networks feature auxiliary domain-specific heatmap prediction, enabling the precise detection of pie charts, line plots, and scatter plots, which do not fit the rectangular bounding-box presumption. Qualitative and quantitative results show high robustness and precision, improving upon previous works on popular benchmarks.
