In this thesis, we work on the task of Text Spotting within the field of Computer Vision. We propose new algorithms, methods, and datasets to detect, recognize, and enhance text character sequences found within images, motivated by the need for information retrieval in systems that cannot crawl or access such information by any means other than its graphical representation. Driven by our collaboration with the Spanish National Cybersecurity Institute (INCIBE), we focus our research on recovering character sequences found in visual media from both darknet and industrial sources. We intend to support INCIBE products and services related to cybersecurity that monitor potential illegal activities and critical infrastructures.

To improve scene text recognition performance, we analyze images in terms of their irregularity, since some methods claim robustness on datasets that contain a large proportion of irregular text. After building a classification model for these categories, we created a new dataset, the Fully Irregular Text (FIT-Text) dataset, composed primarily of irregular images, so that other methods oriented to this problem can use it to evaluate their performance.

We propose a new performance metric, the Contained-Levenshtein (C-Lev) accuracy. Scene text recognizers in the literature have traditionally reported both accuracy and normalized edit distance as performance metrics, but never combined the two into a single, effective metric that helps discern between severe and low-priority mistakes. C-Lev also serves as a label-checking tool, helping methods stay robust against minor human-generated labeling errors.

To increase scene text recognition accuracy, we propose integrating string-distance measurements as components of the loss functions of both CTC and Attention recognizers.
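The exact C-Lev formulation is given later in the thesis; the following is only an illustrative sketch of how accuracy and normalized edit distance can be combined into one score. The containment rule and the fallback to 1 minus the normalized edit distance are assumptions made for illustration, not the thesis's definitive metric.

```python
def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # deletion
                           cur[j - 1] + 1,              # insertion
                           prev[j - 1] + (ca != cb)))   # substitution
        prev = cur
    return prev[-1]


def c_lev_score(pred: str, label: str) -> float:
    """Hypothetical C-Lev-style score: full credit for an exact match or
    when the label is contained in the prediction (e.g. a minor labeling
    error cropped part of the word); otherwise fall back to
    1 - normalized edit distance, clipped at 0, so small mistakes are
    penalized less severely than completely wrong predictions."""
    if pred == label or label in pred:
        return 1.0
    dist = levenshtein(pred, label)
    return max(0.0, 1.0 - dist / max(len(pred), len(label), 1))
```

Under these assumptions, a one-character typo keeps most of its score, while an unrelated prediction scores near zero, which is the kind of severity-aware behavior the combined metric is meant to capture.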
Testing various distances as the proposed weight, we find the Hamming distance the most beneficial, with a total improvement of over 6% accuracy on literature datasets.

For scene text detectors, we propose the Text Density Distribution (TDD), a new metric that assigns value to scene text images according to their documented regions, classifying visual media by the number and spatial distribution of their region clusters. We also propose using this metric to select reduced training subsets for scene text detectors, lowering their computational cost while preserving performance. We observe that the detection F1 score drops by only 4% when using less than...
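The TDD metric itself is defined later in the thesis; as a rough sketch of the idea, the snippet below classifies an image by how many annotated text regions it has and how spread out their centres are. The bin boundaries, the spread measure, and the box format are all illustrative assumptions.

```python
import statistics


def tdd_class(boxes, sparse_max=2, moderate_max=8):
    """Hypothetical TDD-style descriptor.

    `boxes` is a list of (x_min, y_min, x_max, y_max) tuples in
    normalized [0, 1] image coordinates. Returns a coarse density label
    plus a spread score (0 means all regions sit in one tight cluster).
    Thresholds are placeholders, not the thesis's actual bins.
    """
    if not boxes:
        return ("empty", 0.0)
    centres = [((x0 + x1) / 2, (y0 + y1) / 2) for x0, y0, x1, y1 in boxes]
    xs, ys = zip(*centres)
    # Spread: mean population std-dev of the centre coordinates.
    spread = (statistics.pstdev(xs) + statistics.pstdev(ys)) / 2
    if len(boxes) <= sparse_max:
        density = "sparse"
    elif len(boxes) <= moderate_max:
        density = "moderate"
    else:
        density = "dense"
    return (density, round(spread, 3))
```

A subset-selection scheme could then sample a fixed quota of images per (density, spread) class instead of training on the full dataset, which is one plausible way to trade training cost against detection performance.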