2023
DOI: 10.1109/access.2023.3256723
|View full text |Cite
|
Sign up to set email alerts
|

Quantifying the Effects of Ground Truth Annotation Quality on Object Detection and Instance Segmentation Performance

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 33 publications
0
2
0
Order By: Relevance
“…Two fundamentally different HTR models that are both considered state of the art in HTR, are used for handwriting recognition. Py-Laia [19] is a more traditional CRNN, with a stack of convolutions, 2 This only applies to the end of line similar to VGG [23], followed by a bidirectional LSTM [11] and a Connectionist Temporal Classification (CTC) [10] for the decoding part. TrOCR decided to combine a pre-trained Vision Transformer (ViT) [7] with a pre-trained language model, such as BERT [6], and since they are sharing the Transformer architecture, they can easily be combined into one model.…”
Section: Handwriting Recognitionmentioning
confidence: 99%
See 1 more Smart Citation
“…Two fundamentally different HTR models that are both considered state of the art in HTR, are used for handwriting recognition. Py-Laia [19] is a more traditional CRNN, with a stack of convolutions, 2 This only applies to the end of line similar to VGG [23], followed by a bidirectional LSTM [11] and a Connectionist Temporal Classification (CTC) [10] for the decoding part. TrOCR decided to combine a pre-trained Vision Transformer (ViT) [7] with a pre-trained language model, such as BERT [6], and since they are sharing the Transformer architecture, they can easily be combined into one model.…”
Section: Handwriting Recognitionmentioning
confidence: 99%
“…In related fields, several studies have been conducted to investigate the impact of ground truth quality on deep learning, for example in the context of object detection [2,13], text-line segmentation [3,22], and semantic segmentation [20,25] in natural images or historical document images. However, the problems encountered for HTR are specific and to the best of our knowledge, there are currently no comprehensive studies on the impact of ground-truth quality for deep learning-based HTR.…”
Section: Introductionmentioning
confidence: 99%