“…These methods fail to produce word- or line-level detections and can only be used in conjunction with standalone text detectors, increasing the complexity of the pipeline. Another branch of work [54] takes a hierarchical view and applies graph-based models at the finest granularity, i.e., individual words, to analyze the layout.

1,000 0 500 4.4/6.5K
Total-Text [10] 1,255 0 300 7.4/11K
CTW1500 [60] 1,000 0 500 6.7/10K
MSRA-TD500 [59] 300 0 200 6.9/3.5K
IC17 MLT [38] 7,200 1,800 9,000 9.5/85K
IC19 MLT [37] 10,000 0 10,000 8.9/89K
IC19 LSVT [49] 30,000 0 20,000 8.1/243K
IC19 ArT [11] 5,603 0 4,563 8.9/50K
TextOCR [48] 21,778 3,124 3,232 32.1/903K
Intel OCR [22] 191,059
…”