“…{ "items": [ { "name": "3002-Kyoto Choco Mochi", "count": 2, "priceInfo": { "unitPrice": 14000, "price": 28000 } }, { "name": "1001 -Choco Bun", "count": 1, "priceInfo": { "unitPrice": 22000 "price": 22000 } }, ... ], "total": [ { "menuqty_cnt": 4, "total_price": 50000 } ] } { "words": [ { "id": 1, "bbox": [[360,2048],..., [355,2127]], "text": "3002-Kyoto" }, { "id": 2, "bbox": [[801,2074],..., [801,2139]], "text": "Choco" }, { "id": 3, "bbox": [[1035,2074],..., [1035,2147]], "text": "Mochi" }, { "id": 4, "bbox": [[761,2172],..., [761,2253]], "text": "14.000" }, …, { "id": 22, "bbox": [[1573,3030],..., [1571,3126]], "text": "50.000" } ] } text information as input and perform their own objectives with the OCR-extracted texts. (Katti et al, 2018;Hwang et al, 2019Hwang et al, , 2020Hwang et al, , 2021aSage et al, 2020;Majumder et al, 2020a;Xu et al, 2019Xu et al, , 2021. For example, (Hwang et al, 2019), a currently-deployed document parsing system for business card and receipt images, consists of three separate modules for text detection, text recognition, and parsing (See Figure 2).…”