“…It calculates the average grey value for each pixel column then split every blank region in the middle, making it vulnerable to disconnected structure and touching characters. Recently, improved methods have been proposed but are only specific for single language [1,2,5,6,13,16,17,19,23]. Other researches exploit complex processing pipelines and hand-crafted rules to tackle multilingual cases [4,10,24,25].…”