“…Instead, we may consider direct matching of place names, which can be done using deep neural networks [3,15,19,30,40,45,49]. However, these models are known to be more suitable for longer sentences with more complex structures, rather than the short simple place names, and they require large amounts of training data, which is extremely hard to acquire for place deduplication [7,9,31]. Finally, by considering additional place attributes like location, ad-hoc models have been developed based on heuristic feature engineering [8,10,22], but their flexibility is limited to take more different place attributes, such as address and category.…”