2020
DOI: 10.1111/tgis.12627
|View full text |Cite
|
Sign up to set email alerts
|

NeuroTPR: A neuro‐net toponym recognition model for extracting locations from social media messages

Abstract: Social media messages, such as tweets, are frequently used by people during natural disasters to share real‐time information and to report incidents. Within these messages, geographic locations are often described. Accurate recognition and geolocation of these locations are critical for reaching those in need. This article focuses on the first part of this process, namely recognizing locations from social media messages. While general named entity recognition tools are often used to recognize locations, their … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
66
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 75 publications
(66 citation statements)
references
References 41 publications
0
66
0
Order By: Relevance
“…Statistical learning: As the development of machine learning, specifically deep learning, using statistical learning algorithms to extracting place names is gaining more and more attentions (Wang, Hu, and Joseph 2020;Qi et al 2020;Gelernter and Mushegian 2011;Lingad, Karimi, and Yin 2013;Unankard, Li, and Sharaf 2015;Das and Purves 2019;Limsopatham and Collier 2016;Kumar and Singh 2019). Given abundant annotated data, statistical learning-based approaches can recognize the place names according to the context cues and the intrinsic features of place names, thus achieving a higher tagging accuracy than the rule-based approaches.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Statistical learning: As the development of machine learning, specifically deep learning, using statistical learning algorithms to extracting place names is gaining more and more attentions (Wang, Hu, and Joseph 2020;Qi et al 2020;Gelernter and Mushegian 2011;Lingad, Karimi, and Yin 2013;Unankard, Li, and Sharaf 2015;Das and Purves 2019;Limsopatham and Collier 2016;Kumar and Singh 2019). Given abundant annotated data, statistical learning-based approaches can recognize the place names according to the context cues and the intrinsic features of place names, thus achieving a higher tagging accuracy than the rule-based approaches.…”
Section: Related Workmentioning
confidence: 99%
“…Using 10-fold crossvalidation, the model achieved an F1-score of 0.96. NeuroTPR (Wang, Hu, and Joseph 2020) extended a general Bi-LSTM architecture with several features to account for the linguistic irregularities in Twitter texts, such as the use of character embeddings to capture the morphological features of words, and POS tags and contextual embeddings to capture the semantics of tokens in tweets. The approach mitigates the need for a large training dataset by generating annotated data from Wikipedia articles for the task of location name extraction.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…gazetteers of place names or lists of locative cues) together with supportive natural language processing (NLP) tasks, such as the part-of-speech (POS) tagging of n-grams [3,11,12]. Other systems are based on probabilistic frameworks, using Conditional Random Fields (CRF) or neural networks such as Convolutional Neural Networks (CNN) or Recurrent Neural Networks (RNN) with or without additional linguistic feature engineering [7,13]. Others present hybrid systems, which combine rule-based methods with machine-or deep-learning techniques and/or other named entity recognition (NER) tools [8,14].…”
Section: Related Workmentioning
confidence: 99%
“…A location extractor called NeuroTPR was built by [13], who used a bi-directional RNN with Long Short Term Memory (LSTM) enriched with linguistic features. Apart from building a tweet corpus from the 2017 Hurricane Harvey dataset, they also used GeoCorpora [4].…”
Section: Related Workmentioning
confidence: 99%