Proceedings of the 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems 2019
DOI: 10.1145/3347146.3359070
|View full text |Cite
|
Sign up to set email alerts
|

Scaling Address Parsing Sequence Models through Active Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 10 publications
0
2
0
Order By: Relevance
“…This training dataset may be subtly different from the actual data the resulting algorithms are applied to, which can lead to weaknesses. For example, the most well-established parser ‘libpostal’ (Barratine, 2017) has been shown to have issues parsing incomplete addresses (Yassine et al, 2020), and it also struggles with addresses in formats it is not used to (Craig et al, 2019). A more fundamental issue may be that the algorithms are designed on the assumption that they are parsing a single address, but the OCOD dataset contains large numbers of nested addresses in which a single free text line may contain tens or even hundreds of properties.…”
Section: Introductionmentioning
confidence: 99%
“…This training dataset may be subtly different from the actual data the resulting algorithms are applied to, which can lead to weaknesses. For example, the most well-established parser ‘libpostal’ (Barratine, 2017) has been shown to have issues parsing incomplete addresses (Yassine et al, 2020), and it also struggles with addresses in formats it is not used to (Craig et al, 2019). A more fundamental issue may be that the algorithms are designed on the assumption that they are parsing a single address, but the OCOD dataset contains large numbers of nested addresses in which a single free text line may contain tens or even hundreds of properties.…”
Section: Introductionmentioning
confidence: 99%
“…This training dataset may be subtly different from the actual data the resulting algorithms are applied to. This can lead to weaknesses, for example the most well established parser libpostal [23] has been shown to have issues parsing incomplete addresses [22], it also struggles with addresses in formats it is not used to [26]. A more fundemental issue may be that the algorithms are designed on the assumption that they are parsing a single address, the OCOD dataset contains large numbers of nested addresses where a single free text line may contain tens or even hundreds of properties.…”
Section: Introductionmentioning
confidence: 99%