2012
DOI: 10.1007/s11192-012-0733-6

Towards the automation of address identification

Abstract: A new semi-automatic method is presented to standardize or codify addresses, in order to produce bibliometric indicators from bibliographic databases. The hypothesis is that this new method is very trustworthy to normalize authors' addresses, easy and quick to obtain. As a way to test the method, a set of already hand-coded data is chosen to verify its reliability: 136,821 Spanish documents (2006-2008) downloaded previously from the Web of Science database. Unique addresses from this set were selected to produ…
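The abstract is truncated, so the paper's actual procedure is not visible here. As a rough illustration only of what codifying addresses against previously hand-coded data can look like, the sketch below normalizes address strings, builds a lookup from hand-coded examples, and sets aside unmatched addresses for manual review. The function names, the sample addresses, and the codes ("UGR", "HCB") are all hypothetical and are not taken from the paper.

```python
import re
import unicodedata
from collections import Counter, defaultdict


def normalize(address: str) -> str:
    """Lowercase, strip accents and punctuation, collapse whitespace."""
    text = unicodedata.normalize("NFKD", address)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    text = re.sub(r"[^a-z0-9 ]+", " ", text.lower())
    return " ".join(text.split())


def build_master_list(hand_coded):
    """Map each normalized address to the code most often assigned by hand."""
    votes = defaultdict(Counter)
    for address, code in hand_coded:
        votes[normalize(address)][code] += 1
    return {addr: counts.most_common(1)[0][0] for addr, counts in votes.items()}


def assign_codes(addresses, master):
    """Return (coded, pending): matched addresses and those left for manual review."""
    coded, pending = {}, []
    for address in addresses:
        code = master.get(normalize(address))
        if code is None:
            pending.append(address)  # the manual half of a semi-automatic workflow
        else:
            coded[address] = code
    return coded, pending


# Hypothetical hand-coded sample; the codes are invented for illustration.
hand_coded = [
    ("Univ. Granada, Fac. Ciencias, Granada, Spain", "UGR"),
    ("Hosp Clínic, Barcelona, Spain", "HCB"),
]
master = build_master_list(hand_coded)
coded, pending = assign_codes(
    [
        "UNIV GRANADA, FAC CIENCIAS, GRANADA, SPAIN",
        "Hosp Clinic Barcelona Spain",
        "CSIC, Madrid, Spain",
    ],
    master,
)
```

Exact matching on a normalized string is the simplest possible choice; it is used here only to keep the sketch short, and a real system would likely need fuzzier matching for abbreviations and name variants.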

Cited by 21 publications (8 citation statements)
References 15 publications (25 reference statements)
“…In this study, 75 TCs (Appendix Table 5) were gathered from different sources (through web pages information and/or through email answers). These centres were identified, in WoS documents, and the Spanish institutional sectors with which they collaborate, using automatic applications that analyse addresses and assign optional codes from various master lists (Morillo et al 2013a, b).…”
Section: Materials and Methodology (mentioning)
confidence: 99%
“…Based on that, they also proposed an approximate string metric that handles acronyms and abbreviations [2]. Morillo et al propose a new semi-automatic method is presented to standardize or codify addresses that need a large number of hand-coded data [14]. Nooj is a new corpus processing system with large-coverage multilingual dictionaries and grammars [19].…”
Section: Related Work (mentioning)
confidence: 99%
“…health centres), it is considered adequate for assessing the tendency of hospital groups to include the IIS among their affiliations [a]. Since hospitals and institutes are not standardized in publications and may appear under different names or name variants, a semi-automatic coding of workplaces was carried out, followed by a manual verification phase, which makes it possible to unify each institution's output and identify it properly [6]. The visibility of the IIS was calculated for 2009-2011 and compared with that for the period 2013-2015.…”
Section: Materials and Methods (unclassified)