2019
DOI: 10.1111/2041-210x.13152

CoordinateCleaner: Standardized cleaning of occurrence records from biological collection databases

Abstract: 1. Species occurrence records from online databases are an indispensable resource in ecological, biogeographical and palaeontological research. However, issues with data quality, especially incorrect geo-referencing or dating, can diminish their usefulness. Manual cleaning is time-consuming, error prone, difficult to reproduce and limited to known geographical areas and taxonomic groups, making it impractical for datasets with thousands or millions of records. 2. Here, we present CoordinateCleaner, an r-package…

Cited by 559 publications (407 citation statements)
References: 21 publications
“…Further research into the use of additional niche traits is needed. [Flattened table rows: Diceros bicornis, 162, 0.9237; Equus burchellii, 740, 0.894; each row followed by a truncated list of numeric indices]…”
Section: Results (mentioning)
confidence: 99%
“…Then, qualitative information on the geographic distribution of each wild progenitor was searched for, and records were filtered manually by drawing one polygon per taxon that excluded occurrences outside its native range. Country centroids and capitals, coordinates with equal latitude/longitude, zero coordinates, coordinates in biodiversity institutions and GBIF headquarters, and records flagged as fossil specimens were also removed using function clean_coordinates of the ‘CoordinateCleaner’ package in R (Zizka et al, ). The cleaned set of coordinates included 149,847 occurrences, and was used to retrieve climate and ecoregion data at the locations of each crop wild progenitor.…”
Section: Methods (mentioning)
confidence: 99%
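
Two of the checks named in the statement above (zero coordinates and equal latitude/longitude) are simple enough to sketch. The following is a minimal illustration in Python, not the implementation of `clean_coordinates` in the R package: the `Record` type and the function names are hypothetical, and only exact matches are tested. The other checks listed in the statement (centroids, capitals, institutions) need reference data and are not attempted here.

```python
from typing import List, NamedTuple


class Record(NamedTuple):
    """Hypothetical occurrence record; not a CoordinateCleaner type."""
    species: str
    lat: float
    lon: float


def is_zero(rec: Record, tol: float = 1e-9) -> bool:
    """True when both coordinates are (numerically) zero."""
    return abs(rec.lat) <= tol and abs(rec.lon) <= tol


def has_equal_lat_lon(rec: Record, tol: float = 1e-9) -> bool:
    """True when latitude and longitude are (numerically) identical."""
    return abs(rec.lat - rec.lon) <= tol


def drop_flagged(records: List[Record]) -> List[Record]:
    """Keep only records that pass both illustrative checks."""
    return [r for r in records if not (is_zero(r) or has_equal_lat_lon(r))]


records = [
    Record("a", 0.0, 0.0),     # zero coordinates
    Record("b", 10.0, 10.0),   # equal latitude and longitude
    Record("c", -1.5, 36.8),   # passes both checks
]
cleaned = drop_flagged(records)
```

Here `cleaned` keeps only the third record. A record at 0/0 is caught by either check; keeping the two tests separate makes it possible to report which rule flagged a record.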
“…Species occurrence records were acquired from the Global Biodiversity Information Facility via the rgbif function in the dismo package in R (Hijmans et al 2015). Geographic outliers were removed (Zizka et al 2019), climatic outliers more than two standard deviations from the mean were removed (Butterfield et al 2017), as were duplicates within the same ~1-km climate grid cell. Species with fewer than 20 records (30 of 385 species) were excluded from further analysis (Thuiller et al 2005).…”
Section: Climate Niche Estimates (mentioning)
confidence: 99%
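
The two non-geographic filters in the statement above (dropping climatic values more than two standard deviations from the mean, then keeping one record per grid cell) can be sketched as follows. This is a rough Python illustration under stated assumptions, not the cited authors' code: records are plain `(lat, lon, climate_value)` tuples, the sample standard deviation is used, and the ~1-km cell is approximated as 30 arc-seconds (1/120 degree), which is about 1 km only near the equator. All function names are hypothetical.

```python
import math
from statistics import mean, stdev
from typing import List, Tuple

Rec = Tuple[float, float, float]  # (lat, lon, climate_value); assumed layout

CELL_DEG = 1.0 / 120.0  # assumption: 30 arc-second grid, roughly 1 km at the equator


def drop_climatic_outliers(records: List[Rec], k: float = 2.0) -> List[Rec]:
    """Remove records whose climate value lies more than k SDs from the mean."""
    values = [r[2] for r in records]
    if len(values) < 2:
        return list(records)  # SD undefined for fewer than two values
    mu, sd = mean(values), stdev(values)
    if sd == 0:
        return list(records)  # all values identical, nothing to flag
    return [r for r in records if abs(r[2] - mu) <= k * sd]


def dedupe_by_cell(records: List[Rec], cell_deg: float = CELL_DEG) -> List[Rec]:
    """Keep the first record encountered in each grid cell."""
    seen = set()
    kept = []
    for lat, lon, value in records:
        cell = (math.floor(lat / cell_deg), math.floor(lon / cell_deg))
        if cell not in seen:
            seen.add(cell)
            kept.append((lat, lon, value))
    return kept


# Nine ordinary values plus one extreme value, each in its own grid cell.
sample = [(float(i), float(i), 10.0) for i in range(9)] + [(20.0, 20.0, 100.0)]
filtered = drop_climatic_outliers(sample)

# The first two points share a cell; the third falls in a different one.
unique = dedupe_by_cell([(0.001, 0.001, 1.0), (0.002, 0.002, 2.0), (0.02, 0.02, 3.0)])
```

With these inputs, `filtered` drops only the record with value 100.0, and `unique` keeps two of the three points. A single extreme value inflates both the mean and the standard deviation, so a two-SD rule can miss outliers in very small samples.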