2022
DOI: 10.1111/2041-210x.13868
|View full text |Cite
|
Sign up to set email alerts
|

bdc: A toolkit for standardizing, integrating and cleaning biodiversity data

Abstract: 1. The increase in online and openly accessible biodiversity databases provides a vast and invaluable resource to support research and policy. However, without scrutiny, errors in primary species occurrence data can lead to erroneous results and misleading information.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
20
0
2

Year Published

2022
2022
2024
2024

Publication Types

Select...
8
2

Relationship

0
10

Authors

Journals

citations
Cited by 32 publications
(28 citation statements)
references
References 28 publications
(40 reference statements)
0
20
0
2
Order By: Relevance
“…The BeeBDC package and script relied heavily on several R- packages. Much of the workflow script used and built upon the bdc (biodiversity data cleaner) package 13 , tidyverse packages — particularly dplyr 65 , magrittr 66 , tibble 67 , stringr 68 , tidyselect 69 , ggplot2 70 , tidyr 71 , rlang 72 , xml2 73 , readr 74 , and lubridate 75 , — and CoordinateCleaner 76 . Additionally, we used R.utils 77 , galah 63 , emld 78 , openxlsx 79 , rnaturalearth 80 , rnaturalearthdata 81 , countrycode 82 , hexbin 83 , cowplot 84 , ggspatial 85 , renv 86 , chorddiag 87 , igraph 88 , sf 89 , and terra 90 .…”
Section: Methodsmentioning
confidence: 99%
“…The BeeBDC package and script relied heavily on several R- packages. Much of the workflow script used and built upon the bdc (biodiversity data cleaner) package 13 , tidyverse packages — particularly dplyr 65 , magrittr 66 , tibble 67 , stringr 68 , tidyselect 69 , ggplot2 70 , tidyr 71 , rlang 72 , xml2 73 , readr 74 , and lubridate 75 , — and CoordinateCleaner 76 . Additionally, we used R.utils 77 , galah 63 , emld 78 , openxlsx 79 , rnaturalearth 80 , rnaturalearthdata 81 , countrycode 82 , hexbin 83 , cowplot 84 , ggspatial 85 , renv 86 , chorddiag 87 , igraph 88 , sf 89 , and terra 90 .…”
Section: Methodsmentioning
confidence: 99%
“…While strides have been made to refine SDM methods that are robust to very small sample sizes, there has also been tremendous growth in the volume of biodiversity data federated in, for example, the Global Biodiversity Information Facility (GBIF) and other databases, addressing the ‘Wallacean shortfall’ (Lomolino, 2004) and providing larger samples for SDM. There are also improved tools to access, screen and edit georeferenced biodiversity data from multiple sources for SDM (Ribeiro et al, 2022).…”
Section: Challengesmentioning
confidence: 99%
“…In practice, a first step is always to build species geographic extents of occurrence, which is now usually done based on local occurrence data (e.g., as available from GBIF). Dealing with such datasets, in turn, require procedures to evaluate data consistency and taxonomic resolution (e.g., Ribeiro et al 2022), which may be challenging under the potential links between the three shortfalls. Additionally, we can use several methods to deal with this issue of occurrence data uncertainty and explore Wallacean shortfall, and a promising approach would be to use the 'maps of biogeographical ignorance' (MOBIs) of each currently known species (Tessarolo et al 2021), or to use MOBIs created for entire groups altogether.…”
Section: Geographical Components Of Taxonomic Uncertainty and The Wal...mentioning
confidence: 99%