1. Species occurrence records from online databases are an indispensable resource in ecological, biogeographical and palaeontological research. However, issues with data quality, especially incorrect geo-referencing or dating, can diminish their usefulness. Manual cleaning is time-consuming, error prone, difficult to reproduce and limited to known geographical areas and taxonomic groups, making it impractical for datasets with thousands or millions of records.2. Here, we present CoordinateCleaner, an r-package to scan datasets of species occurrence records for geo-referencing and dating imprecisions and data entry errors in a standardized and reproducible way. CoordinateCleaner is tailored to problems common in biological and palaeontological databases and can handle datasets with millions of records. The software includes (a) functions to flag potentially problematic coordinate records based on geographical gazetteers, (b) a global database of 9,691 geo-referenced biodiversity institutions to identify records that are likely from horticulture or captivity, (c) novel algorithms to identify datasets with rasterized data, conversion errors and strong decimal rounding and (d) spatio-temporal tests for fossils.3. We describe the individual functions available in CoordinateCleaner and demonstrate them on more than 90 million occurrences of flowering plants from the Global Biodiversity Information Facility (GBIF) and 19,000 fossil occurrences from the Palaeobiology Database (PBDB). We find that in GBIF more than 3.4 million records (3.7%) are potentially problematic and that 179 of the tested contributing This is an open access article under the terms of the Creative Commons Attribution-NonCommercial License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
Amazonia is an environmentally heterogeneous and biologically megadiverse region, and its biodiversity varies considerably over space. However, existing knowledge on Amazonian biodiversity and its environmental determinants stems almost exclusively from studies of macroscopic above‐ground organisms, notably vertebrates and trees. In contrast, diversity patterns of most other organisms remain elusive, although some of them, for instance microorganisms, constitute the overwhelming majority of taxa in any given location, both in terms of diversity and abundance. Here, we use DNA metabarcoding to estimate prokaryote and eukaryote diversity in environmental soil and litter samples from 39 survey plots in a longitudinal transect across Brazilian Amazonia using 16S and 18S gene sequences, respectively. We characterize richness and community composition based on operational taxonomic units (OTUs) and test their correlation with longitude and habitat. We find that prokaryote and eukaryote OTU richness and community composition differ significantly among localities and habitats, and that prokaryotes are more strongly structured by locality and habitat type than eukaryotes. Our results 1) provide a first large‐scale mapping of Amazonian soil biodiversity, suggesting that OTU richness patterns might follow substantially different patterns from those observed for macro‐organisms; and 2) indicate that locality and habitat factors interact in determining OTU richness patterns and community composition. This study shows the potential of DNA metabarcoding in unveiling Amazonia's outstanding diversity, despite the lack of complete reference sequence databases for the organisms sequenced.
The unparalleled biodiversity found in the American tropics (the Neotropics) has attracted the attention of naturalists for centuries. Despite major advances in recent years in our understanding of the origin and diversification of many Neotropical taxa and biotic regions, many questions remain to be answered. Additional biological and geological data are still needed, as well as methodological advances that are capable of bridging these research fields. In this review, aimed primarily at advanced students and early-career scientists, we introduce the concept of “trans-disciplinary biogeography,” which refers to the integration of data from multiple areas of research in biology (e.g., community ecology, phylogeography, systematics, historical biogeography) and Earth and the physical sciences (e.g., geology, climatology, palaeontology), as a means to reconstruct the giant puzzle of Neotropical biodiversity and evolution in space and time. We caution against extrapolating results derived from the study of one or a few taxa to convey general scenarios of Neotropical evolution and landscape formation. We urge more coordination and integration of data and ideas among disciplines, transcending their traditional boundaries, as a basis for advancing tomorrow’s ground-breaking research. Our review highlights the great opportunities for studying the Neotropical biota to understand the evolution of life.
BackgroundKnowledge on the globally outstanding Amazonian biodiversity and its environmental determinants stems almost exclusively from aboveground organisms, notably plants. In contrast, the environmental factors and habitat preferences that drive diversity patterns for micro-organisms in the ground remain elusive, despite the fact that micro-organisms constitute the overwhelming majority of life forms in any given location, in terms of both diversity and abundance. Here we address how the diversity and community turnover of operational taxonomic units (OTU) of organisms in soil and litter respond to soil physicochemical properties; whether OTU diversities and community composition in soil and litter are correlated with each other; and whether they respond in a similar way to soil properties.MethodsWe used recently inferred OTUs from high-throughput metabarcoding of the 16S (prokaryotes) and 18S (eukaryotes) genes to estimate OTU diversity (OTU richness and effective number of OTUs) and community composition for prokaryotes and eukaryotes in soil and litter across four localities in Brazilian Amazonia. All analyses were run separately for prokaryote and eukaryote OTUs, and for each group using both presence-absence and abundance data. Combining these with novel data on soil chemical and physical properties, we identify abiotic correlates of soil and litter organism diversity and community structure using regression, ordination, and variance partitioning analysis.ResultsSoil organic carbon content was the strongest factor explaining OTU diversity (negative correlation) and pH was the strongest factor explaining community turnover for prokaryotes and eukaryotes in both soil and litter. We found significant effects also for other soil variables, including both chemical and physical properties. The correlation between OTU diversity in litter and in soil was non-significant for eukaryotes and weak for prokaryotes. The community compositions of both prokaryotes and eukaryotes were more separated among habitat types (terra-firme, várzea, igapó and campina) than between substrates (soil and litter).DiscussionIn spite of the limited sampling (four localities, 39 plots), our results provide a broad-scale view of the physical and chemical correlations of soil and litter biodiversity in a longitudinal transect across the world’s largest rainforest. Our methods help to understand links between soil properties, OTU diversity patterns, and community composition and turnover. The lack of strong correlation between OTU diversity in litter and in soil suggests independence of diversity drives of these substrates and highlights the importance of including both measures in biodiversity assessments. Massive sequencing of soil and litter samples holds the potential to complement traditional biological inventories in advancing our understanding of the factors affecting tropical diversity.
Fungi are highly diverse organisms, which provide multiple ecosystem services.However, compared with charismatic animals and plants, the distribution patterns and conservation needs of fungi have been little explored. Here, we examined endemicity patterns, global change vulnerability and conservation priority areas for functional groups of soil fungi based on six global surveys using a high-resolution, long-read metabarcoding approach. We found that the endemicity of all fungi and most functional groups peaks in tropical habitats, including Amazonia, Yucatan, West-Central Africa, Sri Lanka, and New Caledonia, with a negligible island effect compared with plants and animals. We also found that fungi are predominantly vulnerable to drought, heat and land-cover change, particularly in dry tropical regions with high human population density. Fungal conservation areas of highest priority include herbaceous wetlands, tropical forests, and woodlands. We stress that more attention should be focused on the conservation of fungi, especially root symbiotic arbuscular mycorrhizal and ectomycorrhizal fungi in tropical regions as well as unicellular early-diverging groups and macrofungi in general. Given the low overlap between the endemicity of fungi and macroorganisms, but high conservation needs in both groups, detailed analyses on distribution and conservation requirements are warranted for other microorganisms and soil organisms.
The rapid loss of biodiversity, coupled with difficulties in species identification, call for innovative approaches to assess biodiversity. Insects make up a substantial proportion of extant diversity and play fundamental roles in any given ecosystem. To complement morphological species identification, new techniques such as metabarcoding make it possible to quantify insect diversity and insect–ecosystem interactions through DNA sequencing. Here we examine the potential of bulk insect samples (i.e., containing many non-sorted specimens) to assess prokaryote and eukaryote biodiversity and to complement the taxonomic coverage of soil samples. We sampled 25 sites on three continents and in various ecosystems, collecting insects with SLAM traps (Brazil) and Malaise traps (South Africa and Sweden). We then compared our diversity estimates with the results obtained with biodiversity data from soil samples from the same localities. We found a largely different taxonomic composition between the soil and insect samples, testifying to the potential of bulk insect samples to complement soil samples. Finally, we found that non-destructive DNA extraction protocols, which preserve insect specimens for morphological studies, constitute a promising choice for cost-effective biodiversity assessments. We propose that the sampling and sequencing of insect samples should become a standard complement for biodiversity studies based on environmental DNA.
Fungi are highly important biotic components of terrestrial ecosystems, but we still have a very limited understanding about their diversity and distribution. This data article releases a global soil fungal dataset of the Global Soil Mycobiome consortium (GSMc) to boost further research in fungal diversity, biogeography and macroecology. The dataset comprises 722,682 fungal operational taxonomic units (OTUs) derived from PacBio sequencing of full-length ITS and 18S-V9 variable regions from 3200 plots in 108 countries on all continents. The plots are supplied with geographical and edaphic metadata. The OTUs are taxonomically and functionally assigned to guilds and other functional groups. The entire dataset has been corrected by excluding chimeras, index-switch artefacts and potential contamination. The dataset is more inclusive in terms of geographical breadth and phylogenetic diversity of fungi than previously published data. The GSMc dataset is available over the PlutoF repository.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.