2022
DOI: 10.1111/ecog.06068
|View full text |Cite
|
Sign up to set email alerts
|

Automated synthesis of biodiversity knowledge requires better tools and standardised research output

Abstract: As the impact of anthropogenic activity on the environment has grown, research into biodiversity change and associated threats has also accelerated. Synthesising this vast literature is important for understanding the drivers of biodiversity change and identifying those actions that will mitigate further ecological losses. However, keeping pace with an ever‐increasing publication rate presents a substantial challenge to efficient syntheses, an issue which could be partly addressed by increasing levels of autom… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(5 citation statements)
references
References 58 publications
(60 reference statements)
0
5
0
Order By: Relevance
“…To put some of the results obtained in this work into a broader perspective, a comparison can be made with previous studies that have utilised text mining tools for applications in ecology. Previous attempts to extract taxonomic terms from abstracts have resulted in mean recall scores per abstract of 79.5% (Millard et al, 2020) and 93.6% (Cornford et al, 2022) with the help of the R package Taxize (Chamberlain and Szöcs, 2013), while the extraction of geographic locations has been achieved with a mean recall of 82.1% per abstract (Cornford et al, 2022) with the help of the CLIFFCLAVIN geoparser model (D’Ignazio et al, 2014). Furthermore, an extensive comparison of eight taxonomic named entity recognition (NER) models over four gold standard ecology corpora (Le Guillarme and Thuiller, 2022) reports scores for approximate matches ranging between 78–96% (precision), 74–93% (recall) and 76– 91% (F1-score).…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…To put some of the results obtained in this work into a broader perspective, a comparison can be made with previous studies that have utilised text mining tools for applications in ecology. Previous attempts to extract taxonomic terms from abstracts have resulted in mean recall scores per abstract of 79.5% (Millard et al, 2020) and 93.6% (Cornford et al, 2022) with the help of the R package Taxize (Chamberlain and Szöcs, 2013), while the extraction of geographic locations has been achieved with a mean recall of 82.1% per abstract (Cornford et al, 2022) with the help of the CLIFFCLAVIN geoparser model (D’Ignazio et al, 2014). Furthermore, an extensive comparison of eight taxonomic named entity recognition (NER) models over four gold standard ecology corpora (Le Guillarme and Thuiller, 2022) reports scores for approximate matches ranging between 78–96% (precision), 74–93% (recall) and 76– 91% (F1-score).…”
Section: Discussionmentioning
confidence: 99%
“…Much of our knowledge of the global loss of biodiversity stems from large-scale syntheses of the ecological literature (Cornford et al, 2022). Such syntheses underlie the establishment of global environmental databases such as the WWF’s Living Planet Index (LPI, 2024), the PREDICTS (Hudson et al, 2017), and the BioTIME (Dornelas et al, 2018) databases, as well as the myriad global reports such as those of the Intergovernmental Science-Policy Platform on Biodiversity and Ecosystem Services (IPBES, 2019) and the Living Planet Report (Almond et al, 2022).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…To put some of the results obtained in this work into a broader perspective, we provide a comparison with previous studies that have utilised text mining tools for applications in ecology. Previous attempts to extract taxonomic terms from abstracts have resulted in mean recall scores per abstract of 79.5% (Millard et al., 2020) and 93.6% (Cornford et al., 2022) with the help of the R package Taxize (Chamberlain & Szöcs, 2013), while the extraction of geographic locations has been achieved with a mean recall of 82.1% per abstract (Cornford et al., 2022) with the help of the CLIFF‐CLAVIN geoparser model (D'Ignazio et al., 2014). Furthermore, an extensive comparison of eight taxonomic NER models over four gold standard ecology corpora (Le Guillarme & Thuiller, 2022) reports scores for approximate matches ranging between 78%–96% (precision), 74%–93% (recall) and 76%–91% (F1‐score).…”
Section: Discussionmentioning
confidence: 99%
“…Text mining and natural language processing (NLP) methods are expected to have significant potential in automating tasks such as document classification, named entity recognition (NER) and disambiguation, and the extraction of relations between entities (Farrell et al., 2022). Previous approaches in ecology have focused heavily on named entity recognition of species and taxonomy (Akella et al., 2012; Gerner et al., 2010; Le Guillarme & Thuiller, 2022; Millard et al., 2020) as well as geographical locations or population trends (Cornford et al., 2022) but also include document classification (Cornford et al., 2021) and relation extraction (Kaur et al., 2019). Furthermore, various gold standard databases of species names and taxonomy have been published to aid the evaluation of NER approaches in ecology (Abdelmageed et al., 2022; Nguyen et al., 2019).…”
Section: Introductionmentioning
confidence: 99%