2023
DOI: 10.1002/pra2.900
|View full text |Cite
|
Sign up to set email alerts
|

A Text Mining Approach to Uncover the Structure of Subject Metadata in the Biodiversity Heritage Library

Yi‐Yun Cheng,
Nikolaus Nova Parulian,
Ly Dinh

Abstract: We propose a bottom‐up, data‐driven pipeline to uncover the structure of biodiversity subject metadata using a combination of text mining approaches. In this study, we analyze 721,035 subject terms in the Biodiversity Heritage Library (BHL). We utilize named entity recognition and word‐embedding methods to systematically label and group terms based on their vector‐space distances. The results show that the subject terms from BHL are clustered into several prominent themes relating to environmental regulations,… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 8 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?