Qinjun Qiu scite author profile

A variety of detailed data about geological topics and geoscience knowledge are buried in the geoscience literature and rarely used. Named entity recognition (NER) provides both opportunities and challenges to leverage this wealth of data in the geoscience literature for data analysis and further information extraction. Existing NER models and techniques are mainly based on rule‐based and supervised approaches, and developing such systems requires a costly manual effort. In this paper, we first design a generic stepwise framework for domain‐specific NER. Following this framework, domain‐specific entities and domain‐general words are collected and selected as seed terms. Normalization and grouping processes are then applied to these seed terms for further analysis. A random extraction algorithm based on a unigram language model is used to generate a large‐scale training data set consisting of probabilistically labeled pseudosentences. Each generated sentence is then used as input to the self‐training and learning algorithm. Experimental results on two constructed data sets demonstrate that the proposed model effectively recognizes and identifies geological named entities.
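
The pseudosentence generation step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the seed terms, their counts, the two group names, the `p_entity` mixing parameter, and the use of the unigram probability as a soft label are all assumptions made here for illustration only.

```python
import random
from collections import Counter

# Hypothetical seed terms with made-up counts (not taken from the paper).
SEED_COUNTS = {
    "ENTITY": Counter({"granite": 5, "basalt": 3, "quartz vein": 2}),
    "GENERAL": Counter({"the": 10, "contains": 4, "near": 3, "with": 3}),
}


def unigram_probs(counts):
    """Turn raw term counts into a unigram probability distribution."""
    total = sum(counts.values())
    return {term: n / total for term, n in counts.items()}


def generate_pseudosentence(rng, length=6, p_entity=0.3):
    """Randomly draw seed terms to form one labeled pseudosentence.

    Each item is (term, group_label, unigram_probability). Attaching the
    unigram probability as a soft label is an assumption of this sketch.
    """
    probs = {group: unigram_probs(c) for group, c in SEED_COUNTS.items()}
    sentence = []
    for _ in range(length):
        # Pick a group first, then a term within it by unigram weight.
        group = "ENTITY" if rng.random() < p_entity else "GENERAL"
        terms = list(probs[group])
        weights = [probs[group][t] for t in terms]
        term = rng.choices(terms, weights=weights, k=1)[0]
        sentence.append((term, group, probs[group][term]))
    return sentence


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so runs are repeatable
    for item in generate_pseudosentence(rng):
        print(item)
```

Calling `generate_pseudosentence` repeatedly would yield a synthetic training set of arbitrary size; how the paper's self-training step consumes such data is not specified in the abstract and is not modeled here.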

Automatic spatiotemporal and semantic information extraction from unstructured geoscience reports using text mining techniques

Qiu, Xie, Tao, 2020, Earth Sci Inform

Chinese Word Segmentation Based on Self‐Learning Model and Geological Knowledge for the Geoscience Domain

Qiu et al. 2021, Earth and Space Science

What is this article about? Generative summarization with the BERT model in the geosciences domain

Miao, Tan et al. 2021, Earth Sci Inform

Mapping hydrothermally altered minerals with AST_07XT, AST_05 and Hyperion datasets using a voting-based extreme learning machine algorithm

Wan et al. 2019, Ore Geology Reviews

GNER: A Generative Model for Geological Named Entity Recognition Without Labeled Data Using Deep Learning
