Yue-Yang Bow scite author profile

Yue-Yang Bow

4Publications

23Citation Statements Received

77Citation Statements Given

How they've been cited

How they cite others

Affiliations

Institute of Information Engineering, Institute of Information Science, Academia Sinica

Publications

Order By: Most citations

HypertenGene: extracting key hypertension genes from biomedical literature with position and automatically-generated template features

Tsai¹,

Lai²,

Dai

et al. 2009

BMC Bioinformatics

View full text Add to dashboard Cite

BackgroundThe genetic factors leading to hypertension have been extensively studied, and large numbers of research papers have been published on the subject. One of hypertension researchers' primary research tasks is to locate key hypertension-related genes in abstracts. However, gathering such information with existing tools is not easy: (1) Searching for articles often returns far too many hits to browse through. (2) The search results do not highlight the hypertension-related genes discovered in the abstract. (3) Even though some text mining services mark up gene names in the abstract, the key genes investigated in a paper are still not distinguished from other genes. To facilitate the information gathering process for hypertension researchers, one solution would be to extract the key hypertension-related genes in each abstract. Three major tasks are involved in the construction of this system: (1) gene and hypertension named entity recognition, (2) section categorization, and (3) gene-hypertension relation extraction.ResultsWe first compare the retrieval performance achieved by individually adding template features and position features to the baseline system. Then, the combination of both is examined. We found that using position features can almost double the original AUC score (0.8140vs.0.4936) of the baseline system. However, adding template features only results in marginal improvement (0.0197). Including both improves AUC to 0.8184, indicating that these two sets of features are complementary, and do not have overlapping effects. We then examine the performance in a different domain--diabetes, and the result shows a satisfactory AUC of 0.83.ConclusionOur approach successfully exploits template features to recognize true hypertension-related gene mentions and position features to distinguish key genes from other related genes. Templates are automatically generated and checked by biologists to minimize labor costs. Our approach integrates the advantages of machine learning models and pattern matching. To the best of our knowledge, this the first systematic study of extracting hypertension-related genes and the first attempt to create a hypertension-gene relation corpus based on the GAD database. Furthermore, our paper proposes and tests novel features for extracting key hypertension genes, such as relative position, section, and template features, which could also be applied to key-gene extraction for other diseases.

show abstract

Using conditional random fields for result identification in biomedical abstracts

Lin

Dai

Bow

et al. 2009

ICA

View full text Add to dashboard Cite

The abstracts of biomedical papers usually contain three sections: objective, methods, and results-conclusion. The results-conclusion section is the most important because it usually describes the main contribution of a paper. Unfortunately, not all biomedical journals follow this three-section format. In this paper, we propose a machine learning (ML) based approach to automatically identify the results-conclusion section. The results-conclusion section identification problem is formulated as a sequence labeling task. Four feature sets, including Position, Named Entity, Tense, and Word Frequency, are employed with Conditional Random Fields (CRFs) as the underlying ML model. The experiment results show that the proposed approach can achieve F-measure, precision, and recall of 97.08%, 96.63% and 97.53%, respectively.

show abstract

Result identification for biomedical abstracts using Conditional Random Fields

Lin

Dai

Bow

et al. 2008

View full text Add to dashboard Cite

show abstract

Using contextual information to clarify Gene Normalization ambiguity

Lai

Bow

Huang

et al. 2009

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yue-Yang Bow

HypertenGene: extracting key hypertension genes from biomedical literature with position and automatically-generated template features

Using conditional random fields for result identification in biomedical abstracts

Result identification for biomedical abstracts using Conditional Random Fields

Using contextual information to clarify Gene Normalization ambiguity

Contact Info

Product

Resources

About