2003
DOI: 10.1089/106652703322756104
|View full text |Cite
|
Sign up to set email alerts
|

Mining the Biomedical Literature in the Genomic Era: An Overview

Abstract: The past decade has seen a tremendous growth in the amount of experimental and computational biomedical data, specifically in the areas of genomics and proteomics. This growth is accompanied by an accelerated increase in the number of biomedical publications discussing the findings. In the last few years, there has been a lot of interest within the scientific community in literature-mining tools to help sort through this abundance of literature and find the nuggets of information most relevant and useful for s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
113
0
3

Year Published

2004
2004
2013
2013

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 204 publications
(118 citation statements)
references
References 98 publications
1
113
0
3
Order By: Relevance
“…Precision and recall are the most common parameters when evaluating the IE system. Precision is to evaluate whether the system can only extract correct information, recall is to evaluate whether the system can recognize all useful information (Shatkay et al, 2003). The definition of precision and recall is shown as follows: Precision = number of correctly extracted entities / number of total extracted entities Recall = number of correctly extracted entities / number of all correct entities…”
Section: Evaluation Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Precision and recall are the most common parameters when evaluating the IE system. Precision is to evaluate whether the system can only extract correct information, recall is to evaluate whether the system can recognize all useful information (Shatkay et al, 2003). The definition of precision and recall is shown as follows: Precision = number of correctly extracted entities / number of total extracted entities Recall = number of correctly extracted entities / number of all correct entities…”
Section: Evaluation Resultsmentioning
confidence: 99%
“…(Shatkay, et al, 2003;Cohen, et al, 2004;Goodman, et al, 2004;Woods, et al, 2004). The experimental data is embedded in tens of thousands of published literature and it is difficult for the individual researcher to extract a comprehensive view of the gene function and regulation in different genomes.…”
Section: -6mentioning
confidence: 99%
“…Recently, biomedical journal articles have been a major source of interest in the IE community for a number of reasons: the amount of data available is enormous; the objects, proteins and genes, do not have standard naming conventions; and there is interest from biomedical practitioners to quickly find relevant information (Blaschke et al, 2002;Shatkay and Feldman, 2003;Ray and Craven, 2001;Bunescu et al, 2005). We have focused on learning multi-slot protein localization from Medline 1 abstracts, where the task is to identify links between phrases which correspond to a protein and the location of that particular protein in a cell.…”
Section: Information Extractionmentioning
confidence: 99%
“…However, the newspaper English which makes up the bulk of the PTB is only one of many distinct genres of writing in the Anglophone world, and certainly not the only domain where potential natural-language processing (NLP) applications exist that would benefit from robust and reliable syntactic analysis. Due to the massive glut of published literature, the biomedical sciences in general, and molecular biology in particular, constitute one such domain, and indeed much attention has been focused recently on NLP in this area (Shatkay and Feldman, 2003;Cohen and Hunter, 2004).…”
Section: Introductionmentioning
confidence: 99%