2015
DOI: 10.1016/j.jbi.2015.01.003
|View full text |Cite
|
Sign up to set email alerts
|

LGscore: A method to identify disease-related genes using biological literature and Google data

Abstract: Since the genome project in 1990s, a number of studies associated with genes have been conducted and researchers have confirmed that genes are involved in disease. For this reason, the identification of the relationships between diseases and genes is important in biology. We propose a method called LGscore, which identifies disease-related genes using Google data and literature data. To implement this method, first, we construct a disease-related gene network using text-mining results. We then extract gene-gen… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2015
2015
2019
2019

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 18 publications
(8 citation statements)
references
References 21 publications
0
7
0
Order By: Relevance
“…We defined a score method to measure the relevance (specificity) of the information in our system to OCD. The concept of relevance of an item A (gene, miRNA, drug, SNP, region) to the pathology OCD is defined through a triplet of values normalized by z-scores ( 63 ): w 1 : the number of articles in OCDB containing A and the association with the pathology OCD. w 2 : the number of relations (A, B) in the database.…”
Section: Methodsmentioning
confidence: 99%
“…We defined a score method to measure the relevance (specificity) of the information in our system to OCD. The concept of relevance of an item A (gene, miRNA, drug, SNP, region) to the pathology OCD is defined through a triplet of values normalized by z-scores ( 63 ): w 1 : the number of articles in OCDB containing A and the association with the pathology OCD. w 2 : the number of relations (A, B) in the database.…”
Section: Methodsmentioning
confidence: 99%
“…For example, a possible extractor would say gene X is associated with disease Y, because gene X and disease Y appear together more often than individually [20]. This approach has been used to establish the following relationship types: disease-gene relationships [20,21,22,23,24,25], protein-protein interactions [24,26,27], drug-disease treatments [28], and tissue-gene relations [29]. Extractors using the co-occurrence strategy provide exceptional recall results; however, these methods may fail to detect underreported relationships, because they depend on entity-pair frequency for detection.…”
Section: Unsupervised Extractorsmentioning
confidence: 99%
“…Text mining method had been applied in studying various biological problems like functional genomics [ 9 ], biological pathways [ 10 ], protein-protein interactions [ 11 ], protein representation [ 12 ], drug-gene association [ 13 ], comparative toxicogenomics [ 14 , 15 ], neuropsychiatric disorder [ 16 ], and other areas in the biomedical domain [ 17 ] including large-scale bioinformatics analyses [ 8 , 18 32 ]. DISEASES predicted the association through the co-occurrence method [ 21 ].…”
Section: Introductionmentioning
confidence: 99%
“…MimMiner [ 28 ] transformed OMIM [ 29 ] text to a relationship matrix and quantified the association among diseases using the term frequency–inverse document frequency method (TF-IDF). CATAPULT [ 8 ] and Heterogeneous Network Edge Prediction (HNEP) [ 30 ] integrated the graphic model and machine learning method, IMC [ 31 ] used a semi-supervised machine learning method, and LGscore [ 32 ] associated genes with disease through a Google search engine to predict associations between genes and diseases.…”
Section: Introductionmentioning
confidence: 99%