2021
DOI: 10.1093/nargab/lqab110
|View full text |Cite
|
Sign up to set email alerts
|

Identifying essential genes across eukaryotes by machine learning

Abstract: Identifying essential genes on a genome scale is resource intensive and has been performed for only a few eukaryotes. For less studied organisms essentiality might be predicted by gene homology. However, this approach cannot be applied to non-conserved genes. Additionally, divergent essentiality information is obtained from studying single cells or whole, multi-cellular organisms, and particularly when derived from human cell line screens and human population studies. We employed machine learning across six mo… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
3

Relationship

2
7

Authors

Journals

citations
Cited by 12 publications
(6 citation statements)
references
References 74 publications
(85 reference statements)
0
6
0
Order By: Relevance
“…Although finding similar works in the literature to compare our results was difficult due to our focus on context-specificity, we considered some of them to generally evaluate the goodness of the measures and consolidate the effectiveness of HELP. In [22], the authors proposed an ML method across six model eukaryotes, called CLEARER, that on human cell lines obtained results comparable to HELP (ROC-AUC=0.955±0.018) using many more features (CLEARER: 41635; HELP: 3456 (Kidney) and 3455 (Lung) Bio+CCcfs+N2V). By embedding the human PPI network, [31] reported BA lower than ours (BA=0.783).…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Although finding similar works in the literature to compare our results was difficult due to our focus on context-specificity, we considered some of them to generally evaluate the goodness of the measures and consolidate the effectiveness of HELP. In [22], the authors proposed an ML method across six model eukaryotes, called CLEARER, that on human cell lines obtained results comparable to HELP (ROC-AUC=0.955±0.018) using many more features (CLEARER: 41635; HELP: 3456 (Kidney) and 3455 (Lung) Bio+CCcfs+N2V). By embedding the human PPI network, [31] reported BA lower than ours (BA=0.783).…”
Section: Resultsmentioning
confidence: 99%
“…As EGs are genes strongly involved in physical and functional interactions, we considered four interactions-based attributes: "BIOGRID", the number of interactions annotated for each query gene [21]; "UCSC TFBS", the count of predicted transcription factors binding sites (TFBSs) from the UCSC TFBS; "CCBeder" is a set of attributes describing the GO characterisation of the query gene, also considering those of its April 12, 2024 4/20 neighbours in the PPI. They were obtained by performing, for each gene set, a Fisher's exact test for the enrichment of interaction partners through the code made available by [22]. The GO-CC terms were downloaded from the g:Profiler web service as GMT files [23].…”
Section: Data: Multi-omics Featuresmentioning
confidence: 99%
“…The procedure for feature generation was similar as published earlier for essential gene prediction (54, 55, 78). Each gene served as a sample in the machine learning procedure, either labelled as HDF or non-HDF, or was not used for training the classifiers.…”
Section: Methodsmentioning
confidence: 99%
“…The procedure for feature generation was similar as published earlier for essential gene prediction ( Acencio and Lemke, 2009 ; Aromolaran et al, 2020 ; Beder et al, 2021 ). Each gene served as a sample in the machine learning procedure, either labeled as HDF or non-HDF, or was not used for training the classifiers.…”
Section: Methodsmentioning
confidence: 99%