2023
DOI: 10.1101/2023.01.17.524402
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Predicting Protein-encoding Gene Content inEscherichia coliGenomes

Abstract: In this study, we built machine learning classifiers for predicting the presence or absence of the variable genes occurring in 10-90% of all publicly available high-quality Escherichia coli genomes. The BV-BRC genus-specific protein families were used to define orthologs across the set of genomes, and a single binary classifier was built for predicting the presence or absence of each family in each genome. Each model was built using the nucleotide k-mers from a set of 100 conserved genes as features. The resul… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 70 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?