2012
DOI: 10.1101/gr.133850.111
|View full text |Cite
|
Sign up to set email alerts
|

A vast collection of microbial genes that are toxic to bacteria

Abstract: In the process of clone-based genome sequencing, initial assemblies frequently contain cloning gaps that can be resolved using cloning-independent methods, but the reason for their occurrence is largely unknown. By analyzing 9,328,693 sequencing clones from 393 microbial genomes, we systematically mapped more than 15,000 genes residing in cloning gaps and experimentally showed that their expression products are toxic to the Escherichia coli host. A subset of these toxic sequences was further evaluated through … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
70
1

Year Published

2012
2012
2020
2020

Publication Types

Select...
5
2
1

Relationship

2
6

Authors

Journals

citations
Cited by 71 publications
(73 citation statements)
references
References 44 publications
2
70
1
Order By: Relevance
“…These results support that YgfX is not a toxin. Finally, in a recent shotgun cloning study to identify novel TA systems from a range of microbial genomes, YgfX was not identified as being toxic (Kimelman et al, 2012;Sberro et al, 2013). These independent lines of experimental investigation provide further evidence that YgfX-SdhE is not a TA system.…”
Section: Discussionmentioning
confidence: 73%
“…These results support that YgfX is not a toxin. Finally, in a recent shotgun cloning study to identify novel TA systems from a range of microbial genomes, YgfX was not identified as being toxic (Kimelman et al, 2012;Sberro et al, 2013). These independent lines of experimental investigation provide further evidence that YgfX-SdhE is not a TA system.…”
Section: Discussionmentioning
confidence: 73%
“…Each point represents one bacterial species. For each of its metabolic genes, EDGE predicted whether its transfer to E. coli would be successful, and the results were compared with previously published experimental data (20,21). The predictor's quality was quantified by the area under the ROC curve (AUC) (y axis).…”
Section: Genes Whose Expression Impedes Growth (According To Edge) Arementioning
confidence: 99%
“…It was previously observed that gaps in Sanger-based genome sequencing are often caused by toxic genes that cannot be expressed in an E. coli host (20). To study EDGE's ability to predict failed gene transfer between organisms due to toxic effects, we used the recently published PanDaTox dataset (21) of genes found to be unclonable into E. coli. We simulated in silico the process of gene transfer from 50 different Gammaproteobacteria into E. coli, and then used EDGE to predict which of the heterologous genes should be toxic to the E. coli host (SI Appendix).…”
Section: Edge Predicts Reduced Fitness Due To Overexpression Of Foreignmentioning
confidence: 99%
See 1 more Smart Citation
“…By experimenting with cumulatively more than 100 gap-residing genes from numerous different organisms, we have verified that our gap-based prediction of gene toxicity exceeds 80% of accuracy. 13,14 The accuracy in our predictions of gene clonability into E. coli stems from several factors. First, the high sequencing coverage, needed for proper genome assembly, dictates that on average more than 25 independent clones would contain each gene.…”
Section: Acknowledgmentsmentioning
confidence: 99%