2001
DOI: 10.1093/bioinformatics/17.12.1123
|View full text |Cite
|
Sign up to set email alerts
|

A probabilistic method for identifying start codons in bacterial genomes

Abstract: As the pace of genome sequencing has accelerated, the need for highly accurate gene prediction systems has grown. Computational systems for identifying genes in prokaryotic genomes have sensitivities of 98-99% or higher (Delcher et al., Nucleic Acids Res., 27, 4636-4641, 1999). These accuracy figures are calculated by comparing the locations of verified stop codons to the predictions. Determining the accuracy of start codon prediction is more problematic, however, due to the relatively small number of start si… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
135
0
1

Year Published

2002
2002
2015
2015

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 175 publications
(136 citation statements)
references
References 12 publications
0
135
0
1
Order By: Relevance
“…The success of this work can lead to the development of improved methods for identifying the precise location of translation initiation start sites, an area that is receiving greater computational research attention (Frishman et al, 1999;Tompa, 1999;Hannenhalli et al, 1999;Suzek et al, 2001;Walker et al, 2002;Zien et al, 2000;Yada et al, 2001;Besemer et al, 2001). Additionally, design of effective coding-based models for genetic regulatory systems can potentially help researchers determine how to incorporate deliberate, sequence-controlled regulation into engineered proteins.…”
Section: Resultsmentioning
confidence: 99%
“…The success of this work can lead to the development of improved methods for identifying the precise location of translation initiation start sites, an area that is receiving greater computational research attention (Frishman et al, 1999;Tompa, 1999;Hannenhalli et al, 1999;Suzek et al, 2001;Walker et al, 2002;Zien et al, 2000;Yada et al, 2001;Besemer et al, 2001). Additionally, design of effective coding-based models for genetic regulatory systems can potentially help researchers determine how to incorporate deliberate, sequence-controlled regulation into engineered proteins.…”
Section: Resultsmentioning
confidence: 99%
“…To determine the effects of SD sequence during protein translation, we calculated the free energy for base pairing of 16S rRNA with SD sequence for each gene. (Suzek et al 2001): This method used a ''seed'' sequence to train a probabilistic model of SD sequences, which was then used to find the SD sequences in regions upstream of start codons of all genes. A good seed sequence is the 39 end of the 16S rRNA (Suzek et al 2001).…”
Section: Methodsmentioning
confidence: 99%
“…(Suzek et al 2001): This method used a ''seed'' sequence to train a probabilistic model of SD sequences, which was then used to find the SD sequences in regions upstream of start codons of all genes. A good seed sequence is the 39 end of the 16S rRNA (Suzek et al 2001). Five copies of 16S rRNA genes have an identical 39 tail in D. vulgaris (notably the annotated sequences were incomplete and missing the sequence ctggatcacctccttt at the 39 end; however, this sequence does exist in all five copies of D. vulgaris 16S rRNA genes).…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The genome was searched against Rfam v.11 (Burge et al, 2013) using Infernal v.1.1 (Nawrocki and Eddy, 2013), and the resulting ncRNAs were manually integrated into the annotation following the INSDC conventions (http://www.insdc.org/files/feature_table.html). Ribosome-binding sites were predicted using RBSfinder (Suzek et al, 2001) and signal peptides were predicted using SignalP v.4.1 (Petersen et al, 2011). The resulting annotation was manually curated in Artemis (Rutherford et al, 2000).…”
Section: Westeberhardia Detection and Phylogenomic Analysesmentioning
confidence: 99%