2009
DOI: 10.1186/1471-2105-10-s6-s2
|View full text |Cite
|
Sign up to set email alerts
|

Statistical assessment of discriminative features for protein-coding and non coding cross-species conserved sequence elements

Abstract: Background: The identification of protein coding elements in sets of mammalian conserved elements is one of the major challenges in the current molecular biology research. Many features have been proposed for automatically distinguishing coding and non coding conserved sequences, making so necessary a systematic statistical assessment of their differences. A comprehensive study should be composed of an association study, i.e. a comparison of the distributions of the features in the two classes, and a predictio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2009
2009
2014
2014

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(7 citation statements)
references
References 38 publications
0
7
0
Order By: Relevance
“…We give an example. Many single sequence techniques for measuring the coding potential of DNA sequences are based upon frequencies of n-mers in known intronic and exonic regions [2,10,11,23]. We can use this principle to write down parameters v exon which are based simply on the frequency of codons in the exon sequences.…”
Section: Cds Density Estimation On Musmentioning
confidence: 99%
See 1 more Smart Citation
“…We give an example. Many single sequence techniques for measuring the coding potential of DNA sequences are based upon frequencies of n-mers in known intronic and exonic regions [2,10,11,23]. We can use this principle to write down parameters v exon which are based simply on the frequency of codons in the exon sequences.…”
Section: Cds Density Estimation On Musmentioning
confidence: 99%
“…The development of robust techniques that detect the coding potential of short sequences is an important area of research [11,14,16,21,30,31,40,46] with applications to sequence annotation as well as gene prediction. We show that our equilibrium measure is reasonably effective in distinguishing between randomly selected introns and exons of length 750bp in the human genome.…”
Section: Introductionmentioning
confidence: 99%
“…Information flow in the biological network has been used in previous studies for predicting protein functions, prioritizing candidate disease genes, and finding network centralities (Nabieva et al, 2005; Newman, 2005; Tu et al, 2006; Stojmirovic and Yu, 2007; Köhler et al, 2008; Suthram et al, 2008; Zotenko et al, 2008; Lee et al, 2009b; Missiuro et al, 2009; Yeger-Lotem et al, 2009; Vanunu and Sharan, 2010). In particular, a flow based approach can be used to augment network information to eQTL analysis, helping identify causal genes in genomic regions and understand the propagation of information signals from causal genes to their target genes.…”
Section: From Genotype To Phenotypementioning
confidence: 99%
“…Comparative genomics is a central step in many sequence analysis studies and the annotation of whole genomes through the identification of coding and regulatory regions is one of the major challenges in the current research in molecular biology. Creanza et al [ 63 ] present a statistical assessment of discriminative features for protein-coding and non coding cross-species conserved sequence elements, comparing distributions of a set of comparative and non comparative features and evaluating the prediction accuracy of classifiers trained for discriminating sequence elements conserved among human, mouse and rat species.…”
Section: An Overview Of Proceeding Contentsmentioning
confidence: 99%