2012
DOI: 10.1186/1471-2105-13-55
|View full text |Cite
|
Sign up to set email alerts
|

CLIPS-1D: analysis of multiple sequence alignments to deduce for residue-positions a role in catalysis, ligand-binding, or protein structure

Abstract: BackgroundOne aim of the in silico characterization of proteins is to identify all residue-positions, which are crucial for function or structure. Several sequence-based algorithms exist, which predict functionally important sites. However, with respect to sequence information, many functionally and structurally important sites are hard to distinguish and consequently a large number of incorrectly predicted functional sites have to be expected. This is why we were interested to design a new classifier that dif… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2013
2013
2023
2023

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 15 publications
(17 citation statements)
references
References 57 publications
(74 reference statements)
0
17
0
Order By: Relevance
“…We assume a log-normal distribution because log transformed metabolomics data are usually approximately normally distributed (Karpievitch et al 2012). Further, using a log-normal distribution is consistent with typical analytical approaches to ‘omics data that log transform intensity values and then use a t -test, ANOVA, or linear regression which assume normally distributed data.…”
Section: Model Formulation and Methodologymentioning
confidence: 99%
See 1 more Smart Citation
“…We assume a log-normal distribution because log transformed metabolomics data are usually approximately normally distributed (Karpievitch et al 2012). Further, using a log-normal distribution is consistent with typical analytical approaches to ‘omics data that log transform intensity values and then use a t -test, ANOVA, or linear regression which assume normally distributed data.…”
Section: Model Formulation and Methodologymentioning
confidence: 99%
“…In mass spectrometry ‘omics studies, however, missing values can originate from detection limit censoring and hence are MNAR. Because most imputation techniques produce unbiased results only if the missing data are MCAR or missing at random, but not MNAR (Karpievitch et al 2012, Lee 2004), using the imputation methods developed for microarrary studies in mass spectrometry ‘omics studies could lead to biased results. Further, the choice of imputation method can substantially affect the results and interpretation of analyses of metabolomics data (Hrydziuszko and Viant 2012).…”
Section: Introductionmentioning
confidence: 99%
“…Given these results, we investigated whether functional residue prediction programs specifically designed to identify catalytic, ligand-binding and subtype-specific residues yield similar results. S2.1A–S2.1D Fig compares our analysis of the Gna1 subgroup to that of two such programs: FRpred [43] and CLIPS-1D [45]. This reveals that the structural bipartitioning of residues is unique to our hiMSA analysis, which therefore, at least in this case, is finding protein structural features that these other methods fail to identify.…”
Section: Applicationmentioning
confidence: 85%
“…Since the 3D structures are often unavailable, Capra and Singh [34] developed a window score for such predictions. The concrete shape of our scores takes pattern form Janda et al [45], who in turn refer to Fischer et al [33]. Our scores are convex combinations of the Jensen-Shannon terms associated with the residues belonging to the surrounding window w(k).…”
Section: Methodsmentioning
confidence: 99%