Background: Knowledge of subcellular localization of proteins is crucial to proteomics, drug target discovery and systems biology since localization and biological function are highly correlated. In recent years, numerous computational prediction methods have been developed. Nevertheless, there is still a need for prediction methods that show more robustness and higher accuracy.
COPD is associated with a relevant burden of disease and a high mortality worldwide. Only recently, the importance of comorbidities of COPD has been recognized. Studies postulated an association with inflammatory conditions potentially sharing pathogenic pathways and worsening overall prognosis. More evidence is required to estimate the role of comorbidities of COPD. Our aim was to investigate the prevalence and clustering of comorbidities associated with COPD, and to estimate their impact on clinically relevant outcomes. In this population-based case-control study, a nation-wide database provided by the Swiss Federal Office for Statistics enclosing every hospital entry covering the years 2002–2010 (n = 12′888′075) was analyzed using MySQL and R statistical software. Statistical methods included non-parametric hypothesis testing by means of Fisher’s exact test and Wilcoxon rank sum test, as well as linear models with generalized estimating equation to account for intra-patient variability. Exploratory multivariate approaches were also used for the identification of clusters of comorbidities in COPD patients. In 2.6% (6.3% in patients aged >70 years) of all hospitalization cases an active diagnosis of COPD was recorded. In 21% of these cases, COPD was the main reason for hospitalization. Patients with a diagnosis of COPD had more comorbidities (7 [IQR 4–9] vs. 3 [IQR 1–6]; ), were more frequently rehospitalized (annual hospitalization rate 0.33 [IQR 0.20–0.67] vs. 0.25 [IQR 0.14–0.43]/year; ), had a longer hospital stay (9 [IQR 4–15] vs. 5 [IQR 2–11] days; ), and had higher in-hospital mortality (5.9% [95% CI 5.8%–5.9%] vs. 3.4% [95% CI 3.3%–3.5%]; ) compared to matched controls. A set of comorbidities was associated with worse outcome. We could identify COPD-related clusters of COPD-comorbidities.
Motivation: Knowing the localization of a protein within the cell helps elucidate its role in biological processes, its function and its potential as a drug target. Thus, subcellular localization prediction is an active research area. Numerous localization prediction systems are described in the literature; some focus on specific localizations or organisms, while others attempt to cover a wide range of localizations. Results: We introduce SherLoc, a new comprehensive system for predicting the localization of eukaryotic proteins. It integrates several types of sequence and text-based features. While applying the widely used support vector machines (SVMs), SherLoc's main novelty lies in the way in which it selects its text sources and features, and integrates those with sequence-based features. We test SherLoc on previously used datasets, as well as on a new set devised specifically to test its predictive power, and show that SherLoc consistently improves on previous reported results. We also report the results of applying SherLoc to a large set of yetunlocalized proteins.
SherLoc2 is a comprehensive high-accuracy subcellular localization prediction system. It is applicable to animal, fungal, and plant proteins and covers all main eukaryotic subcellular locations. SherLoc2 integrates several sequence-based features as well as text-based features. In addition, we incorporate phylogenetic profiles and Gene Ontology (GO) terms derived from the protein sequence to considerably improve the prediction performance. SherLoc2 achieves an overall classification accuracy of up to 93% in 5-fold cross-validation. A novel feature, DiaLoc, allows users to manually provide their current background knowledge by describing a protein in a short abstract which is then used to improve the prediction. SherLoc2 is available both as a free Web service and as a stand-alone version at http://www-bs.informatik.uni-tuebingen.de/Services/SherLoc2.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.