2013
DOI: 10.1186/1471-2105-14-155
|View full text |Cite
|
Sign up to set email alerts
|

Greedy feature selection for glycan chromatography data with the generalized Dirichlet distribution

Abstract: BackgroundGlycoproteins are involved in a diverse range of biochemical and biological processes. Changes in protein glycosylation are believed to occur in many diseases, particularly during cancer initiation and progression. The identification of biomarkers for human disease states is becoming increasingly important, as early detection is key to improving survival and recovery rates. To this end, the serum glycome has been proposed as a potential source of biomarkers for different types of cancers.High-through… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 41 publications
0
5
0
Order By: Relevance
“…It is often claimed that glycans are by their nature compositions, and that percentage of glycan species in the whole is biologically relevant information. 28 To dispute such claims is not of our interest; our intention is to increase the awareness of spurious correlations caused by row-wise normalization. In this work, we did not try to disentangle the true correlation structure in view of biochemical pathways, but rather demonstrated that choosing one normalization method can cause several potential issues and problems for downstream analysis.…”
Section: Discussionmentioning
confidence: 99%
“…It is often claimed that glycans are by their nature compositions, and that percentage of glycan species in the whole is biologically relevant information. 28 To dispute such claims is not of our interest; our intention is to increase the awareness of spurious correlations caused by row-wise normalization. In this work, we did not try to disentangle the true correlation structure in view of biochemical pathways, but rather demonstrated that choosing one normalization method can cause several potential issues and problems for downstream analysis.…”
Section: Discussionmentioning
confidence: 99%
“…A similar variable selection strategy to the above was used in Raftery and Dean [34] and in Galligan et al [10] to select variables for inclusion in clustering and classification models respectively. Results across the 200 bootstrap samples were compiled.…”
Section: Methodsmentioning
confidence: 99%
“…These derived traits average glycosylation features such as branching, galactosylation, and sialylation across different individual glycan structures, and consequently, they may be more closely related to individual enzymatic activity. For the original traits, CLR transformation from the "compositions" R package (van den Boogaart and Tolosana-Delgado, 2008) was implemented to account for the compositional nature of the data (Galligan et al, 2013). For the derived traits, different approaches of compositional transformations were used depending on the type of the features (Supplementary Table 6).…”
Section: Normalization Batch Correction Of Glycan Peaks and Derived T...mentioning
confidence: 99%