2016
DOI: 10.1186/s12920-016-0228-z
|View full text |Cite
|
Sign up to set email alerts
|

Fuzzy-FishNET: a highly reproducible protein complex-based approach for feature selection in comparative proteomics

Abstract: BackgroundThe hypergeometric enrichment analysis approach typically fares poorly in feature-selection stability due to its upstream reliance on the t-test to generate differential protein lists before testing for enrichment on a protein complex, subnetwork or gene group.MethodsSwapping the t-test in favour of a fuzzy rank-based weight system similar to that used in network-based methods like Quantitative Proteomics Signature Profiling (QPSP), Fuzzy SubNets (FSNET) and paired FSNET (PFSNET) produces dramatic im… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
7

Relationship

4
3

Authors

Journals

citations
Cited by 9 publications
(10 citation statements)
references
References 47 publications
0
10
0
Order By: Relevance
“…Use of biological context (e.g., networks and complexes) increases biological signal over signal from other spurious correlations (e.g., batch) as only signal from same-complex members are summated. We already know that use of protein complexes increases power, and we believe the signal amplification is phenotypically relevant [ 35 ]. Previous tests have already demonstrated that complex-based features are specifically predictive for phenotype classes and that false-positive rates are low.…”
Section: Resultsmentioning
confidence: 99%
“…Use of biological context (e.g., networks and complexes) increases biological signal over signal from other spurious correlations (e.g., batch) as only signal from same-complex members are summated. We already know that use of protein complexes increases power, and we believe the signal amplification is phenotypically relevant [ 35 ]. Previous tests have already demonstrated that complex-based features are specifically predictive for phenotype classes and that false-positive rates are low.…”
Section: Resultsmentioning
confidence: 99%
“…31 ) Over-Representation Analysis (ORA) is a two-stage procedure: univariate feature selection on proteins followed by enrichment test. ORA can be highly unstable as it is very sensitive to the test type and stringency conditions in the univariate feature-selection step 30,41 as well as the appropriateness of the null hypothesis of the enrichment test. 42 As mitigation, Direct Group (DG) analysis does away with the univariate feature-selection step and directly determine whether a complex is differential by comparing the distribution of constituent protein expression between phenotype classes against that of proteins outside the complex.…”
Section: Complex-based Feature-selection Methodsmentioning
confidence: 99%
“…To incorporate this element, a standardized Euclidean distance (between 0 and 1) is determined for all significant protein pairs across all samples. The significant proteins are reordered based on similarity (i.e., 1 – Euclidean distance) . This reordered protein list is then split at regular intervals to generate 100 complexes of size 7, and one complex of size 10 (710 significant proteins; see Section above).…”
Section: Methodsmentioning
confidence: 99%