2015
DOI: 10.1101/020867
| View full text |Cite
Preprint
|
Sign up to set email alerts
|

Abstract: (176 words)Proteomics is poised to play critical roles in clinical research. However, due to limited coverage and high noise, integration with powerful analysis algorithms is necessary. In particular, network-based algorithms can improve selection of reproducible features in spite of incomplete proteome coverage, technical inconsistency or high inter-sample variability. We define analytical reliability on three benchmarks ---precision/recall rates, feature-selection stability and crossvalidation accuracy. Usin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2015
2015
2016
2016

Publication Types

Select...
1
1

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(5 citation statements)
references
References 28 publications
(23 reference statements)
0
5
0
Order By: Relevance
“…In the second concern, as mentioned, while the most powerful RBNAs (PFSNET and PPFSNET) exhibit very high feature-selection stability, as well as precision-recall rates (14), it should be noted they also report a relatively large number of features as well. Significant features selected by PFSNET and PPFSNET have a rather flat p-value distribution i.e., many of the features had p-values at 0 or close to 0 (Supplementary Data 1) thus making it difficult to prioritize which features to test and validate experimentally.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…In the second concern, as mentioned, while the most powerful RBNAs (PFSNET and PPFSNET) exhibit very high feature-selection stability, as well as precision-recall rates (14), it should be noted they also report a relatively large number of features as well. Significant features selected by PFSNET and PPFSNET have a rather flat p-value distribution i.e., many of the features had p-values at 0 or close to 0 (Supplementary Data 1) thus making it difficult to prioritize which features to test and validate experimentally.…”
Section: Resultsmentioning
confidence: 99%
“…The existing suite of RBNAs include SubNetworks (SNET), Fuzzy SNet (FSNET), Paired FSNet (PFSNET) and class-Paired PFSNet (PPFSNET). PFSNET and PPFSNET were the two best techniques, and performed very well on all performance benchmarks (14). However, there are two limitations worth investigating further --- 1/ feature selection based on the modified t-statistic and t-distribution may not be a valid assumption and 2/ PFSNET and PPFSNET tend to be extremely sensitive, making a fairly large number of predictions for which the p-value distribution is relatively flat (many of these are 0 or close to 0), this makes it difficult to prioritize which subnets to test and validate first.…”
Section: Introductionmentioning
confidence: 95%
See 1 more Smart Citation
“…Going further, Boyanova et al showed that network-based categorization of proteomics data identifies network modules that correspond to cell-type origin [14]. Goh et al showed that networks could reproducibly (and robustly) select feature sets with strong class-discrimination power [15] (Box 2). Furthermore, network mining, based on protein complexes, robustly recovered otherwise-undiscovered proteins [9,16].…”
Section: Trendsmentioning
confidence: 99%
“…Instead of individual proteins, analysis at the level of networks can boost confidence levels (Box 3). Statistical evaluation, not only on significance, but also on reproducibilityfor example incorporating feature-selection stability test [3,15] or irreproducible discovery rate (IDR) [32] should become common practice.…”
Section: Focus On Quantitative Usesmentioning
confidence: 99%