2003
DOI: 10.1002/pmic.200300512
|View full text |Cite
|
Sign up to set email alerts
|

Multiple approaches to data‐mining of proteomic data based on statistical and pattern classification methods

Abstract: The data-mining challenge presented is composed of two fundamental problems. Problem one is the separation of forty-one subjects into two classifications based on the data produced by the mass spectrometry of protein samples from each subject. Problem two is to find the specific differences between protein expression data of two sets of subjects. In each problem, one group of subjects has a disease, while the other group is nondiseased. Each problem was approached with the intent to introduce a new and potenti… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
8
0
1

Year Published

2004
2004
2008
2008

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 16 publications
(9 citation statements)
references
References 7 publications
0
8
0
1
Order By: Relevance
“…Identification of disease specific proteins could yield mechanistic information as well as potential diagnostic markers or drug targets. The importance in recognizing these points is highlighted in recent comparative analyses of profile spectra [25][26][27][28][29][30][31], whereby varied biostatisticians all reported an accuracy >90% in classifying the same profile spectra from either tumor or non-tumor samples. However, in identifying which ions were most significant in determining the classifications there was little to no agreement.…”
Section: Introductionmentioning
confidence: 99%
“…Identification of disease specific proteins could yield mechanistic information as well as potential diagnostic markers or drug targets. The importance in recognizing these points is highlighted in recent comparative analyses of profile spectra [25][26][27][28][29][30][31], whereby varied biostatisticians all reported an accuracy >90% in classifying the same profile spectra from either tumor or non-tumor samples. However, in identifying which ions were most significant in determining the classifications there was little to no agreement.…”
Section: Introductionmentioning
confidence: 99%
“…Distinguishing protein ions as disease-specific requires isolating those abundance changes symptomatic of disease from those changes influenced by experimental conditions. From this perspective, various studies have explored techniques for pre-processing spectra as a means of minimizing overall variance (7)(8)(9)(10)(11) prior to statistical analysis, but these procedures vary and do not completely eliminate inconsistent classification results (7,(11)(12)(13)(14)(15). Nevertheless, the existing body of work is strong evidence that biomarker discovery by tissue profiling is possible and may one day benefit the clinical diagnosis and treatment of disease.…”
mentioning
confidence: 99%
“…A well‐designed data pre‐processing and multivariate analysis step must be performed for the recognition of the proteins evaluated as significant in the classification, otherwise from the same data set different results can be obtained. For example, in many recent comparative studies concerning the same tumor/non‐tumor protein patterns,26–32 a prediction ability >90% was always obtained, but there was no agreement about the proteins involved in the discrimination.…”
Section: Resultsmentioning
confidence: 96%