Background: Stratification by eosinophil and neutrophil counts increases our understanding of asthma and helps target therapy, but there is room for improvement in our accuracy in prediction of treatment responses and a need for better understanding of the underlying mechanisms. Objective: We sought to identify molecular subphenotypes of asthma defined by proteomic signatures for improved stratification. Methods: Unbiased label-free quantitative mass spectrometry and topological data analysis were used to analyze the proteomes of sputum supernatants from 246 participants (206 asthmatic patients) as a novel means of asthma stratification. Microarray analysis of sputum cells provided transcriptomics data additionally to inform on underlying mechanisms. Results: Analysis of the sputum proteome resulted in 10 clusters (ie, proteotypes) based on similarity in proteomic features, representing discrete molecular subphenotypes of asthma. Overlaying granulocyte counts onto the 10 clusters as metadata further defined 3 of these as highly eosinophilic, 3 as highly neutrophilic, and 2 as highly atopic with relatively low granulocytic inflammation. For each of these 3 phenotypes, logistic regression analysis identified candidate protein biomarkers, and matched transcriptomic data pointed to differentially activated underlying mechanisms.
Our study indicates that oxidative stress condition before allergen exposure due to an inadequate antioxidant response may prime for allergic Th2 responses.
Analysis of induced sputum supernatant is a minimally invasive approach to study the epithelial lining fluid and, thereby, provide insight into normal lung biology and the pathobiology of lung diseases. We present here a novel proteomics approach to sputum analysis developed within the U-BIOPRED (unbiased biomarkers predictive of respiratory disease outcomes) international project. We present practical and analytical techniques to optimize the detection of robust biomarkers in proteomic studies. The normal sputum proteome was derived using data-independent HDMS applied to 40 healthy nonsmoking participants, which provides an essential baseline from which to compare modulation of protein expression in respiratory diseases. The "core" sputum proteome (proteins detected in ≥40% of participants) was composed of 284 proteins, and the extended proteome (proteins detected in ≥3 participants) contained 1666 proteins. Quality control procedures were developed to optimize the accuracy and consistency of measurement of sputum proteins and analyze the distribution of sputum proteins in the healthy population. The analysis showed that quantitation of proteins by HDMS is influenced by several factors, with some proteins being measured in all participants' samples and with low measurement variance between samples from the same patient. The measurement of some proteins is highly variable between repeat analyses, susceptible to sample processing effects, or difficult to accurately quantify by mass spectrometry. Other proteins show high interindividual variance. We also highlight that the sputum proteome of healthy individuals is related to sputum neutrophil levels, but not gender or allergic sensitization. We illustrate the importance of design and interpretation of disease biomarker studies considering such protein population and technical measurement variance.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.