Glycosylation is a highly complex modification influencing the functions and activities of proteins. Interpretation of intact glycopeptide spectra is crucial but challenging. In this paper, we present a mass spectrometry-based automated glycopeptide identification platform (MAGIC) to identify peptide sequences and glycan compositions directly from intact N-linked glycopeptide collision-induced-dissociation spectra. The identification of the Y1 (peptideY0 + GlcNAc) ion is critical for the correct analysis of unknown glycoproteins, especially without prior knowledge of the proteins and glycans present in the sample. To ensure accurate Y1-ion assignment, we propose a novel algorithm called Trident that detects a triplet pattern corresponding to [Y0, Y1, Y2] or [Y0-NH3, Y0, Y1] from the fragmentation of the common trimannosyl core of N-linked glycopeptides. To facilitate the subsequent peptide sequence identification by common database search engines, MAGIC generates in silico spectra by overwriting the original precursor with the naked peptide m/z and removing all of the glycan-related ions. Finally, MAGIC computes the glycan compositions and ranks them. For the model glycoprotein horseradish peroxidase (HRP) and a 5-glycoprotein mixture, a 2- to 31-fold increase in the relative intensities of the peptide fragments was achieved, which led to the identification of 7 tryptic glycopeptides from HRP and 16 glycopeptides from the mixture via Mascot. In the HeLa cell proteome data set, MAGIC processed over a thousand MS(2) spectra in 3 min on a PC and reported 36 glycopeptides from 26 glycoproteins. Finally, a remarkable false discovery rate of 0 was achieved on the N-glycosylation-free Escherichia coli data set. MAGIC is available at http://ms.iis.sinica.edu.tw/COmics/Software_MAGIC.html .
Nature determines the complexity of disease etiology and the likelihood of revealing disease genes. While culprit genes for many monogenic diseases have been successfully unraveled, efforts to map major complex disease genes have not been as productive as hoped. The conceptual framework currently adopted to deal with the heterogeneous nature of complex diseases focuses on using homogeneous internal features of the disease phenotype for mapping. However, phenotypic homogeneity does not equal genotypic homogeneity. In this report, we advocate working with well-measured phenotypes portrayed by amounts of transcripts and activities of gene products or their metabolites, which are pertinent to relatively small pathway clusters. Reliable and controlled measures for oligogenic traits resulting from proper dissection efforts may enhance statistical power. The large amounts of information obtained on gene and protein expression from technological advances can add to the power of gene finding, particularly for diseases with unclear etiology. Data-mining tools for dimension reduction can assist biologists to reveal novel molecular endophenotypes. However, there are still hurdles to overcome, including high cost, relatively poor reproducibility and comparability among platforms, the cross-sectional nature of the information, and the accessibility of human tissues. Concerted efforts are required to carry out large-scale prospective studies that are integrated at the levels of phenotype characterization, high throughput experimental techniques, data analyses, and beyond.
Several lines of evidence indicate that the adaxial leaf domain possesses a unique competence to form shoot apical meristems. Factors required for this competence are expected to cause a defect in shoot apical meristem formation when inactivated and to be expressed or active preferentially in the adaxial leaf domain. PINHEAD, a member of a family of proteins that includes the translation factor eIF2C, is required for reliable formation of primary and axillary shoot apical meristems. In addition to high-level expression in the vasculature, we find that low-level PINHEAD expression defines a novel domain of positional identity in the plant. This domain consists of adaxial leaf primordia and the meristem. These findings suggest that the PINHEAD gene product may be a component of a hypothetical meristem forming competence factor. We also describe defects in floral organ number and shape, as well as aberrant embryo and ovule development associated with pinhead mutants, thus elaborating on the role of PINHEAD in Arabidopsis development. In addition, we find that embryos doubly mutant for PINHEAD and ARGONAUTE1, a related, ubiquitously expressed family member, fail to progress to bilateral symmetry and do not accumulate the SHOOT MERISTEMLESS protein. Therefore PINHEAD and ARGONAUTE1 together act to allow wild-type growth and gene expression patterns during embryogenesis.
Motivation: Identification of disease-related genes using high-throughput microarray data is more difficult for complex diseases as compared with monogenic ones. We hypothesized that an endophenotype derived from transcriptional data is associated with a set of genes corresponding to a pathway cluster. We assumed that a complex disease is associated with multiple endophenotypes and can be induced by their up/downregulated gene expression patterns. Thus, a neural network model was adopted to simulate the gene–endophenotype–disease relationship in which endophenotypes were represented by hidden nodes.Results: We successfully constructed a three-endophenotype model for Taiwanese hypertensive males with high identification accuracy. Of the three endophenotypes, one is strongly protective, another is weakly protective and the third is highly correlated with developing young-onset male hypertension. Sixteen of the involved 101 genes were highly and consistently influential to the endophenotypes. Identification of SLC4A5, SLC5A10 and LDOC1 indicated that sodium/bicarbonate transport, sodium/glucose transport and cell-proliferation regulation may play important upstream roles and identification of BNIP1, APOBEC3F and LDOC1 suggested that apoptosis, innate immune response and cell-proliferation regulation may play important downstream roles in hypertension. The involved genes not only provide insights into the mechanism of hypertension but should also be considered in future gene mapping endeavors.Availability: Microarray data and test program are available at http://ms.iis.sinica.edu.tw/microarray/index.htmContact: pan@ibms.sinica.edu.tw or hsu@iis.sinica.edu.twSupplementary information: Supplementary data are available at Bioinformatics online.
Non-alcoholic fatty liver disease (NAFLD) as a global health problem has clinical manifestations ranging from simple non-alcoholic fatty liver (NAFL) to non-alcoholic steatohepatitis (NASH), cirrhosis, and cancer. The role of different types of fatty acids in driving the early progression of NAFL to NASH is not understood. Lipid overload causing lipotoxicity and inflammation has been considered as an essential pathogenic factor. To correlate the lipid profiles with cellular lipotoxicity, we utilized palmitic acid (C16:0)- and especially unprecedented palmitoleic acid (C16:1)-induced lipid overload HepG2 cell models coupled with lipidomic technology involving labeling with stable isotopes. C16:0 induced inflammation and cell death, whereas C16:1 induced significant lipid droplet accumulation. Moreover, inhibition of de novo sphingolipid synthesis by myriocin (Myr) aggravated C16:0 induced lipoapoptosis. Lipid profiles are different in C16:0 and C16:1-treated cells. Stable isotope-labeled lipidomics elucidates the roles of specific fatty acids that affect lipid metabolism and cause lipotoxicity or lipid droplet formation. It indicates that not only saturation or monounsaturation of fatty acids plays a role in hepatic lipotoxicity but also Myr inhibition exasperates lipoapoptosis through ceramide in-direct pathway. Using the techniques presented in this study, we can potentially investigate the mechanism of lipid metabolism and the heterogeneous development of NAFLD.
BackgroundLack of power and reproducibility are caveats of genetic association studies of common complex diseases. Indeed, the heterogeneity of disease etiology demands that causal models consider the simultaneous involvement of multiple genes. Rothman’s sufficient-cause model, which is well known in epidemiology, provides a framework for such a concept. In the present work, we developed a three-stage algorithm to construct gene clusters resembling Rothman’s causal model for a complex disease, starting from finding influential gene pairs followed by grouping homogeneous pairs.ResultsThe algorithm was trained and tested on 2,772 hypertensives and 6,515 normotensives extracted from four large Caucasian and Taiwanese databases. The constructed clusters, each featured by a major gene interacting with many other genes and identified a distinct group of patients, reproduced in both ethnic populations and across three genotyping platforms. We present the 14 largest gene clusters which were capable of identifying 19.3% of hypertensives in all the datasets and 41.8% if one dataset was excluded for lack of phenotype information. Although a few normotensives were also identified by the gene clusters, they usually carried less risky combinatory genotypes (insufficient causes) than the hypertensive counterparts. After establishing a cut-off percentage for risky combinatory genotypes in each gene cluster, the 14 gene clusters achieved a classification accuracy of 82.8% for all datasets and 98.9% if the information-short dataset was excluded. Furthermore, not only 10 of the 14 major genes but also many other contributing genes in the clusters are associated with either hypertension or hypertension-related diseases or functions.ConclusionsWe have shown with the constructed gene clusters that a multi-causal pie-multi-component approach can indeed improve the reproducibility of genetic markers for complex disease. In addition, our novel findings including a major gene in each cluster and sufficient risky genotypes in a cluster for disease onset (which coincides with Rothman’s sufficient cause theory) may not only provide a new research direction for complex diseases but also help to reveal the disease etiology.
Tumour metabolomics and transcriptomics co-expression network as related to biological folate alteration and cancer malignancy remains unexplored in human non-small cell lung cancers (NSCLC). To probe the diagnostic biomarkers, tumour and pair lung tissue samples (n = 56) from 97 NSCLC patients were profiled for ultra-performance liquid chromatography tandem mass spectrometry (UPLC/MS/MS)-analysed metabolomics, targeted transcriptionomics, and clinical folate traits. Weighted Gene Co-expression Network Analysis (WGCNA) was performed. Tumour lactate was identified as the top VIP marker to predict advance NSCLC (AUC = 0.765, Sig = 0.017, CI 0.58–0.95). Low folate (LF)-tumours vs. adjacent lungs displayed higher glycolytic index of lactate and glutamine-associated amino acids in enriched biological pathways of amino sugar and glutathione metabolism specific to advance NSCLCs. WGCNA classified the green module for hub serine-navigated glutamine metabolites inversely associated with tumour and RBC folate, which module metabolites co-expressed with a predominant up-regulation of LF-responsive metabolic genes in glucose transport (GLUT1), de no serine synthesis (PHGDH, PSPH, and PSAT1), folate cycle (SHMT1/2 and PCFR), and down-regulation in glutaminolysis (SLC1A5, SLC7A5, GLS, and GLUD1). The LF-responsive WGCNA markers predicted poor survival rates in lung cancer patients, which could aid in optimizing folate intervention for better prognosis of NSCLCs susceptible to folate malnutrition.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.