This work describes the setup of a shared platform among the laboratories of the Alleanza Contro il Cancro (ACC) Italian Research Network for the identification of fusion transcripts in sarcomas by using Next Generation Sequencing (NGS). Different NGS approaches, including anchored multiplex PCR and hybrid capture-based panels, were employed to profile a large set of sarcomas of different histotypes. The analysis confirmed the reliability of NGS RNA-based approaches in detecting sarcoma-specific rearrangements. Overall, the anchored multiplex PCR assay proved to be a fast and easy-to-analyze approach for routine diagnostics laboratories.
The Bologna ENZyme Web Server (BENZ WS) annotates four-level Enzyme Commission numbers (EC numbers) as defined by the International Union of Biochemistry and Molecular Biology (IUBMB). BENZ WS filters a target sequence with a combined system of Hidden Markov Models, modelling protein sequences annotated with the same molecular function, and Pfams, carrying along conserved protein domains. BENZ returns, when successful, for any enzyme target sequence an associated four-level EC number. Our system can annotate both monofunctional and polyfunctional enzymes, and it can be a valuable resource for sequence functional annotation.
Enzymes are key proteins performing the basic functional activities in cells. In humans, enzymes can be also responsible for diseases, and the molecular mechanisms underlying the genotype to phenotype relationship are under investigation for diagnosis and medical care. Here, we focus on highlighting enzymes that are active in different metabolic pathways and become relevant hubs in protein interaction networks. We perform a statistics to derive our present knowledge on human metabolic pathways (the Kyoto Encyclopaedia of Genes and Genomes (KEGG)), and we found that activity aldehyde dehydrogenase (NAD(+)), described by Enzyme Commission number EC 1.2.1.3, and activity acetyl-CoA C-acetyltransferase (EC 2.3.1.9) are the ones most frequently involved. By associating functional activities (EC numbers) to enzyme proteins, we found the proteins most frequently involved in metabolic pathways. With our analysis, we found that these proteins are endowed with the highest numbers of interaction partners when compared to all the enzymes in the pathways and with the highest numbers of predicted interaction sites. As specific enzyme protein test cases, we focus on Alpha-Aminoadipic Semialdehyde Dehydrogenase (ALDH7A1, EC 2.3.1.9) and Acetyl-CoA acetyltransferase, cytosolic and mitochondrial (gene products of ACAT2 and ACAT1, respectively; EC 2.3.1.9). With computational approaches we show that it is possible, by starting from the enzyme structure, to highlight clues of their multiple roles in different pathways and of putative mechanisms promoting the association of genes to disease.
According to databases such as OMIM, Humsavar, Clinvar and Monarch, 1494 human enzymes are presently associated to 2539 genetic diseases, 75% of which are rare (with an Orphanet code). The Mondo ontology initiative allows a standardization of the disease name into specific codes, making it possible a computational association between genes, variants, diseases, and their effects on biological processes. Here, we tackle the problem of which biological processes enzymes can affect when the protein variant is disease-associated. We adopt Reactome to describe human biological processes, and by mapping disease-associated enzymes in the Reactome pathways, we establish a Reactome-disease association. This allows a novel categorization of human monogenic and polygenic diseases based on Reactome pathways and reactions. Our analysis aims at dissecting the complexity of the human genetic disease universe, highlighting all the possible links within diseases and Reactome pathways. The novel mapping helps understanding the biochemical/molecular biology of the disease and allows a direct glimpse on the present knowledge of other molecules involved. This is useful for a complete overview of the disease molecular mechanism/s and for planning future investigations. Data are collected in DAR, a database that is free for search and available at https://dar.biocomp.unibo.it.
e23552 Background: Epithelioid sarcoma (ES) is an ultra-rare sarcoma with distinctive pathologic and clinical features, marked by the loss of the expression of the SWI/SNF chromatin remodeling complex subunit SMARCB1. The current WHO classification recognizes two ES subtypes, different by morphology, clinical behavior and outcome: the “classic-type” and “proximal-type” ES. This study is aimed to better understand the molecular grounds sustaining this difference, and to identify new potential treatment targets to personalize therapy in this rare disease. Methods: RNA sequencing profiling was conducted on FFPE samples. Functional annotation enrichment was evaluated through over representation and gene set enrichment analyses (GSEA). Inference of immune contexture was obtained through deconvolution and single sample GSEA (ssGSEA) approaches. Results: Twelve samples from 5 proximal-type ES and 7 classic-type ES were profiled, 10 naïve and 2 pre-treated with chemotherapy. Proximal variant samples featured an overepresentation of MYC activity signatures and of other signatures impacting on cell cycle, protein synthesis and chromatin metabolism. Pathways enriched in the classic variant included NOTCH/HEDGEHOG and immune system regulation (e.g. inflammatory, interferon alpha and interferon gamma). Accordingly, deconvolution and ssGSEA approaches predicted an increased immune infiltration in classic subtype samples, essentially involving T cells, as well as increased expression of HLA class I molecules. Conclusions: Different regulatory networks seem to contribute to the different biologic and clinical behavior of proximal and classic ES. These preliminary data suggest a potential greater sensitivity of proximal type ES to drugs targeting the cell cycle, whilst immune checkpoint inhibitors might have some activity in classic variant ES. Further studies are ongoing to validate these preliminary observations.
MTHFR deficiency still deserves an investigation to associate the phenotype to protein structure variations. To this aim, considering the MTHFR wild type protein structure, with a catalytic and a regulatory domain and taking advantage of state-of-the-art computational tools, we explore the properties of 72 missense variations known to be disease associated. By computing the thermodynamic ΔΔG change according to a consensus method that we recently introduced, we find that 61% of the disease-related variations destabilize the protein, are present both in the catalytic and regulatory domain and correspond to known biochemical deficiencies. The propensity of solvent accessible residues to be involved in protein-protein interaction sites indicates that most of the interacting residues are located in the regulatory domain, and that only three of them, located at the interface of the functional protein homodimer, are both disease-related and destabilizing. Finally, we compute the protein architecture with Hidden Markov Models, one from Pfam for the catalytic domain and the second computed in house for the regulatory domain. We show that patterns of disease-associated, physicochemical variation types, both in the catalytic and regulatory domains, are unique for the MTHFR deficiency when mapped into the protein architecture.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.