Common SNPs in the chromosome 17q12-q21 region alter the risk for asthma, type 1 diabetes, primary biliary cirrhosis, and Crohn disease. Previous reports by us and others have linked the disease-associated genetic variants with changes in expression of GSDMB and ORMDL3 transcripts in human lymphoblastoid cell lines (LCLs). The variants also alter regulation of other transcripts, and this domain-wide cis-regulatory effect suggests a mechanism involving long-range chromatin interactions. Here, we further dissect the disease-linked haplotype and identify putative causal DNA variants via a combination of genetic and functional analyses. First, high-throughput resequencing of the region and genotyping of potential candidate variants were performed. Next, additional mapping of allelic expression differences in Yoruba HapMap LCLs allowed us to fine-map the basis of the cis-regulatory differences to a handful of candidate functional variants. Functional assays identified allele-specific differences in nucleosome distribution, an allele-specific association with the insulator protein CTCF, as well as a weak promoter activity for rs12936231. Overall, this study shows a common disease allele linked to changes in CTCF binding and nucleosome occupancy leading to altered domain-wide cis-regulation. Finally, a strong association between asthma and cis-regulatory haplotypes was observed in three independent family-based cohorts (p = 1.78 x 10(-8)). This study demonstrates the requirement of multiple parallel allele-specific tools for the investigation of noncoding disease variants and functional fine-mapping of human disease-associated haplotypes.
Cis-acting variants altering gene expression are a source of phenotypic differences. The cis-acting components of expression variation can be identified through the mapping of differences in allelic expression (AE), which is the measure of relative expression between two allelic transcripts. We generated a map of AE associated SNPs using quantitative measurements of AE on Illumina Human1M BeadChips. In 53 lymphoblastoid cell lines derived from donors of European descent, we identified common cis variants affecting 30% (2935/9751) of the measured RefSeq transcripts at 0.001 permutation significance. The pervasive influence of cis-regulatory variants, which explain 50% of population variation in AE, extend to full-length transcripts and their isoforms as well as to unannotated transcripts. These strong effects facilitate fine mapping of cis-regulatory SNPs, as demonstrated by dissection of heritable control of transcripts in the systemic lupus erythematosus-associated C8orf13-BLK region in chromosome 8. The dense collection of associations will facilitate large-scale isolation of cis-regulatory SNPs.
Plant O-methyltransferases (OMTs) constitute a large family of enzymes that methylate the oxygen atom of a variety of secondary metabolites including phenylpropanoids, flavonoids, and alkaloids. O-Methylation plays a key role in lignin biosynthesis, stress tolerance, and disease resistance in plants. To gain insights into the evolution of the extraordinary diversity of plant O-methyltransferases, and to develop a framework phylogenetic tree for improved prediction of the putative function of newly identified OMT-like gene sequences, we performed a comparative and phylogenetic analysis of 61 biochemically characterized plant OMT protein sequences. The resulting phylogenetic tree revealed two major groups. One of the groups included two sister clades, one comprising the caffeoyl CoA OMTs (CCoA OMTs) that methylate phenolic hydroxyl groups of hydroxycinnamoyl CoA esters, and the other containing the carboxylic acid OMTs that methylate aliphatic carboxyl groups. The other group comprised the remaining OMTs, which act on a diverse group of metabolites including hydroxycinnamic acids, flavonoids, and alkaloids. The results suggest that some OMTs may have undergone convergent evolution, while others show divergent evolution. The high number of unique conserved regions within the CCoA OMTs and carboxylic acid OMTs provide an opportunity to design oligonucleotide primers to selectively amplify and characterize similar OMT genes from many plant species.
With our increasing ability for generating whole-genome sequences, comparative analysis of whole genomes has become a powerful tool for understanding the structure, function, and evolutionary history of human and other vertebrate genomes. By virtue of their position basal to bony vertebrates, cartilaginous fishes (class Chondrichthyes) are a valuable outgroup in comparative studies of vertebrates. Recently, a holocephalan cartilaginous fish, the elephant shark, Callorhinchus milii (Subclass Holocephali: Order Chimaeriformes), has been proposed as a model genome, and low-coverage sequence of its genome has been generated. Despite such an increasing interest, the evolutionary history of the modern holocephalans-a previously successful and diverse group but represented by only 39 extant species-and their relationship with elasmobranchs and other jawed vertebrates has been poorly documented largely owing to a lack of well-preserved fossil materials after the end-Permian about 250 Ma. In this study, we assembled the whole mitogenome sequences for eight representatives from all the three families of the modern holocephalans and investigated their phylogenetic relationships and evolutionary history. Unambiguously aligned sequences from these holocephalans together with 17 other vertebrates (9,409 nt positions excluding entire third codon positions) were subjected to partitioned maximum likelihood analysis. The resulting tree strongly supported a single origin of the modern holocephalans and their sister-group relationship with elasmobranchs. The mitogenomic tree recovered the most basal callorhinchids within the chimaeriforms, which is sister to a clade comprising the remaining two families (rhinochimaerids and chimaerids). The timetree derived from a relaxed molecular clock Bayesian method suggests that the holocephalans originated in the Silurian about 420 Ma, having survived from the end-Permian (250 Ma) mass extinction and undergoing familial diversifications during the late Jurassic to early Cretaceous (170-120 Ma). This postulated evolutionary scenario agrees well with that based on the paleontological observations.
Regulatory cis-acting variants account for a large proportion of gene expression variability in populations. Cis-acting differences can be specifically measured by comparing relative levels of allelic transcripts within a sample. Allelic expression (AE) mapping for cis-regulatory variant discovery has been hindered by the requirements of having informative or heterozygous single nucleotide polymorphisms (SNPs) within genes in order to assign the allelic origin of each transcript. In this study we have developed an approach to systematically screen for heritable cis-variants in common human haplotypes across >1000 genes. In order to achieve the highest level of information per haplotype studied, we carried out allelic expression measurements by using both intronic and exonic SNPs in primary transcripts. We used a novel RNA pooling strategy in immortalized lymphoblastoid cell lines (LCLs) and primary human osteoblast cell lines (HObs) to allow for high-throughput AE. Screening hits from RNA pools were further validated by performing allelic expression mapping in individual samples. Our results indicate that >10% of expressed genes in human LCLs show genotype-linked AE. In addition, we have validated cis-acting variants in over 20 genes linked with common disease susceptibility in recent genome-wide studies. More generally, our results indicate that RNA pooling coupled with AE read-out by second generation sequencing or by other methods provides a high-throughput tool for cataloging the impact of common noncoding variants in the human genome.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.