The incomplete identification of structural variants (SVs) from whole-genome sequencing data limits studies of human genetic diversity and disease association. Here, we apply a suite of long-read, short-read, and strand-specific sequencing technologies, optical mapping, and variant discovery algorithms to comprehensively analyze three trios and define the full spectrum of human genetic variation in a haplotype-resolved manner. We identify 818,054 indel variants (<50 bp) and 27,622 SVs (≥50 bp) per genome. We also discover 156 inversions per genome, 58 of which intersect with the critical regions of recurrent microdeletion and microduplication syndromes. Taken together, our SV callsets represent a three- to sevenfold increase in SV detection compared to most standard high-throughput sequencing studies, including those from the 1000 Genomes Project. The methods and the dataset presented serve as a gold standard for the scientific community, allowing us to make recommendations for maximizing structural variation sensitivity in future genome sequencing studies.
MicroRNAs (miRNAs) are noncoding RNAs of 18–26 nucleotides; they pair with target mRNAs to regulate gene expression and produce significant changes in various physiological and pathological processes. In recent years, the interaction between miRNAs and their target genes has become one of the mainstream directions for drug development. As a large-scale biological database that mainly provides miRNA–target interactions (MTIs) verified by biological experiments, miRTarBase has undergone five revisions and enhancements. The database has accumulated >2 200 449 verified MTIs from 13 389 manually curated articles and CLIP-seq data. In this update, an optimized scoring system is adopted to improve the recognition of MTI-related articles and the corresponding disease information. In addition, single-nucleotide polymorphisms and disease-related variants that affect the binding efficiency between miRNA and target were characterized in miRNAs and gene 3′ untranslated regions. miRNA expression profiles across extracellular vesicles, blood and different tissues, including exosomal miRNAs and tissue-specific miRNAs, were integrated to explore miRNA functions and biomarkers. For the user interface, we have classified attributes, including RNA expression, specific interaction, protein expression and biological function, for the various validation experiments related to the role of miRNA. We also used seed sequence information to evaluate the binding sites of miRNA. In summary, these enhancements make miRTarBase one of the most research-friendly MTI databases, containing comprehensive and experimentally verified annotations. The newly updated version of miRTarBase is now available at https://miRTarBase.cuhk.edu.cn/.
Many ion channel genes have been associated with human genetic pain disorders. Here we report two large Chinese families with autosomal-dominant episodic pain. After excluding mutations in three known genes (SCN9A, SCN10A, and TRPA1) that cause pain syndromes similar to the one observed here, we performed a genome-wide linkage scan with microsatellite markers and mapped the genetic locus to a 7.81 Mb region on chromosome 3p22.3-p21.32. By using whole-exome sequencing followed by conventional Sanger sequencing, we identified two missense mutations in the gene encoding voltage-gated sodium channel Nav1.9 (SCN11A): c.673C>T (p.Arg225Cys) and c.2423C>G (p.Ala808Gly), one in each family. Each mutation showed perfect cosegregation with the pain phenotype in the corresponding family, and neither was detected in 1,021 normal individuals. Both missense mutations were predicted to change a highly conserved amino acid residue of the human Nav1.9 channel. We expressed the two SCN11A mutants in mouse dorsal root ganglion (DRG) neurons and showed that both mutations enhanced the channel's electrical activity and induced hyperexcitability of DRG neurons. Taken together, our results suggest that gain-of-function mutations in SCN11A can cause an autosomal-dominant episodic pain disorder.
Background: Bread wheat is one of the most important and broadly studied crops. However, due to the complexity of its genome and the incomplete genome collection of wild populations, the bread wheat genome landscape and domestication history remain elusive. Results: By investigating whole-genome resequencing data of 93 accessions from worldwide populations of bread wheat and its diploid and tetraploid progenitors, together with 90 published exome-capture datasets, we find that the B subgenome has more variations, including SNPs and deletions, than the A and D subgenomes. Population genetics analyses support a monophyletic origin of domesticated wheat from wild emmer in northern Levant, with substantial introgressed genomic fragments from southern Levant. Southern Levant contributes more than 676 Mb to the AB subgenomes, enriched in the pericentromeric regions. The AB subgenome introgression happened at the early stage of wheat speciation and partially contributes to the greater genetic diversity of these subgenomes. Furthermore, we detect massive alien introgressions that originated from distant species through natural and artificial hybridizations, resulting in the reintroduction of ~709 Mb and ~1577 Mb of sequence into bread wheat landraces and varieties, respectively. A large fraction of these intra- and inter-introgression fragments are associated with quantitative trait loci of important traits, and selection events are also identified. Conclusion: We reveal the significance of multiple introgressions from distant wild populations and alien species in shaping the genetic components of bread wheat, and provide important resources and new perspectives for future wheat breeding. Electronic supplementary material: The online version of this article (10.1186/s13059-019-1744-x) contains supplementary material, which is available to authorized users.
Lineage-specific epigenomic changes during human corticogenesis have remained elusive due to challenges with sample availability and tissue heterogeneity. For example, previous studies used single-cell RNA sequencing to identify at least nine major cell types and up to 26 distinct subtypes in the dorsal cortex alone [1, 2]. Here, we characterize cell type-specific cis-regulatory chromatin interactions, open chromatin peaks, and transcriptomes for radial glia, intermediate progenitor cells, excitatory neurons, and interneurons isolated from mid-gestational human cortex samples. We show that chromatin interactions underlie multiple aspects of gene regulation, with transposable elements and disease-associated variants enriched at distal interacting regions in a cell type-specific manner. In addition, promoters with significantly increased levels of chromatin interactivity, termed super interactive promoters, are enriched for lineage-specific genes, suggesting that interactions at these loci contribute to the fine-tuning of transcription. Finally, we develop CRISPRview, a novel technique integrating immunostaining, CRISPRi, RNAscope, and image analysis for validating cell type-specific cis-regulatory elements in heterogeneous populations of primary cells. Our study presents the first cell type-specific characterization of 3D epigenomes in the developing human cortex, advancing our understanding of gene regulation and lineage specification during this critical developmental window.