We performed whole-genome sequencing (WGS) of 208 genomes from 53 families affected by simplex autism. For the majority of these families, no copy-number variant (CNV) or candidate de novo gene-disruptive single-nucleotide variant (SNV) had been detected by microarray or whole-exome sequencing (WES). We integrated multiple CNV and SNV analyses and extensive experimental validation to identify additional candidate mutations in eight families. We report that compared to control individuals, probands showed a significant (p = 0.03) enrichment of de novo and private disruptive mutations within fetal CNS DNase I hypersensitive sites (i.e., putative regulatory regions). This effect was only observed within 50 kb of genes that have been previously associated with autism risk, including genes where dosage sensitivity has already been established by recurrent disruptive de novo protein-coding mutations (ARID1B, SCN2A, NR3C2, PRKCA, and DSCAM). In addition, we provide evidence of gene-disruptive CNVs (in DISC1, WNT7A, RBFOX1, and MBD5), as well as smaller de novo CNVs and exon-specific SNVs missed by exome sequencing in neurodevelopmental genes (e.g., CANX, SAE1, and PIK3CA). Our results suggest that the detection of smaller, often multiple CNVs affecting putative regulatory elements might help explain additional risk of simplex autism.
Accurate prediction of algal biofuel yield will require empirical determination of physiological responses to the environment, particularly light and temperature. One strain of interest, Nannochloropsis salina, was subjected to ranges of light intensity (5-850 μmol m −2 s −1 ) and temperature (13-40 °C) and its exponential growth rate, total fatty acids (TFA) and fatty acid composition were measured. The maximum acclimated growth rate was 1.3 day −1 at 23 °C and 250 μmol m −2 s −1 . Fatty acids were detected by gas chromatography with flame ionization detection (GC-FID) after transesterification to corresponding fatty acid methyl esters (FAMEs). A sharp increase in TFA containing elevated palmitic acid (C16:0) and palmitoleic acid (C16:1) during exponential growth at high light was observed, indicating likely triacylglycerol accumulation due to photo-oxidative stress. Lower light resulted in increases in the relative abundance of unsaturated fatty acids; in thin cultures, increases were observed in palmitoleic and eicosapentaenoic acids (C20:5ω3). As cultures aged and the effective light intensity per cell converged to very low levels, fatty acid profiles became more similar and there was a notable increase of oleic acid (C18:1ω9). The amount of unsaturated fatty acids was inversely proportional to temperature, demonstrating physiological adaptations to increase membrane fluidity. These data will improve prediction of fatty acid characteristics and yields relevant to biofuel production. OPEN ACCESSEnergies 2012, 5 732
Genetic variation modulating risk of sporadic Parkinson disease (PD) has been primarily explored through genome-wide association studies (GWASs). However, like many other common genetic diseases, the impacted genes remain largely unknown. Here, we used single-cell RNA-seq to characterize dopaminergic (DA) neuron populations in the mouse brain at embryonic and early postnatal time points. These data facilitated unbiased identification of DA neuron subpopulations through their unique transcriptional profiles, including a postnatal neuroblast population and substantia nigra (SN) DA neurons. We use these population-specific data to develop a scoring system to prioritize candidate genes in all 49 GWAS intervals implicated in PD risk, including genes with known PD associations and many with extensive supporting literature. As proof of principle, we confirm that the nigrostriatal pathway is compromised in Cplx1-null mice. Ultimately, this systematic approach establishes biologically pertinent candidates and testable hypotheses for sporadic PD, informing a new era of PD genetic research.
The completion of a telomere-to-telomere human reference genome, T2T-CHM13, has resolved complex regions of the genome, including repetitive and homologous regions. Here, we present a high-resolution epigenetic study of previously unresolved sequences, representing entire acrocentric chromosome short arms, gene family expansions, and a diverse collection of repeat classes. This resource precisely maps CpG methylation (32.28 million CpGs), DNA accessibility, and short-read datasets (166,058 previously unresolved chromatin immunoprecipitation sequencing peaks) to provide evidence of activity across previously unidentified or corrected genes and reveals clinically relevant paralog-specific regulation. Probing CpG methylation across human centromeres from six diverse individuals generated an estimate of variability in kinetochore localization. This analysis provides a framework with which to investigate the most elusive regions of the human genome, granting insights into epigenetic regulation.
The progressive loss of midbrain (MB) dopaminergic (DA) neurons defines the motor features of Parkinson disease (PD), and modulation of risk by common variants in PD has been well established through genome-wide association studies (GWASs). We acquired open chromatin signatures of purified embryonic mouse MB DA neurons because we anticipated that a fraction of PD-associated genetic variation might mediate the variants' effects within this neuronal population. Correlation with >2,300 putative enhancers assayed in mice revealed enrichment for MB cis-regulatory elements (CREs), and these data were reinforced by transgenic analyses of six additional sequences in zebrafish and mice. One CRE, within intron 4 of the familial PD gene SNCA, directed reporter expression in catecholaminergic neurons from transgenic mice and zebrafish. Sequencing of this CRE in 986 individuals with PD and 992 controls revealed two common variants associated with elevated PD risk. To assess potential mechanisms of action, we screened >16,000 proteins for DNA binding capacity and identified a subset whose binding is impacted by these enhancer variants. Additional genotyping across the SNCA locus identified a single PD-associated haplotype, containing the minor alleles of both of the aforementioned PD-risk variants. Our work posits a model for how common variation at SNCA might modulate PD risk and highlights the value of cell-context-dependent guided searches for functional non-coding variation.
We recently demonstrated that the gene encoding the RNA binding motif protein 24 (RBM24) is expressed during mouse cardiogenesis, and determined the developmental requirement for its zebrafish homologs Rbm24a and Rbm24b during cardiac development. We demonstrate here that both Rbm24a and Rbm24b are also required for normal somite and craniofacial development. Diminution of rbm24a or rbm24b gene products by morpholino knockdown resulted in significant disruption of somite formation. Detailed in situ hybridization-based analyses of a spectrum of somitogenesis-associated transcripts revealed reduced expression of the cyclic muscle pattering genes dlc and dld encoding Notch ligands, as well as their respective target genes her7, her1. By contrast expression of the Notch receptors notch1a and notch3 appears unchanged. Some RBM-family members have been implicated in pre-mRNA processing. Analysis of affected Notch-pathway mRNAs in rbm24a and rbm24b morpholino-injected embryos revealed aberrant transcript fragments of dlc and dld, but not her1 or her7, suggesting the reduction in transcription levels of Notch pathway components may result from aberrant processing of its ligands. These data imply a previously unknown requirement for Rbm24a and Rbm24b in somite and craniofacial development. Although we anticipate the influence of disrupting RBM24 homologs likely extends beyond the Notch pathway, our results suggest their perturbation may directly, or indirectly, compromise post-transcriptional processing, exemplified by imprecise processing of dlc and dld.
Genome-wide association studies have implicated thousands of noncoding variants across common human phenotypes. However, they cannot directly inform the cellular context in which disease-associated variants act. Here, we use open chromatin profiles from discrete mouse cell populations to address this challenge. We applied stratified linkage disequilibrium score regression and evaluated heritability enrichment in 64 genome-wide association studies, emphasizing schizophrenia. We provide evidence that mouse-derived human open chromatin profiles can serve as powerful proxies for difficult to obtain human cell populations, facilitating the illumination of common disease heritability enrichment across an array of human phenotypes. We demonstrate that signatures from discrete subpopulations of cortical excitatory and inhibitory neurons are significantly enriched for schizophrenia heritability with maximal enrichment in cortical layer V excitatory neurons. We also show that differences between schizophrenia and bipolar disorder are concentrated in excitatory neurons in cortical layers II-III, IV, and V, as well as the dentate gyrus. Finally, we leverage these data to fine-map variants in 177 schizophrenia loci nominating variants in 104/177. We integrate these data with transcription factor binding site, chromatin interaction, and validated enhancer data, placing variants in the cellular context where they may modulate risk.
The human Y chromosome has been notoriously difficult to sequence and assemble because of its complex repeat structure including long palindromes, tandem repeats, and segmental duplications. As a result, more than half of the Y chromosome is missing from the GRCh38 reference sequence and it remains the last human chromosome to be finished. Here, the Telomere-to-Telomere (T2T) consortium presents the complete 62,460,029 base pair sequence of a human Y chromosome from the HG002 genome (T2T-Y) that corrects multiple errors in GRCh38-Y and adds over 30 million base pairs of sequence to the reference, revealing the complete ampliconic structures of TSPY, DAZ, and RBMY; 42 additional protein-coding genes, mostly from the TSPY gene family; and an alternating pattern of human satellite 1 and 3 blocks in the heterochromatic Yq12 region. We have combined T2T-Y with a prior assembly of the CHM13 genome and mapped available population variation, clinical variants, and functional genomics data to produce a complete and comprehensive reference sequence for all 24 human chromosomes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.