The functions of most long non-coding RNAs (lncRNAs) are unknown. In contrast to proteins, lncRNAs with similar functions often lack linear sequence homology; thus, the identification of function in one lncRNA rarely informs the identification of function in others. We developed a sequence comparison method to deconstruct linear sequence relationships in lncRNAs and evaluate similarity based on the abundance of short motifs called k-mers. We found that lncRNAs of related function often had similar k-mer profiles despite lacking linear homology, and that k-mer profiles correlated with protein binding to lncRNAs and with their subcellular localization. Using a novel assay to quantify Xist-like regulatory potential, we directly demonstrated that evolutionarily unrelated lncRNAs can encode similar function through different spatial arrangements of related sequence motifs. K-mer-based classification is a powerful approach to detect recurrent relationships between sequence and function in lncRNAs.
Patients with late-onset Alzheimer's disease (LOAD) frequently manifest comorbid neuropsychiatric symptoms with depression and anxiety being most frequent, and individuals with major depressive disorder (MDD) have an increased prevalence of LOAD. This suggests shared etiologies and intersecting pathways between LOAD and MDD. We performed pleiotropy analyses using LOAD and MDD GWAS data sets from the International Genomics of Alzheimer's Project (IGAP) and the Psychiatric Genomics Consortium (PGC), respectively. We found a moderate enrichment for SNPs associated with LOAD across increasingly stringent levels of significance with the MDD GWAS association (LOAD| MDD), of maximum four and eightfolds, including and excluding the APOE-region, respectively. Association analysis excluding the APOE-region identified numerous SNPs corresponding to 40 genes, 9 of which are known LOAD-risk loci primarily in chromosome 11 regions that contain the SPI1 gene and MS4A genes cluster, and others were novel pleiotropic risk-loci for LOAD conditional with MDD. The most significant associated SNPs on chromosome 11 overlapped with eQTLs found in whole-blood and monocytes, suggesting functional roles in gene regulation. The reverse conditional association analysis (MDD|LOAD) showed a moderate level,~sevenfold, of polygenic overlap, however, no SNP showed significant association. Pathway analyses replicated previously reported LOAD biological pathways related to immune response and regulation of endocytosis. In conclusion, we provide insights into the overlapping genetic signatures underpinning the common phenotypic manifestations and inter-relationship between LOAD and MDD. This knowledge is crucial to the development of actionable targets for novel therapies to treat depression preceding dementia, in an effort to delay or ultimately prevent the onset of LOAD.
The marsupial inactive X chromosome expresses a long noncoding RNA (lncRNA) called Rsx that has been proposed to be the functional analog of eutherian Xist. Despite the possibility that Xist and Rsx encode related functions, the two lncRNAs harbor no linear sequence similarity. However, both lncRNAs harbor domains of tandemly repeated sequence. In Xist, these repeat domains are known to be critical for function. Using k-mer based comparison, we show that the repeat domains of Xist and Rsx unexpectedly partition into two major clusters that each harbor substantial levels of nonlinear sequence similarity. Xist Repeats B, C, and D were most similar to each other and to Rsx Repeat 1, whereas Xist Repeats A and E were most similar to each other and to Rsx Repeats 2, 3, and 4. Similarities at the level of k-mers corresponded to domain-specific enrichment of protein-binding motifs. Within individual domains, protein-binding motifs were often enriched to extreme levels. Our data support the hypothesis that Xist and Rsx encode similar functions through different spatial arrangements of functionally analogous protein-binding domains. We propose that the two clusters of repeat domains in Xist and Rsx function in part to cooperatively recruit PRC1 and PRC2 to chromatin. The physical manner in which these domains engage with protein cofactors may be just as critical to the function of the domains as the protein cofactors themselves. The general approaches we outline in this report should prove useful in the study of any set of RNAs.
BRD4, a member of the bromodomain and extraterminal domain (BET) family, has emerged as a promising epigenetic target in cancer and inflammatory disorders. All reported BET family ligands bind within the bromodomain acetyl-lysine binding sites and competitively inhibit BET protein interaction with acetylated chromatin. Alternative chemical probes that act orthogonally to the highly conserved acetyl-lysine binding sites may exhibit selectivity within the BET family and avoid recently reported toxicity in clinical trials of BET bromodomain inhibitors. Here, we report the first identification of a ligandable site on a bromodomain outside the acetyl-lysine binding site. Inspired by our computational prediction of hotspots adjacent to nonhomologous cysteine residues within the C-terminal BRD4 bromodomain (BRD4-BD2), we performed a midthroughput mass spectrometry screen to identify cysteine-reactive fragments that covalently and selectively modify BRD4. Subsequent mass spectrometry, NMR, and computational docking analyses of electrophilic fragment hits revealed a novel ligandable site near Cys356 that is unique to BRD4 among human bromodomains. This site is orthogonal to the BRD4-BD2 acetyl-lysine binding site as Cys356 modification did not impact binding of the pan-BET bromodomain inhibitor JQ1 in fluorescence polarization assays nor an acetylated histone peptide in AlphaScreen assays. Finally, we tethered our top-performing covalent fragment to JQ1 and performed NanoBRET assays to provide proof of principle that this orthogonal site can be covalently targeted in intact human cells. Overall, we demonstrate the potential of targeting sites orthogonal to bromodomain acetyl-lysine binding sites to develop bivalent and covalent inhibitors that displace BRD4 from chromatin.
The drug praziquantel (PZQ) is the key clinical therapy for treating schistosomiasis and other infections caused by parasitic flatworms. A schistosome target for PZQ was recently identified— a transient receptor potential ion channel in the melastatin subfamily (TRPM PZQ )—however, little is known about the properties of TRPM PZQ in other parasitic flatworms. Here, TRPM PZQ orthologs were scrutinized from all currently available parasitic flatworm genomes. TRPM PZQ is present in all parasitic flatworms, and the consensus PZQ binding site was well conserved. Functional profiling of trematode, cestode, and a free-living flatworm TRPM PZQ ortholog revealed differing sensitives (~300-fold) of these TRPM PZQ channels toward PZQ, which matched the varied sensitivities of these different flatworms to PZQ. Three loci of variation were defined across the parasitic flatworm TRPM PZQ pocketome with the identity of an acidic residue in the TRP domain acting as a gatekeeper residue impacting PZQ residency within the TRPM PZQ ligand binding pocket. In trematodes and cyclophyllidean cestodes, which display high sensitivity to PZQ, this TRP domain residue is an aspartic acid which is permissive for potent activation by PZQ. However, the presence of a glutamic acid residue found in other parasitic and free-living flatworm TRPM PZQ was associated with lower sensitivity to PZQ. The definition of these different binding pocket architectures explains why PZQ shows high therapeutic effectiveness against specific fluke and tapeworm infections and will help the development of better tailored therapies toward other parasitic infections of humans, livestock, and fish.
Introduction: Genome-wide association studies (GWAS) discovered multiple late-onset Alzheimer's disease (LOAD)-associated SNPs and inferred the genes based on proximity; however, the actual causal genes are yet to be identified. Methods: We defined LOAD-GWAS regions by the most significantly associated SNP 60.5 Mb and developed a bioinformatics pipeline that uses and integrates chromatin state segmentation track to map active enhancers and virtual 4C software to visualize interactions between active enhancers and gene promoters. We augmented our pipeline with biomedical and functional information. Results: We applied the bioinformatics pipeline using three w1 Mb LOAD-GWAS loci: BIN1, PIC-ALM, CELF1. These loci contain 10-24 genes, an average of 106 active enhancers and 80 CTCF sites. Our strategy identified all genes corresponding to the promoters that interact with the active enhancer that is closest to the LOAD-GWAS-SNP and generated a shorter list of prioritized candidate LOAD genes (5-14/loci), feasible for post-GWAS investigations of causality. Discussion: Interpretation of LOAD-GWAS discoveries requires the integration of brain-specific functional genomic data sets and information related to regulatory activity.
The periselective Diels-Alder reaction of 5-substituted pentamethylcyclopentadienes and nitroethylene has been realized by helical-chiral hydrogen bond donor catalysts. To our knowledge, this represents the first asymmetric catalytic nitroalkene Diels-Alder reaction via activation of nitroalkene, and thus establishes its proof-of-principle.
Chiral diamine-derived hydrogen-bond donors were evaluated for their ability to effect stereocontrol in an intramolecular hetero-Diels–Alder (HDA) reaction hypothesized in the biosynthesis of brevianamides A and B. Collectively, these results provide proof of principle that small-molecule hydrogen-bond catalysis, if even based on a hypothetical biosynthesis construct, holds significant potential within enantioselective natural product synthesis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.