MotivationProtein contacts contain key information for the understanding of protein structure and function and thus, contact prediction from sequence is an important problem. Recently exciting progress has been made on this problem, but the predicted contacts for proteins without many sequence homologs is still of low quality and not very useful for de novo structure prediction.MethodThis paper presents a new deep learning method that predicts contacts by integrating both evolutionary coupling (EC) and sequence conservation information through an ultra-deep neural network formed by two deep residual neural networks. The first residual network conducts a series of 1-dimensional convolutional transformation of sequential features; the second residual network conducts a series of 2-dimensional convolutional transformation of pairwise information including output of the first residual network, EC information and pairwise potential. By using very deep residual networks, we can accurately model contact occurrence patterns and complex sequence-structure relationship and thus, obtain higher-quality contact prediction regardless of how many sequence homologs are available for proteins in question.ResultsOur method greatly outperforms existing methods and leads to much more accurate contact-assisted folding. Tested on 105 CASP11 targets, 76 past CAMEO hard targets, and 398 membrane proteins, the average top L long-range prediction accuracy obtained by our method, one representative EC method CCMpred and the CASP11 winner MetaPSICOV is 0.47, 0.21 and 0.30, respectively; the average top L/10 long-range accuracy of our method, CCMpred and MetaPSICOV is 0.77, 0.47 and 0.59, respectively. Ab initio folding using our predicted contacts as restraints but without any force fields can yield correct folds (i.e., TMscore>0.6) for 203 of the 579 test proteins, while that using MetaPSICOV- and CCMpred-predicted contacts can do so for only 79 and 62 of them, respectively. Our contact-assisted models also have much better quality than template-based models especially for membrane proteins. The 3D models built from our contact prediction have TMscore>0.5 for 208 of the 398 membrane proteins, while those from homology modeling have TMscore>0.5 for only 10 of them. Further, even if trained mostly by soluble proteins, our deep learning method works very well on membrane proteins. In the recent blind CAMEO benchmark, our fully-automated web server implementing this method successfully folded 6 targets with a new fold and only 0.3L-2.3L effective sequence homologs, including one β protein of 182 residues, one α+β protein of 125 residues, one α protein of 140 residues, one α protein of 217 residues, one α/β of 260 residues and one α protein of 462 residues. Our method also achieved the highest F1 score on free-modeling targets in the latest CASP (Critical Assessment of Structure Prediction), although it was not fully implemented back then.Availabilityhttp://raptorx.uchicago.edu/ContactMap/
To broaden our understanding of human neurodevelopment, we profiled transcriptomic and epigenomic landscapes across brain regions and/or cell types for the entire span of prenatal and postnatal development. Integrative analysis revealed temporal, regional, sex, and cell type–specific dynamics. We observed a global transcriptomic cup-shaped pattern, characterized by a late fetal transition associated with sharply decreased regional differences and changes in cellular composition and maturation, followed by a reversal in childhood-adolescence, and accompanied by epigenomic reorganizations. Analysis of gene coexpression modules revealed relationships with epigenomic regulation and neurodevelopmental processes. Genes with genetic associations to brain-based traits and neuropsychiatric disorders (including MEF2C, SATB2, SOX5, TCF4, and TSHZ3) converged in a small number of modules and distinct cell types, revealing insights into neurodevelopment and the genomic basis of neuropsychiatric risks.
All-perovskite–based polycrystalline thin-film tandem solar cells have the potential to deliver efficiencies of >30%. However, the performance of all-perovskite–based tandem devices has been limited by the lack of high-efficiency, low–band gap tin-lead (Sn-Pb) mixed-perovskite solar cells (PSCs). We found that the addition of guanidinium thiocyanate (GuaSCN) resulted in marked improvements in the structural and optoelectronic properties of Sn-Pb mixed, low–band gap (~1.25 electron volt) perovskite films. The films have defect densities that are lower by a factor of 10, leading to carrier lifetimes of greater than 1 microsecond and diffusion lengths of 2.5 micrometers. These improved properties enable our demonstration of >20% efficient low–band gap PSCs. When combined with wider–band gap PSCs, we achieve 25% efficient four-terminal and 23.1% efficient two-terminal all-perovskite–based polycrystalline thin-film tandem solar cells.
Organic luminogens with persistent room temperature phosphorescence (RTP) have attracted great attention for their wide applications in optoelectronic devices and bioimaging. However, these materials are still very scarce, partially due to the unclear mechanism and lack of designing guidelines. Herein we develop seven 10-phenyl-10H-phenothiazine-5,5-dioxide-based derivatives, reveal their different RTP properties and underlying mechanism, and exploit their potential imaging applications. Coupled with the preliminary theoretical calculations, it is found that strong π–π interactions in solid state can promote the persistent RTP. Particularly, CS-CF3 shows the unique photo-induced phosphorescence in response to the changes in molecular packing, further confirming the key influence of the molecular packing on the RTP property. Furthermore, CS-F with its long RTP lifetime could be utilized for real-time excitation-free phosphorescent imaging in living mice. Thus, our study paves the way for the development of persistent RTP materials, in both the practical applications and the inherent mechanism.
Modern sugarcanes are polyploid interspecific hybrids, combining high sugar content from Saccharum officinarum with hardiness, disease resistance and ratooning of Saccharum spontaneum. Sequencing of a haploid S. spontaneum, AP85-441, facilitated the assembly of 32 pseudo-chromosomes comprising 8 homologous groups of 4 members each, bearing 35,525 genes with alleles defined. The reduction of basic chromosome number from 10 to 8 in S. spontaneum was caused by fissions of 2 ancestral chromosomes followed by translocations to 4 chromosomes. Surprisingly, 80% of nucleotide binding site-encoding genes associated with disease resistance are located in 4 rearranged chromosomes and 51% of those in rearranged regions. Resequencing of 64 S. spontaneum genomes identified balancing selection in rearranged regions, maintaining their diversity. Introgressed S. spontaneum chromosomes in modern sugarcanes are randomly distributed in AP85-441 genome, indicating random recombination among homologs in different S. spontaneum accessions. The allele-defined Saccharum genome offers new knowledge and resources to accelerate sugarcane improvement.
Graphene quantum dots (GQDs) have various alluring properties and potential applications, but their large-scale applications are limited by current synthetic methods that commonly produce GQDs in small amounts. Moreover, GQDs usually exhibit polycrystalline or highly defective structures and thus poor optical properties. Here we report the gram-scale synthesis of single-crystalline GQDs by a facile molecular fusion route under mild and green hydrothermal conditions. The synthesis involves the nitration of pyrene followed by hydrothermal treatment in alkaline aqueous solutions, where alkaline species play a crucial role in tuning their size, functionalization and optical properties. The single-crystalline GQDs are bestowed with excellent optical properties such as bright excitonic fluorescence, strong excitonic absorption bands extending to the visible region, large molar extinction coefficients and long-term photostability. These high-quality GQDs can find a large array of novel applications in bioimaging, biosensing, light emitting diodes, solar cells, hydrogen production, fuel cells and supercapacitors.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.