Large serine recombinases (LSRs) are DNA integrases that facilitate the site-specific integration of mobile genetic elements into bacterial genomes. Only a few LSRs, such as Bxb1 and PhiC31, have been characterized to date, with limited efficiency as tools for DNA integration in human cells. In this study, we developed a computational approach to identify thousands of LSRs and their DNA attachment sites, expanding known LSR diversity by >100-fold and enabling the prediction of their insertion site specificities. We tested their recombination activity in human cells, classifying them as landing pad, genome-targeting or multi-targeting LSRs. Overall, we achieved up to seven-fold higher recombination than Bxb1 and genome integration efficiencies of 40–75% with cargo sizes over 7 kb. We also demonstrate virus-free, direct integration of plasmid or amplicon libraries for improved functional genomics applications. This systematic discovery of recombinases directly from microbial sequencing data provides a resource of over 60 LSRs experimentally characterized in human cells for large-payload genome insertion without exposed DNA double-stranded breaks.
SummaryThousands of proteins localize to the nucleus; however, it remains unclear which contain transcriptional effectors. Here, we develop HT-recruit - a pooled assay where protein libraries are recruited to a reporter, and their transcriptional effects are measured by sequencing. Using this approach, we measure gene silencing and activation for thousands of domains. We find a relationship between repressor function and evolutionary age for the KRAB domains, discover Homeodomain repressor strength is collinear with Hox genetic organization, and identify activities for several Domains of Unknown Function. Deep mutational scanning of the CRISPRi KRAB maps the co-repressor binding surface and identifies substitutions that improve stability/silencing. By tiling 238 proteins, we find repressors as short as 10 amino acids. Finally, we report new activator domains, including a divergent KRAB. Together, these results provide a resource of 600 human proteins containing effectors and demonstrate a scalable strategy for assigning functions to protein domains.
Schistosomiasis is one of the most important and widespread neglected tropical diseases (NTD), with over 200 million people infected in more than 70 countries; the disease has nearly 800 million people at risk in endemic areas. Although mass drug administration is a cost-effective approach to reduce occurrence, extent, and severity of the disease, it does not provide protection to subsequent reinfection. Interventions that target the parasites’ intermediate snail hosts are a crucial part of the integrated strategy required to move toward disease elimination. The recent revolution in gene drive technology naturally leads to questions about whether gene drives could be used to efficiently spread schistosome resistance traits in a population of snails and whether gene drives have the potential to contribute to reduced disease transmission in the long run. Responsible implementation of gene drives will require solutions to complex challenges spanning multiple disciplines, from biology to policy. This Review Article presents collected perspectives from practitioners of global health, genome engineering, epidemiology, and snail/schistosome biology and outlines strategies for responsible gene drive technology development, impact measurements of gene drives for schistosomiasis control, and gene drive governance. Success in this arena is a function of many factors, including gene-editing specificity and efficiency, the level of resistance conferred by the gene drive, how fast gene drives may spread in a metapopulation over a complex landscape, ecological sustainability, social equity, and, ultimately, the reduction of infection prevalence in humans. With combined efforts from across the broad global health community, gene drives for schistosomiasis control could fortify our defenses against this devastating disease in the future.
The ENCODE4 Consortiums efforts to annotate non-coding, cis-regulatory elements (CREs) have advanced our understanding of gene regulatory landscapes which play a major role in health and disease. Pooled, non-coding CRISPR screens are a promising approach for systematically investigating gene regulatory mechanisms. Here, the ENCODE4 Functional Characterization Centers report 109 screens comprising 346,970 individual perturbations across 13.3Mb of the genome, using a variety of methods, readouts, and statistical analyses. Across 332 functionally confirmed CRE-gene links, we identify principles for screening endogenous, non-coding elements for causal regulatory mechanisms. Nearly all CREs show strong evidence of open chromatin, and targeting accessibility peak summits is a critical component of our proposed sgRNA design rules. We provide experimental guidelines to accurately detect CREs with variable, often low, transcriptional effects. We discover a previously undescribed DNA strand-bias for CRISPRi in transcribed regions with implications for screen design and analysis. Benchmarking five screen analysis tools, we find CASA produces the most conservative CRE calls and is robust to artifacts of low-specificity sgRNAs. Together, we provide an accessible data resource, predesigned sgRNAs targeting 3,275,697 ENCODE SCREEN cCREs, and screening guidelines to accelerate functional characterization of the non-coding genome.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.