The piggyBac DNA transposon is an active element initially isolated from the cabbage looper moth, but members of this superfamily are also present in most eukaryotic evolutionary lineages. The functionally important regions of the transposase are well described. There is an RNase H-like fold containing the DDD motif responsible for the catalytic DNA cleavage and joining reactions and a C-terminal cysteine-rich domain important for interaction with the transposon DNA. However, the protein also contains a ~100 amino acid long N-terminal disordered region (NTDR) whose function is currently unknown. Here we show that deletion of the NTDR significantly impairs piggyBac transposition, although the extent of decrease is strongly cell-type specific. Moreover, replacing the NTDR with scrambled but similarly disordered sequences did not rescue transposase activity, indicating the importance of sequence conservation. Cell-based transposon excision and integration assays reveal that the excision step is more severely affected by NTDR deletion. Finally, bioinformatic analyses indicated that the NTDR is specific for the piggyBac superfamily and is also present in domesticated, transposase-derived proteins incapable of catalyzing transposition. Our results indicate an essential role of the NTDR in the “fine-tuning” of transposition and its significance in the functions of piggyBac-originated co-opted genes.
The evolution and functional integration of new genes, especially those that become core to key functions, remains enigmatic. We consider the mammal-specific gene, piggyBac transposable element derived 1 (PGBD1), implicated in neuronal disorders. While it no longer recognises piggyBac transposon-like inverted repeats and transposase functionality having been lost, it has evolved a core role in neural homeostasis. Depletion of PGBD1 triggers accumulation of mammal-specific paraspeckles and neural differentiation. It acts by two modalities, DNA binding and protein-protein interaction. As a transcriptional repressor of (lnc)NEAT1, the backbone of paraspeckles, it inhibits paraspeckle formation in neural progenitor cells (NPCs). At the protein level it is associated with the stress response system, a function partially shared with (lnc)NEAT1. PGBD1 thus presents as an unusual exemplar of new gene creation, being a recently acquired multi-function, multi-modal gene. Mammalian specificity associated with control of a mammal-specific structure implies coevolution of new genes with new functions.
Although new genes can arrive from modes other than duplication, few examples are well characterized. Given high expression in some human brain subregions and a putative link to psychological disorders [e.g., schizophrenia (SCZ)], suggestive of brain functionality, here we characterize piggyBac transposable element-derived 1 (PGBD1). PGBD1 is nonmonotreme mammal-specific and under purifying selection, consistent with functionality. The gene body of human PGBD1 retains much of the original DNA transposon but has additionally captured SCAN and KRAB domains. Despite gene body retention, PGBD1 has lost transposition abilities, thus transposase functionality is absent. PGBD1 no longer recognizes piggyBac transposon-like inverted repeats, nonetheless PGBD1 has DNA binding activity. Genome scale analysis identifies enrichment of binding sites in and around genes involved in neuronal development, with association with both histone activating and repressing marks. We focus on one of the repressed genes, the long noncoding RNA NEAT1, also dysregulated in SCZ, the core structural RNA of paraspeckles. DNA binding assays confirm specific binding of PGBD1 both in the NEAT1 promoter and in the gene body. Depletion of PGBD1 in neuronal progenitor cells (NPCs) results in increased NEAT1/paraspeckles and differentiation. We conclude that PGBD1 has evolved core regulatory functionality for the maintenance of NPCs. As paraspeckles are a mammal-specific structure, the results presented here show a rare example of the evolution of a novel gene coupled to the evolution of a contemporaneous new structure.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.