SummaryThroughout evolution, primate genomes have been modified by waves of retrotransposon insertions1,2,3. For each wave, the host eventually finds a way to repress retrotransposon transcription and prevent further insertions. In mouse embryonic stem cells (mESCs), transcriptional silencing of retrotransposons requires TRIM28 (KAP1) and it’s repressive complex, which can be recruited to target sites by KRAB zinc finger proteins such as murine-specific ZFP809 which binds to integrated murine leukemia virus DNA elements and recruits KAP1 to repress them4,5. KZNF genes are one of the fastest growing gene families in primates and this expansion is hypothesized to enable primates to respond to newly emerged retrotransposons6,7. However, the identity of KZNF genes battling retrotransposons currently active in the human genome, such as SINE-VNTR-Alu (SVA)8 and Long Interspersed Nuclear Element-1 (L1)9, is unknown. We find that two primate-specific KZNF genes rapidly evolved to repress these two distinct retrotransposon families shortly after they began to spread in our ancestral genome. ZNF91 underwent a series of structural changes 8-12 MYA that enabled it to repress SVA elements. ZNF93 evolved earlier to repress the primate L1 lineage until ~12.5 MYA when the L1PA3-subfamily escaped ZNF93’s restriction through purge of the ZNF93 binding site. Our data support a model where KZNF gene expansion limits the activity of newly emerged retrotransposon classes, and this is followed by mutations in these retrotransposons to evade repression, a cycle of events that could explain the rapid expansion of lineage-specific KZNF genes.
Phytoplasmas ("CandidatusPhytoplasmaThe phylogenetic tree of mollicutes is composed of two major clades that diverged early in evolution (51). One clade contains the orders Acholeplasmatales and Anaeroplasmatales (AAA clade mollicutes), and the other clade contains the orders Mycoplasmatales and Entomoplasmatales (SEM clade mollicutes) (9). Phytoplasmas, formerly known as mycoplasma-like organisms of plants, form a monophyletic group in the order Acholeplasmatales (51) and were recently assigned to a novel genus, "Candidatus Phytoplasma" (41). Approximately 20 phytoplasma phylogenetic groups have been proposed based on 16S rRNA gene sequences, and new branches are continuously being discovered (69,85). Members of the order Acholeplasmatales are distinct from other mollicutes in several ways. For instance, whereas most mollicutes use UGA as a tryptophan codon instead of a stop codon, a feature they share with mitochondria, the acholeplasmas and phytoplasmas retained UGA as a stop codon (80).
Using high-throughput sequencing, we devised a technique to determine the insertion sites of virtually all members of the human-specific L1 retrotransposon family in any human genome. Using diagnostic nucleotides, we were able to locate the approximately 800 L1Hs copies corresponding specifically to the pre-Ta, Ta-0, and Ta-1 L1Hs subfamilies, with over 90% of sequenced reads corresponding to human-specific elements. We find that any two individual genomes differ at an average of 285 sites with respect to L1 insertion presence or absence. In total, we assayed 25 individuals, 15 of which are unrelated, at 1139 sites, including 772 shared with the reference genome and 367 nonreference L1 insertions. We show that L1Hs profiles recapitulate genetic ancestry, and determine the chromosomal distribution of these elements. Using these data, we estimate that the rate of L1 retrotransposition in humans is between 1/95 and 1/270 births, and the number of dimorphic L1 elements in the human population with gene frequencies greater than 0.05 is between 3000 and 10,000.
Genetic changes causing brain size expansion in human evolution have remained elusive. Notch signaling is essential for radial glia stem cell proliferation and is a determinant of neuronal number in the mammalian cortex. We find that three paralogs of human-specific NOTCH2NL are highly expressed in radial glia. Functional analysis reveals that different alleles of NOTCH2NL have varying potencies to enhance Notch signaling by interacting directly with NOTCH receptors. Consistent with a role in Notch signaling, NOTCH2NL ectopic expression delays differentiation of neuronal progenitors, while deletion accelerates differentiation into cortical neurons. Furthermore, NOTCH2NL genes provide the breakpoints in 1q21.1 distal deletion/duplication syndrome, where duplications are associated with macrocephaly and autism and deletions with microcephaly and schizophrenia. Thus, the emergence of human-specific NOTCH2NL genes may have contributed to the rapid evolution of the larger human neocortex, accompanied by loss of genomic stability at the 1q21.1 locus and resulting recurrent neurodevelopmental disorders.
SummarySomatic LINE-1 (L1) retrotransposition during neurogenesis is a potential source of genotypic variation among neurons. As a neurogenic niche, the hippocampus supports pronounced L1 activity. However, the basal parameters and biological impact of L1-driven mosaicism remain unclear. Here, we performed single-cell retrotransposon capture sequencing (RC-seq) on individual human hippocampal neurons and glia, as well as cortical neurons. An estimated 13.7 somatic L1 insertions occurred per hippocampal neuron and carried the sequence hallmarks of target-primed reverse transcription. Notably, hippocampal neuron L1 insertions were specifically enriched in transcribed neuronal stem cell enhancers and hippocampus genes, increasing their probability of functional relevance. In addition, bias against intronic L1 insertions sense oriented relative to their host gene was observed, perhaps indicating moderate selection against this configuration in vivo. These experiments demonstrate pervasive L1 mosaicism at genomic loci expressed in hippocampal neurons.
The detection of somatic mutations from cancer genome sequences is key to understanding the genetic basis of disease progression, patient survival and response to therapy. Benchmarking is needed for tool assessment and improvement but is complicated by a lack of gold standards, by extensive resource requirements and by difficulties in sharing personal genomic information. To resolve these issues, we launched the ICGC-TCGA DREAM Somatic Mutation Calling Challenge, a crowdsourced benchmark of somatic mutation detection algorithms. Here we report the BAMSurgeon tool for simulating cancer genomes and the results of 248 analyses of three in silico tumors created with it. Different algorithms exhibit characteristic error profiles, and, intriguingly, false positives show a trinucleotide profile very similar to one found in human tumors. Although the three simulated tumors differ in sequence contamination (deviation from normal cell sequence) and in subclonality, an ensemble of pipelines outperforms the best individual pipeline in all cases. BAMSurgeon is available at https://github.com/adamewing/bamsurgeon/.
L1 retrotransposons comprise 17% of the human genome and are its only autonomous mobile elements. Although L1-induced insertional mutagenesis causes Mendelian disease, their mutagenic load in cancer has been elusive. Using L1-targeted resequencing of 16 colorectal tumor and matched normal DNAs, we found that certain cancers were excessively mutagenized by human-specific L1s, while no verifiable insertions were present in normal tissues. We confirmed de novo L1 insertions in malignancy by both validating and sequencing 69/107 tumor-specific insertions and retrieving both 5′ and 3′ junctions for 35. In contrast to germline polymorphic L1s, all insertions were severely 5′ truncated. Validated insertion numbers varied from up to 17 in some tumors to none in three others, and correlated with the age of the patients. Numerous genes with a role in tumorigenesis were targeted, including ODZ3, ROBO2, PTPRM, PCM1, and CDH11. Thus, somatic retrotransposition may play an etiologic role in colorectal cancer.
The family Rhizobiaceae contains plant-associated bacteria with critical roles in ecology and agriculture. Within this family, many Rhizobium and Sinorhizobium strains are nitrogen-fixing plant mutualists, while many strains designated as Agrobacterium are plant pathogens. These contrasting lifestyles are primarily dependent on the transmissible plasmids each strain harbors. Members of the Rhizobiaceae also have diverse genome architectures that include single chromosomes, multiple chromosomes, and plasmids of various sizes. Agrobacterium strains have been divided into three biovars, based on physiological and biochemical properties. The genome of a biovar I strain, A. tumefaciens C58, has been previously sequenced. In this study, the genomes of the biovar II strain A. radiobacter K84, a commercially available biological control strain that inhibits certain pathogenic agrobacteria, and the biovar III strain A. vitis S4, a narrow-host-range strain that infects grapes and invokes a hypersensitive response on nonhost plants, were fully sequenced and annotated. Comparison with other sequenced members of the Alphaproteobacteria provides new data on the evolution of multipartite bacterial genomes. Primary chromosomes show extensive conservation of both gene content and order. In contrast, secondary chromosomes share smaller percentages of genes, and conserved gene order is restricted to short blocks. We propose that secondary chromosomes originated from an ancestral plasmid to which genes have been transferred from a progenitor primary chromosome. Similar patterns are observed in select Beta-and Gammaproteobacteria species. Together, these results define the evolution of chromosome architecture and gene content among the Rhizobiaceae and support a generalized mechanism for second-chromosome formation among bacteria.The family Rhizobiaceae (order Rhizobiales) of the Alphaproteobacteria includes the plant pathogens of the genus Agrobacterium and the nitrogen-fixing plant mutualists of the genera Rhizobium and Sinorhizobium. Members house single and multiple chromosome arrangements, linear replicons, and plasmids of various sizes. Genes of pathogenicity, mutualism, and other symbiotic properties are primarily encoded on large transmissible plasmids. Given the promiscuous nature of these elements, different genomic lineages within the Rhizobiaceae exhibit a variety of symbiotic phenotypes that range from pathogenesis to nitrogen-fixing mutualism.Agrobacterium taxonomy and phylogeny display a marked disparity. Empirically, organisms of the genus Agrobacterium are grouped into five species based on the disease phenotype associated with the resident disease-inducing plasmid: A. tumefaciens causes crown gall on dicotyledonous plants, including stone fruit and nut trees; A. rubi causes crown gall on raspberries; A. vitis causes gall formation that is limited to grapes; A. rhizogenes causes hairy root disease; and A. radiobacter is avirulent. An alternative classification scheme
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.