Although many de novo genome assembly projects have recently been conducted using high-throughput sequencers, assembling highly heterozygous diploid genomes is a substantial challenge due to the increased complexity of the de Bruijn graph structure predominantly used. To address the increasing demand for sequencing of nonmodel and/or wildtype samples, in most cases inbred lines or fosmid-based hierarchical sequencing methods are used to overcome such problems. However, these methods are costly and time consuming, forfeiting the advantages of massive parallel sequencing. Here, we describe a novel de novo assembler, Platanus, that can effectively manage high-throughput data from heterozygous samples. Platanus assembles DNA fragments (reads) into contigs by constructing de Bruijn graphs with automatically optimized k-mer sizes followed by the scaffolding of contigs based on paired-end information. The complicated graph structures that result from the heterozygosity are simplified during not only the contig assembly step but also the scaffolding step. We evaluated the assembly results on eukaryotic samples with various levels of heterozygosity. Compared with other assemblers, Platanus yields assembly results that have a larger scaffold NG50 length without any accompanying loss of accuracy in both simulated and real data. In addition, Platanus recorded the largest scaffold NG50 values for two of the three low-heterozygosity species used in the de novo assembly contest, Assemblathon 2. Platanus therefore provides a novel and efficient approach for the assembly of gigabase-sized highly heterozygous genomes and is an attractive alternative to the existing assemblers designed for genomes of lower heterozygosity.
The lager beer yeast Saccharomyces pastorianus is considered an allopolyploid hybrid species between S. cerevisiae and S. eubayanus. Many S. pastorianus strains have been isolated and classified into two groups according to geographical origin, but this classification remains controversial. Hybridization analyses and partial PCR-based sequence data have indicated a separate origin of these two groups, whereas a recent intertranslocation analysis suggested a single origin. To clarify the evolutionary history of this species, we analysed 10 S. pastorianus strains and the S. eubayanus type strain as a likely parent by Illumina next-generation sequencing. In addition to assembling the genomes of five of the strains, we obtained information on interchromosomal translocation, ploidy, and single-nucleotide variants (SNVs). Collectively, these results indicated that the two groups of strains share S. cerevisiae haploid chromosomes. We therefore conclude that both groups of S. pastorianus strains share at least one interspecific hybridization event and originated from a common parental species and that differences in ploidy and SNVs between the groups can be explained by chromosomal deletion or loss of heterozygosity.
Coelacanths are known as ''living fossils,'' as they show remarkable morphological resemblance to the fossil record and belong to the most primitive lineage of living Sarcopterygii (lobe-finned fishes and tetrapods). Coelacanths may be key to elucidating the tempo and mode of evolution from fish to tetrapods. Here, we report the genome sequences of five coelacanths, including four Latimeria chalumnae individuals (three specimens from Tanzania and one from Comoros) and one L. menadoensis individual from Indonesia. These sequences cover two African breeding populations and two known extant coelacanth species. The genome is~2.74 Gbp and contains a high proportion (~60%) of repetitive elements. The genetic diversity among the individuals was extremely low, suggesting a small population size and/or a slow rate of evolution. We found a substantial number of genes that encode olfactory and pheromone receptors with features characteristic of tetrapod receptors for the detection of airborne ligands. We also found that limb enhancers of bmp7 and gli3, both of which are essential for limb formation, are conserved between coelacanth and tetrapods, but not ray-finned fishes. We expect that some tetrapod-like genes may have existed early in the evolution of primitive Sarcopterygii and were later co-opted to adapt to terrestrial environments. These coelacanth genomes will provide a cornerstone for studies to elucidate how ancestral aquatic vertebrates evolved into terrestrial animals.
How genetic information is modified to generate phenotypic variation within a species is one of the central questions in evolutionary biology. Here we focus on the striking intraspecific diversity of >200 aposematic elytral (forewing) colour patterns of the multicoloured Asian ladybird beetle, Harmonia axyridis, which is regulated by a tightly linked genetic locus h. Our loss-of-function analyses, genetic association studies, de novo genome assemblies, and gene expression data reveal that the GATA transcription factor gene pannier is the major regulatory gene located at the h locus, and suggest that repeated inversions and cis-regulatory modifications at pannier led to the expansion of colour pattern variation in H. axyridis. Moreover, we show that the colour-patterning function of pannier is conserved in the seven-spotted ladybird beetle, Coccinella septempunctata, suggesting that H. axyridis’ extraordinary intraspecific variation may have arisen from ancient modifications in conserved elytral colour-patterning mechanisms in ladybird beetles.
The ultimate goal for diploid genome determination is to completely decode homologous chromosomes independently, and several phasing programs from consensus sequences have been developed. These methods work well for lowly heterozygous genomes, but the manifold species have high heterozygosity. Additionally, there are highly divergent regions (HDRs), where the haplotype sequences differ considerably. Because HDRs are likely to direct various interesting biological phenomena, many genomic analysis targets fall within these regions. However, they cannot be accessed by existing phasing methods, and we have to adopt costly traditional methods. Here, we develop a de novo haplotype assembler, Platanus-allee ( http://platanus.bio.titech.ac.jp/platanus2 ), which initially constructs each haplotype sequence and then untangles the assembly graphs utilizing sequence links and synteny information. A comprehensive benchmark analysis reveals that Platanus-allee exhibits high recall and precision, particularly for HDRs. Using this approach, previously unknown HDRs are detected in the human genome, which may uncover novel aspects of genome variability.
Bacterial cytidine deaminase fused to the DNA binding domains of transcription activator-like effector nucleases was recently reported to transiently substitute a targeted C to a T in mitochondrial DNA of mammalian cultured cells1. We applied this system to targeted base editing in the Arabidopsis thaliana plastid genome. The targeted Cs were homoplasmically substituted to Ts in some plantlets of the T1 generation and the mutations were inherited by their offspring independently of their nuclear-introduced vectors.
We recently achieved targeted disruptions of cytoplasmic male sterility (CMS)-associated genes in the mitochondrial genomes of rice and rapeseed by using mitochondria-targeted transcription activator-like effector nucleases (mitoTALENs). It was the first report of stable and heritable targeted gene modification of plant mitochondrial genomes. Here, we attempted to use mitoTALENs to disrupt two mitochondrial genes in the model plant Arabidopsis thaliana(Arabidopsis) using three different promoters and two types of TALENs. The targets were the two isoforms of the ATP synthase subunit 6 gene, atp6-1 and atp6-2. Each of these genes was successfully deleted and the mitochondrial genomes were recovered in a homoplasmic state. The nuclear genome also has a copy of atp6-1, and we were able to confirm that it was the mitochondrial gene and not the nuclear pseudogene that was knocked out. Among the three mitoTALEN promoters tried, the RPS5A promoter was the most effective. Conventional mitoTALENs were more effective than singlemolecule mito-compactTALENs. Targeted mitochondrial gene deletion was achieved by crossing as well as by floral-dip transformation to introduce the mitoTALEN constructs into the nucleus. The gene disruptions were caused by large (kb-size) deletions. The ends of the remaining sequences were connected to distant loci, mostly by illegitimate homologous recombinations between repeats.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.