The flour beetle Tribolium freemani is a sibling species of the model organism and important pest Tribolium castaneum. The two species are so closely related that they can produce hybrid progeny, but the genetic basis of their differences has not been revealed. In this work, we sequenced the T. freemani genome by applying PacBio HiFi technology. Using the well-assembled T. castaneum genome as a reference, we assembled 262 Mb of the T. freemani genomic sequence and anchored it in 10 linkage groups corresponding to nine autosomes and sex chromosome X. The assembly showed 99.8% completeness of conserved insect genes, indicating a high-quality reference genome. Comparison with the T. castaneum assembly revealed that the main differences in genomic sequence between the two sibling species come from repetitive DNA, including interspersed and tandem repeats. In this work, we also provided the complete assembled mitochondrial genome of T. freemani. Although the genome assembly needs to be ameliorated in tandemly repeated regions, the first version of the T. freemani reference genome and the complete mitogenome presented here represent useful resources for comparative evolutionary studies of related species and for further basic and applied research on different biological aspects of economically important pests.
The red flour beetle Tribolium castaneum is an important pest of stored agricultural products and the first beetle whose genome was sequenced. So far, one high-copy-number and ten moderate-copy-number satellite DNAs (satDNAs) have been described in the assembled part of its genome. In this work, we aimed to catalog the entire collection of T. castaneum satDNAs. We resequenced the genome using Illumina technology and predicted potential satDNAs via graph-based sequence clustering. In this way, we discovered 46 novel satDNAs that occupied a total of 2.1% of the genome and were, therefore, considered low-copy-number satellites. Their repeat units, preferentially 140–180 bp and 300–340 bp long, showed a high A + T composition ranging from 59.2 to 80.1%. In the current assembly, we annotated the majority of the low-copy-number satDNAs on one or a few chromosomes, discovering mainly transposable elements in their vicinity. The current assembly also revealed that many of the in silico predicted satDNAs were organized into short arrays not much longer than five consecutive repeats, and some of them also had numerous repeat units scattered throughout the genome. Although 20% of the unassembled genome sequence masked the genuine state, the predominance of scattered repeats for some low-copy satDNAs raises the question of whether these are essentially interspersed repeats that occur in tandem only sporadically, with the potential to be satDNA “seeds”.
Eukaryotic genomes are replete with satellite DNAs (satDNAs), large stretches of tandemly repeated sequences which are mostly underrepresented in genome assemblies. As a result, we know little about their spread, movement, and rearrangement across the genome or about their potential impact on the genome. Here, we have combined Oxford Nanopore long-read sequencing with a reference-guided assembly approach to generate a new, high-quality TcasONT genome assembly of the model beetle Tribolium castaneum. The new TcasONT is enriched by 50 Mb in the repetitive genome part. Therefore, we used the enhanced assembly to conduct global and in-depth analyses of ten abundant non-(peri)centromeric satDNAs, Cast1-Cast9. We found a large fraction of long arrays embedded in gene-rich regions common for almost all Cast1-Cast9 satDNAs and a negative correlation between the abundance of these satDNAs and transposable elements. Contrary to accepted opinion, our finding clearly speaks to the ability of satDNAs to expand in euchromatic regions and also demonstrate the flexibility of gene-rich regions to tolerate and accommodate satDNA sequences. In addition, we find that suppressed recombination on X chromosomes has less effect on the spread of satDNAs in euchromatin, but rather tolerates amplification of satDNAs into longer arrays. Our phylogenetic analyses show the extensive spread of Cast1-Cast9 satDNAs, characterized by distribution along the entire chromosome length and between (non)homologous chromosomes. This observed pattern points out very efficient mechanisms of propagation satDNAs on the whole genome scale including gene-rich regions. In conclusion, the presence of such a large portion of satDNAs with extensive propagation and amplification potential in gene-rich regions of the T. castaneum genome can strongly influence both gene expression and the dynamics of genome evolution.
The long-read Nanopore sequencing has been recently applied for assembly of complex genomes and analysis of linear genome organization. The most critical factor for successful long-read sequencing is extraction of high molecular weight (HMW) DNA of sufficient purity and quantity. The challenges associated with input DNA quality are further amplified when working with extremely small insects with hard exoskeletons. Here, we optimized the isolation of HMW DNA from the model beetle Tribolium and tested for use in Nanopore sequencing. We succeeded in overcoming all the difficulties in HMW handling and library preparation that were encountered when using published protocols and commercial kits. Isolation of nuclei and subsequent purification of DNA on an anion-exchange chromatography column resulted in genomic HMW DNA that was efficiently relaxed, of optimal quality and in sufficient quantity for Nanopore MinION sequencing. DNA shearing increased average N50 read values up to 26 kb and allowed us to use a single flow cell in multiple library loads for a total output of more than 13 Gb. Although our focus was on T. castaneum and closely related species, we expect that this protocol, with appropriate modifications, could be extended to other insects, particularly beetles.
Introduction The development of colorectal cancer (CRC) is a multistep process accompanied by the accumulation of mutations that start from specific precancerous lesion – colorectal adenomas (CA). CRC incidence and mortality can be reduced by the early identification of these neoplasm. Colonoscopy is the most widely used screening method for CRC identification. Nowadays, clinical research interest is shifting to the use of liquid biopsy that may help with the early diagnosis of CA and CRC. In our previous study, we identified long non-coding RNA MALAT1 gene amplification associated with the development of CA. Methods This study aimed to describe the potential of MALAT1 expression levels in the adenoma tissue of patients used in the previous study by real-time qPCR. Furthermore, we analysed the plasma samples of an independent group of patients with CA (n=97), CRC (n=101), and cancer-free individuals (CFI, n=48). Results There was no difference in the MALAT1 expression level between CA patients with or without MALAT1 amplification. However, the plasma MALAT1 expression levels were significantly upregulated in patients with CRC and CA compared to CFI (for both p<0.001). Moreover, a correlation between MALAT1 expression and histological types of adenomas was identified– high-CRC-risk adenomas also displayed the highest MALAT1 expression levels. Furthermore, in CRC patients, MALAT1 levels were associated with a response to therapy. Conclusion MALAT1 expression levels could serve as a promising circulating biomarker for early CA and CRC diagnosis, and even as a predictor of therapy response in CRC patients.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.