Trematomus species (suborder Notothenioidei; family Nototheniidae) are widely distributed in the southern oceans near Antarctica. There are 11 recognized species in the genus Trematomus, and notothenioids are known to have high chromosomal diversity (2n = 24–58) because of relatively recent and rapid adaptive radiation. Herein, we report the chromosomal-level genome assembly of T. loennbergii, the first characterized genome representative of the genus Trematomus. The final genome assembly of T. loennbergii was obtained using a Pacific Biosciences long-read sequencing platform and high-throughput chromosome conformation capture technology. Twenty-three chromosomal-level scaffolds were assembled to 940 Mb in total size, with a longest contig size of 48.5 Mb and contig N50 length of 24.7 Mb. The genome contained 42.03% repeat sequences, and a total of 24,525 protein-coding genes were annotated. We produced a high-quality genome assembly of T. loennbergii. Our results provide a first reference genome for the genus Trematomus and will serve as a basis for studying the molecular taxonomy and evolution of Antarctic fish.
The complete mitochondrial genome of Trematomus loennbergii was studied using NGS technology with PacBio platform. The mitochondrial genome size was 19,374bp and it had 13 protein-coding genes, 22 tRNAs and 2 rRNAs. There were 4 types of stop codons which were TAA, TAG, AGG and T(AA) but start codon type was only one (ATG). The contents of GC were 44.09% and AT contents were 55.91%. To conduct phylogenetic analysis, 12 species in 3 families were used. The result suggested that T. loennbergii was close to Pagothenia borchgrevinki in Nototheniidae. This study would provide a fundamental data for molecular evolution of T. loennbergii.
Gastrodia elata, an obligate mycoheterotrophic orchid, requires complete carbon and mineral nutrient supplementation from mycorrhizal fungi during its entire life cycle. Although full mycoheterotrophy occurs most often in the Orchidaceae family, no chromosome-level reference genome from this group has been assembled to date. Here, we report a high-quality chromosome-level genome assembly of G. elata, using Illumina and PacBio sequencing methods with Hi-C technique. The assembled genome size was found to be 1,045 Mb, with an N50 of 50.6 Mb and 488 scaffolds. A total of 935 complete (64.9%) matches to the 1,440 embryophyte Benchmarking Universal Single-Copy Orthologs were identified in this genome assembly. Hi-C scaffolding of the assembled genome resulted in 18 pseudochromosomes, 1,008 Mb in size and containing 96.5% of the scaffolds. A total of 18,844 protein-coding sequences (CDSs) were predicted in the G. elata genome, of which 15,619 CDSs (82.89%) were functionally annotated. In addition, 74.92% of the assembled genome was found to be composed of transposable elements. Phylogenetic analysis indicated a significant contraction of genes involved in various biosynthetic processes and cellular components and an expansion of genes for novel metabolic processes and mycorrhizal association. This result suggests an evolutionary adaptation of G. elata to a mycoheterotrophic lifestyle. In summary, the genomic resources generated in this work will provide a valuable reference genome for investigating the molecular mechanisms of G. elata biological functions. Further, the complete G. elata genome will greatly improve our understanding of the genetics of Orchidaceae and its mycoheterotrophic evolution.
Trematomus loennbergii Regan, 1913, is an evolutionarily important marine fish species distributed in the Antarctic Ocean. However, its genome has not been studied to date. In the present study, whole genome sequencing was performed using next-generation sequencing (NGS) technology to characterize its genome and develop genomic microsatellite markers. The 25-mer frequency distribution was estimated to be the best, and the genome size was predicted to be 815,042,992 bp. The heterozygosity, average rate of read duplication, and sequencing error rates were 0.536%, 0.724%, and 0.292%, respectively. These data were used to analyze microsatellite markers, and a total of 2,264,647 repeat motifs were identified. The most frequent repeat motif was di-nucleotide with 87.00% frequency, followed by tri-nucleotide (10.45%), tetra-nucleotide (1.94%), penta-nucleotide (0.34%), and hexa-nucleotide (0.27%). The AC repeat motif was the most abundant motif among di-nucleotides and among all repeat motifs. Among microsatellite markers, 181 markers were selected and PCR technology was used to validate several markers. A total of 15 markers produced only one band. In summary, these results provide a good basis for further studies, including evolutionary biology studies and population genetics of Antarctic fish species.
The Muraenolepididae family of fishes, known as eel cods, inhabits continental slopes and shelves in the Southern Hemisphere. This family belongs to the Gadiformes order, which constitutes one of the most important commercial fish resources worldwide, but the classification of the fish species in this order is ambiguous because it is only based on the morphological and habitat characteristics of the fishes. Here, the genome of Patagonian moray cod was sequenced using the Illumina HiSeq platform, and screened for microsatellite motifs. The genome was predicted to be 748.97 Mb, with a heterozygosity rate of 0.768%, via K-mer analysis (K = 25). The genome assembly showed that the total size of scaffolds was 711.92 Mb and the N50 scaffold length was 1522 bp. Additionally, 4,447,517 microsatellite motifs were identified from the genome survey assembly, and the most abundant motif type was found to be AC/GT. In summary, these data may facilitate the identification of molecular markers in Patagonian moray cod, which would be a good basis for further whole-genome sequencing with long read sequencing technology and chromosome conformation capture technology, as well as population genetics.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.