The walking catfish Clarias magur (Hamilton, 1822) (magur) is an important catfish species inhabiting the Indian subcontinent. It is considered as a highly nutritious food fish and has the capability to walk to some distance, and survive a considerable period without water. Assembly, scaffolding and several rounds of iterations resulted in 3484 scaffolds covering ∼94% of estimated genome with 9.88 Mb largest scaffold, and N50 1.31 Mb. The genome possessed 23748 predicted protein encoding genes with annotation of 19,279 orthologous genes. A total of 166 orthologous groups represented by 222 genes were found to be unique for this species. The Computational Analysis of gene Family Evolution (CAFÉ) analysis revealed expansion of 207 gene families and 100 gene families have rapidly evolved. Genes specific to important environmental and terrestrial adaptation, viz. urea cycle, vision, locomotion, olfactory and vomeronasal receptors, immune system, anti-microbial properties, mucus, thermoregulation, osmoregulation, air-breathing, and detoxification etc. were identified and critically analyzed. The analysis clearly indicated that C. magur genome possessed several unique and duplicate genes similar to that of terrestrial or amphibians’ counterparts in comparison to other teleostean species. The genome information will be useful in conservation genetics, not only for this species but also be very helpful in such studies in other catfishes.
Indian magur (Clarias batrachus) is an important freshwater catfish, which is listed as endangered under A3cde+4acde ver. 3.1 categories by the IUCN (2015) due to decreasing population trend. Microsatellites or short sequence repeats (SSRs) tagged to genes have been utilized as gene marker. In the present study, 31,814 SSRs of C. batrachus (magur) were identified using microsatellite identification tool programme from the next generation sequencing data generated on Roche 454 and Ion Torrent platforms. A bioinformatics pipeline, with stringent criteria resulted in selection of 1672 microsatellite loci falling in the genic region. Initially, a total of 30 loci were selected for primer development; and of these 14 were successfully amplified and five were found to be polymorphic in 30 individuals of C. batrachus (magur). The observed as well as expected heterozygosity ranged from 0.038 to 0.526 and 0.434 to 0.784, respectively, and the number of observed alleles ranged from three to five. The study reported the application of next generation sequencing technologies for rapid development of microsatellite loci in Indian catfish species, C. batrachus (magur).
Whole genome sequencing (WGS) using next generation sequencing technologies paves the way to sequence the mitochondrial genomes with greater ease and lesser time. Here, we used the WGS data of Clarias batrachus, generated from Roche 454 and Ion Torrent sequencing platforms, to assemble the complete mitogenome using both de novo and reference based approaches. Both the methods yielded almost similar results and the best assembled mitogenome was of 16,510 bp size (GenBank Acc. No. KM259918). The mitogenome annotation resulted in 13 coding genes, 22 tRNA genes, 2 rRNA genes and one control region, and the gene order was found to be identical with other catfishes. Variation analyses between assembled and the reference (GenBank Acc. No. NC_023923) mitogenome revealed 51 variations. The phylogenetic analysis of coding DNA sequences and tRNA supports the monophyly of catfishes. Two SSRs were identified in C. batrachus mitogenome, out of which one was unique to this species. Based on the relative rate of gene evolution, protein coding mitochondrial genes were found to evolve at a much faster pace than the d-loop, which in turn are followed by the rRNAs; the tRNAs showed wide variability in the rate of sequence evolution, and on average evolve the slowest. Among the coding genes, ND2 evolves most rapidly. The variations present in the coding regions of the mitogenome and their comparative analyses with other catfish species may be useful in species conservation and management programs.
Mining and characterization of Simple Sequence Repeat (SSR) markers from whole genomes provide valuable information about biological significance of SSR distribution and also facilitate development of markers for genetic analysis. Whole genome sequencing (WGS)-SSR Annotation Tool (WGSSAT) is a graphical user interface pipeline developed using Java Netbeans and Perl scripts which facilitates in simplifying the process of SSR mining and characterization. WGSSAT takes input in FASTA format and automates the prediction of genes, noncoding RNA (ncRNA), core genes, repeats and SSRs from whole genomes followed by mapping of the predicted SSRs onto a genome (classified according to genes, ncRNA, repeats, exonic, intronic, and core gene region) along with primer identification and mining of cross-species markers. The program also generates a detailed statistical report along with visualization of mapped SSRs, genes, core genes, and RNAs. The features of WGSSAT were demonstrated using Takifugu rubripes data. This yielded a total of 139 057 SSR, out of which 113 703 SSR primer pairs were uniquely amplified in silico onto a T. rubripes (fugu) genome. Out of 113 703 mined SSRs, 81 463 were from coding region (including 4286 exonic and 77 177 intronic), 7 from RNA, 267 from core genes of fugu, whereas 105 641 SSR and 601 SSR primer pairs were uniquely mapped onto the medaka genome. WGSSAT is tested under Ubuntu Linux. The source code, documentation, user manual, example dataset and scripts are available online at https://sourceforge.net/projects/wgssat-nbfgr.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.