The compilation of simple sequence repeats (SSRs) in viruses and its analysis with reference to incidence, distribution and variation would be instrumental in understanding the functional and evolutionary aspects of repeat sequences. Present study encompasses the analysis of SSRs across 30 species of alphaviruses. The full length genome sequences, assessed from NCBI were used for extraction and analysis of repeat sequences using IMEx software. The repeats of different motif sizes (mono- to penta-nucleotide) observed therein exhibited variable incidence across the species. Expectedly, mononucleotide A/T was the most prevalent followed by dinucleotide AG/GA and trinucleotide AAG/GAA in these genomes. The conversion of SSRs to imperfect microsatellite or compound microsatellite (cSSR) is low. cSSR, primarily constituted by variant motifs accounted for up to 12.5% of the SSRs. Interestingly, seven species lacked cSSR in their genomes. However, the SSR and cSSR are predominantly localized to the coding region ORFs for non structural protein and structural proteins. The relative frequencies of different classes of simple and compound microsatellites within and across genomes have been highlighted.
Background:Microsatellites have evoked the interest of researchers owing to their applications in different fields such as DNA fingerprinting, genetic mapping, population genetics, forensics, paternity studies and evolution. Objectives: The present study focused on the analysis of simple sequence repeats (SSRs) in genomes of seven species from three genera of the Filoviridae family. Materials and Methods: Genome sequences of seven species from the Filoviridae family were assessed by the National Center for Biotechnology Information (NCBI), microsatellites were extracted using the IMEx software, and statistical analysis was performed Microsoft Office Excel 2007. Results: A total of 516 microsatellites and 14 Compound Simple Sequence Repeats (cSSR) (also known as compound microsatellites) were extracted. Evidently, the conversion of SSRs to cSSR was low. Mononucleotide A/T was the most prevalent followed by dinucleotide AC/CA and trinucleotide AAC/CAA. Highest incidence of SSRs (mon-/di-nucleotide motif) was observed in RNA Dependent RNA Polymerase (RDRP) gene whereas tri-nucleotide motif was maximally localized in nucleoproteins (NP).
Conclusions:The salient features of simple and compound microsatellites in Filoviridae family have been highlighted herein. Microsatellite regions with higher mutation rates compared to the rest of the genome play a crucial role in genome evolution by acting as a source of quantitative genetic variation. The SSR mutation rate is known to be affected by motif length, motif sequence, and number of repeats and purity of repetition. The functional role of tandem repeats in viruses, remains to be fully elucidated. However, with the repetitive sequence allegedly acting as a hot spot for recombination, we postulate their involvement in genetic events such as recombination, replication, and repair mechanisms that drive sequence diversity leading to the formation of the genetic basis of adaptation.
Most of the viral diseases of plants are caused by RNA viruses which drastically reduce crop yield. In order to generate resistance against RNA viruses infecting plants, we isolated the dicer 1 protein (CaDcr1), a member of RNAse III family (enzyme that cleaves double stranded RNA) from an opportunistic fungus Candida albicans. In vitro analysis revealed that the CaDcr1 cleaved dsRNA of the coat protein gene of cucumber mosaic virus (genus Cucumovirus, family Bromoviridae). Furthermore, we developed transgenic tobacco plants (Nicotiana tabacum cv. Xanthi) over-expressing expressing CaDcr1 by Agrobacterium mediated transformation. Transgenic tobacco lines were able to suppress infection of an Indian isolate of potato virus X (genus Potexvirus, family Alphaflexiviridae). The present study demonstrates that CaDcr1 can cleave double stranded replicative intermediate and provide tolerance to plant against RNA viruses.
The present study focused on Staphylococcus phage genomes, which have been classified to 3 categories on the basis of their size. Overall, 18 classes of II Staphylococcus phage genomes with genome size around 40 kbp were investigated to elucidate the presence, distribution, and complexity of SSRs therein. The full length genome sequences from NCBI were analyzed using the IMEx software. A total of 3656 simple sequence repeats (SSRs) and 213 compound SSRs (cSSR) were present in the studied genomes. The incident frequency of SSR and cSSR per genome ranged from 183 to 308 and 8 to 19, respectively. The SSRs distribution across genomes was non-linear and so was its conversion to cSSR (the range of cSSR percentage was from 4.15 to 9.13) implicating a non-uniform incidence and clustering in genomes. The AT rich content of genomes was reflected in the prevalence of repeats with A, AT/TA, and AAG/GAA being the highest represented mono-, di-, and tri-nucleotide repeat motifs, respectively. An increase in dMAX was accompanied by greater cSSRs in the genomes yet the increase was neither uniform across genomes nor linear. The SSRs and cSSRs are predominantly localized on the coding region. The non-coding region accounts for~19% in SSR and~30% in cSSR while a hypothetical protein accounted for~30% in SSRs as well as cSSR. The relative frequencies and distribution of different classes of simple and compound microsatellites within and across genomes are suggestive of these sequences being involved in genome evolution and adaptation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.