The fission yeast Schizosaccharomyces pombe is an important model organism, but its natural diversity and evolutionary history remain under-studied. In particular, the population genomics of the S. pombe mitochondrial genome (mitogenome) has not been thoroughly investigated. Here, we assembled the complete circular-mapping mitogenomes of 192 S. pombe isolates de novo, and found that these mitogenomes belong to 69 nonidentical sequence types ranging from 17,618 to 26,910 bp in length. Using the assembled mitogenomes, we identified 20 errors in the reference mitogenome and discovered two previously unknown mitochondrial introns. Analyzing sequence diversity of these 69 types of mitogenomes revealed two highly distinct clades, with only three mitogenomes exhibiting signs of inter-clade recombination. This diversity pattern suggests that currently available S. pombe isolates descend from two long-separated ancestral lineages. This conclusion is corroborated by the diversity pattern of the recombination-repressed K-region located between donor mating-type loci mat2 and mat3 in the nuclear genome. We estimated that the two ancestral S. pombe lineages diverged about 31 million generations ago. These findings shed new light on the evolution of S. pombe and the data sets generated in this study will facilitate future research on genome evolution.
Species identification is vital for protecting species diversity and selecting high-quality germplasm resources. Wild Fragaria spp. comprise rich and excellent germplasm resources; however, the variation and evolution of the whole chloroplast (cp) genomes in the genus Fragaria have been ignored. In the present study, 27 complete chloroplast genomes of 11 wild Fragaria species were sequenced using the Illumina platform. Then, the variation among complete cp genomes of Fragaria was analyzed, and phylogenetic relationships were reconstructed from those genome sequences. There was an overall high similarity of sequences, with some divergence. According to analysis with mVISTA, non-coding regions were more variable than coding regions. Inverted repeats (IRs) were observed to contract or expand to different degrees, which resulted in different sizes of cp genomes. Additionally, five variable loci, trnS-trnG, trnR-atpA, trnC-petN, rbcL-accD, and psbE-petL, were identified that could be used to develop DNA barcoding for identification of Fragaria species. Phylogenetic analyses based on the whole cp genomes supported clustering all species into two groups (A and B). Group A species were mainly distributed in western China, while group B contained several species from Europe and Americas. These results support allopolyploid origins of the octoploid species F. chiloensis and F. virginiana and the tetraploid species F. moupinensis and F. tibetica. The complete cp genomes of these Fragaria spp. provide valuable information for selecting high-quality Fragaria germplasm resources in the future.
The fission yeast Schizosaccharomyces pombe is an important model organism, but its natural diversity and evolutionary history remain under-studied. In particular, the population genomics of S. pombe mitochondrial genome (mitogenome) has not been thoroughly investigated. Here, we de novo assembled the complete circular-mapping mitogenomes of 192 S. pombe isolates, and found that these mitogenomes belong to 69 non-identical types ranging in size from 17618 bp to 26910 bp. Using the assembled mitogenomes, we identified 20 errors in the reference mitogenome and discovered two previously unknown mitochondrial introns. Analysing sequence diversity of these 69 types of mitogenomes revealed that, unexpectedly, they mainly fall into two highly distinct clades, with only three mitogenomes exhibiting signs of inter-clade recombination. This diversity pattern suggests that currently available S. pombe isolates descend from two long-separated ancestral lineages. This conclusion is corroborated by the diversity pattern of the recombinationrepressed K-region located between donor mating-type loci mat2 and mat3 in the nuclear genome. We estimated that the two ancestral S. pombe lineages diverged about 40 million generations ago. These findings shed new light on the evolution of S. pombe and the datasets generated in this study will facilitate future research on genome evolution.
Mosaic variants resulting from postzygotic mutations are prevalent in the human genome and play important roles in human diseases. However, except for cancer-related variants, there is no collection of postzygotic mosaic variants in noncancer disease-related and healthy individuals. Here, we present MosaicBase, a comprehensive database that includes 6698 mosaic variants related to 266 noncancer diseases and 27,991 mosaic variants identified in 422 healthy individuals. Genomic and phenotypic information of each variant was manually extracted and curated from 383 publications. MosaicBase supports the query of variants with Online Mendelian Inheritance in Man (OMIM) entries, genomic coordinates, gene symbols, or Entrez IDs. We also provide an integrated genome browser for users to easily access mosaic variants and their related annotations for any genomic region. By analyzing the variants collected in MosaicBase, we find that mosaic variants that directly contribute to disease phenotype show features distinct from those of variants in individuals with mild or no phenotypes, in terms of their genomic distribution, mutation signatures, and fraction of mutant cells. MosaicBase will not only assist clinicians in genetic counseling and diagnosis but also provide a useful resource to understand the genomic baseline of postzygotic mutations in the general human population. MosaicBase is publicly available at http://mosaicbase.com/ or http://49.4.21.8:8000.
Drosophila melanogaster is a well-established model organism that is widely used in genetic studies. This species enjoys the availability of a wide range of research tools, well-annotated reference databases and highly similar gene circuitry to other insects. To facilitate molecular mechanism studies in Drosophila, we present the Predicted Drosophila Interactome Resource (PDIR), a database of high-quality predicted functional gene interactions. These interactions were inferred from evidence in 10 public databases providing information for functional gene interactions from diverse perspectives. The current version of PDIR includes 102 835 putative functional associations with balanced sensitivity and specificity, which are expected to cover 22.56% of all Drosophila protein interactions. This set of functional interactions is a good reference for hypothesis formulation in molecular mechanism studies. At the same time, these interactions also serve as a high-quality reference interactome for gene set linkage analysis (GSLA), which is a web tool for the interpretation of the potential functional impacts of a set of changed genes observed in transcriptomics analyses. In a case study, we show that the PDIR/GSLA system was able to produce a more comprehensive and concise interpretation of the collective functional impact of multiple simultaneously changed genes compared with the widely used gene set annotation tools, including PANTHER and David. PDIR and its associated GSLA service can be accessed at http://drosophila.biomedtzc.cn.
To facilitate biomedical studies of disease mechanisms, a high-quality interactome that connects functionally related genes is needed to help investigators formulate pathway hypotheses and to interpret the biological logic of a phenotype at the biological process level. Interactions in the updated version of the human interactome resource (HIR V2) were inferred from 36 mathematical characterizations of six types of data that suggest functional associations between genes. This update of the HIR consists of 88 069 pairs of genes (23.2% functional interactions of HIR V2 are in common with the previous version of HIR), representing functional associations that are of strengths similar to those between well-studied protein interactions. Among these functional interactions, 57% may represent protein interactions, which are expected to cover 32% of the true human protein interactome. The gene set linkage analysis (GSLA) tool is developed based on the high-quality HIR V2 to identify the potential functional impacts of the observed transcriptomic changes, helping to elucidate their biological significance and complementing the currently widely used enrichment-based gene set interpretation tools. A case study shows that the annotations reported by the HIR V2/GSLA system are more comprehensive and concise compared to those obtained by the widely used gene set annotation tools such as PANTHER and DAVID. The HIR V2 and GSLA are available at http://human.biomedtzc.cn.
BackgroundKandelia obovata is an important mangrove species extensively distributed in Eastern Asia that is susceptible to low-temperature stress. NAC (NAM, ATAF1/2 and CUC2) domain proteins are transcription factors (TFs) that play various roles in plant growth and development and in the plant response to environmental stresses. Nevertheless, genome-wide analyses of K. obovata NAC genes (KoNACs) and their responses to chilling stress have rarely been studied.MethodsThe KoNAC gene family was identified and characterized using bioinformatic analysis, the subcellular location of some NAC proteins was confirmed using confocal microscopy analysis, and the KoNACs that responded to chilling stress were screened using RNA-seq and qRT-PCR analysis.ResultsA total of 79 KoNACs were identified, and they were unequally distributed across all 18 chromosomes of K. obovata. The KoNAC proteins could be divided into 16 subgroups according to the phylogenetic tree based on NAC family members of Arabidopsis thaliana. The KoNACs exhibited greater synteny with A. thaliana sequences than with Oryza sativa sequences, indicating that KoNACs underwent extensive evolution after the divergence of dicotyledons and monocotyledons. Segmental duplication was the main driving force of the expansions of KoNAC genes. Confocal microscopy analysis verified that the four randomly selected KoNACs localized to the nucleus, indicating the accuracy of the bioinformatic predictions. Tissue expression pattern analysis demonstrated that some KoNAC genes showed tissue-specific expression, suggesting that these KoNACs might be important for plant development and growth. Additionally, the expression levels of 19 KoNACs were significantly (15 positively and 4 negatively) induced by cold treatment, demonstrating that these KoNACs might play important roles during cold stress responses and might be candidate genes for the genetic engineering of K. obovata with enhanced chilling stress tolerance. Coexpression network analysis revealed that 381 coexpressed pairs (between 13 KoNACs and 284 other genes) were significantly correlated.ConclusionsSeventy-nine KoNACs were identified in K. obovata, nineteen of which displayed chilling-induced expression patterns. These genes may serve as candidates for functional analyses of KoNACs engaged in chilling stress. Our results lay the foundation for evolutionary analyses of KoNACs and their molecular mechanisms in response to environmental stress.
Background The nematode worm, Caenorhabditis elegans, is a saprophytic species that has been emerging as a standard model organism since the early 1960s. This species is useful in numerous fields, including developmental biology, neurobiology, and ageing. A high-quality comprehensive molecular interaction network is needed to facilitate molecular mechanism studies in C. elegans. Results We present the predicted functional interactome of Caenorhabditis elegans (FIC), which integrates functional association data from 10 public databases to infer functional gene interactions on diverse functional perspectives. In this work, FIC includes 108,550 putative functional associations with balanced sensitivity and specificity, which are expected to cover 21.42% of all C. elegans protein interactions, and 29.25% of these associations may represent protein interactions. Based on FIC, we developed a gene set linkage analysis (GSLA) web tool to interpret potential functional impacts from a set of differentially expressed genes observed in transcriptome analyses. Conclusion We present the predicted C. elegans interactome database FIC, which is a high-quality database of predicted functional interactions among genes. The functional interactions in FIC serve as a good reference interactome for GSLA to annotate differentially expressed genes for their potential functional impacts. In a case study, the FIC/GSLA system shows more comprehensive and concise annotations compared to other widely used gene set annotation tools, including PANTHER and DAVID. FIC and its associated GSLA are available at the website http://worm.biomedtzc.cn.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.