Despite many years of intense work investigating the function of nucleoid‐associated proteins in prokaryotes, their role in bacterial physiology remains largely unknown. The two‐dimensional protein patterns were compared and expression profiling was carried out on H‐NS‐deficient and wild‐type strains of Escherichia coli K‐12. The expression of approximately 5% of the genes and/or the accumulation of their protein was directly or indirectly altered in the hns mutant strain. About one‐fifth of these genes encode proteins that are involved in transcription or translation and one‐third are known to or were in silico predicted to encode cell envelope components or proteins that are usually involved in bacterial adaptation to changes in environmental conditions. The increased expression of several genes in the mutant resulted in a better ability of this strain to survive at low pH and high osmolarity than the wild‐type strain. In particular, the putative regulator, YhiX, plays a central role in the H‐NS control of genes required in the glutamate‐dependent acid stress response. These results suggest that there is a strong relationship between the H‐NS regulon and the maintenance of intracellular homeostasis.
The identification of molecular evolutionary mechanisms in eukaryotes is approached by a comparative genomics study of a homogeneous group of species classified as Hemiascomycetes. This group includes Saccharomyces cerevisiae, the first eukaryotic genome entirely sequenced, back in 1996. A random sequencing analysis has been performed on 13 different species sharing a small genome size and a low frequency of introns. Detailed information is provided in the 20 following papers. Additional tables available on websites describe the ca. 20 000 newly identified genes. This wealth of data, so far unique among eukaryotes, allowed us to examine the conservation of chromosome maps, to identify the`yeast-specific' genes, and to review the distribution of gene families into functional classes. This project conducted by a network of seven French laboratories has been designated`Gënolevures'. ß
Since its completion more than 4 years ago, the sequence of Saccharomyces cerevisiae has been extensively used and studied. The original sequence has received a few corrections, and the identification of genes has been completed, thanks in particular to transcriptome analyses and to specialized studies on introns, tRNA genes, transposons or multigene families. In order to undertake the extensive comparative sequence analysis of this program, we have entirely revisited the S. cerevisiae sequence using the same criteria for all 16 chromosomes and taking into account publicly available annotations for genes and elements that cannot be predicted. Comparison with the other yeast species of this program indicates the existence of 50 novel genes in segments previously considered as`intergenic' and suggests extensions for 26 of the previously annotated genes. ß
The primary analysis of the sequences for our Hemiascomycete random sequence tag (RST) project was performed using a combination of classical methods for sequence comparison and contig assembly, and of specifically written scripts and computer visualization routines. Comparisons were performed first against DNA and protein sequences from Saccharomyces cerevisiae, then against protein sequences from other completely sequenced organisms and, finally, against protein sequences from all other organisms. Blast alignments were individually inspected to help recognize genes within our random genomic sequences despite the fact that only parts of them were available. For each yeast species, validated alignments were used to infer the proper genetic code, to determine codon usage preferences and to calculate their degree of sequence divergence with S. cerevisiae. The quality of each genomic library was monitored from contig analysis of the DNA sequences. Annotated sequences were submitted to the EMBL database, and the general annotation tables produced served as a basis for our comparative description of the evolution, redundancy and function of the Hemiascomycete genomes described in other articles of this issue. ß
We report here the complete sequence of the mitochondrial (mt) genome of the pathogenic yeast Candida glabrata. This 20 kb mt genome is the smallest among sequenced hemiascomycetous yeasts. Despite its compaction, the mt genome contains the genes encoding the apocytochrome b (COB), three subunits of ATP synthetase (ATP6, 8 and 9), three subunits of cytochrome oxidase (COX1, 2 and 3), the ribosomal protein VAR1, 23 tRNAs, small and large ribosomal RNAs and the RNA subunit of RNase P. Three group I introns each with an intronic open reading frame are present in the COX1 gene. This sequence is available under accession number AJ511533.
We have analyzed the evolution of chromosome maps of Hemiascomycetes by comparing gene order and orientation of the 13 yeast species partially sequenced in this program with the genome map of Saccharomyces cerevisiae. From the analysis of nearly 8000 situations in which two distinct genes having homologs in S. cerevisiae could be identified on the sequenced inserts of another yeast species, we have quantified the loss of synteny, the frequency of single gene deletion and the occurrence of gene inversion. Traces of ancestral duplications in the genome of S. cerevisiae could be identified from the comparison with the other species that do not entirely coincide with those identified from the comparison of S. cerevisiae with itself. From such duplications and from the correlation observed between gene inversion and loss of synteny, a model is proposed for the molecular evolution of Hemiascomycetes. This model, which can possibly be extended to other eukaryotes, is based on the reiteration of events of duplication of chromosome segments, creating transient merodiploids that are subsequently resolved by single gene deletion events. ß
BackgroundMicroarray technologies produced large amount of data. In a previous study, we have shown the interest of k-Nearest Neighbour approach for restoring the missing gene expression values, and its positive impact of the gene clustering by hierarchical algorithm. Since, numerous replacement methods have been proposed to impute missing values (MVs) for microarray data. In this study, we have evaluated twelve different usable methods, and their influence on the quality of gene clustering. Interestingly we have used several datasets, both kinetic and non kinetic experiments from yeast and human.ResultsWe underline the excellent efficiency of approaches proposed and implemented by Bo and co-workers and especially one based on expected maximization (EM_array). These improvements have been observed also on the imputation of extreme values, the most difficult predictable values. We showed that the imputed MVs have still important effects on the stability of the gene clusters. The improvement on the clustering obtained by hierarchical clustering remains limited and, not sufficient to restore completely the correct gene associations. However, a common tendency can be found between the quality of the imputation method and the gene cluster stability. Even if the comparison between clustering algorithms is a complex task, we observed that k-means approach is more efficient to conserve gene associations.ConclusionsMore than 6.000.000 independent simulations have assessed the quality of 12 imputation methods on five very different biological datasets. Important improvements have so been done since our last study. The EM_array approach constitutes one efficient method for restoring the missing expression gene values, with a lower estimation error level. Nonetheless, the presence of MVs even at a low rate is a major factor of gene cluster instability. Our study highlights the need for a systematic assessment of imputation methods and so of dedicated benchmarks. A noticeable point is the specific influence of some biological dataset.
Comparisons of the 6213 predictedSaccharomyces cerevisiae open reading frame (ORF) products with sequences from organisms of other biological phyla differentiate genes commonly conserved in evolution from`maverick' genes which have no homologue in phyla other than the Ascomycetes. We show that a majority of the`maverick' genes have homologues among other yeast species and thus define a set of 1892 genes that, from sequence comparisons, appear`Ascomycetes-specific'. We estimate, retrospectively, that the S. cerevisiae genome contains 5651 actual protein-coding genes, 50 of which were identified for the first time in this work, and that the present public databases contain 612 predicted ORFs that are not real genes. Interestingly, the sequences of the`Ascomycetes-specific' genes tend to diverge more rapidly in evolution than that of other genes. Half of the`Ascomycetes-specific' genes are functionally characterized in S. cerevisiae, and a few functional categories are over-represented in them. ß
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.