The entire DNA sequence of chromosome III of the yeast Saccharomyces cerevisiae has been determined. This is the first complete sequence analysis of an entire chromosome from any organism. The 315-kilobase sequence reveals 182 open reading frames for proteins longer than 100 amino acids, of which 37 correspond to known genes and 29 more show some similarity to sequences in databases. Of 55 new open reading frames analysed by gene disruption, three are essential genes; of 42 non-essential genes that were tested, 14 show some discernible effect on phenotype and the remaining 28 have no overt function.
With the goal of solving the whole-cell problem with Escherichia coli K-12 as a model cell, highly accurate genomes were determined for two closely related K-12 strains, MG1655 and W3110. Completion of the W3110 genome and comparison with the MG1655 genome revealed differences at 267 sites, including 251 sites with short, mostly single-nucleotide, insertions or deletions (indels) or base substitutions (totaling 358 nucleotides), in addition to 13 sites with an insertion sequence element or defective prophage in only one strain and two sites for the W3110 inversion. Direct DNA sequencing of PCR products for the 251 regions with short indel and base disparities revealed that only eight sites are true differences. The other 243 discrepancies were due to errors in the original MG1655 sequence, including 79 frameshifts, one amino-acid residue deletion, five amino-acid residue insertions, 73 missense, and 17 silent changes within coding regions. Errors in the original MG1655 sequence (o1 per 13 000 bases) were mostly within portions sequenced with out-dated technology based on radioactive chemistry.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.