The complete genome of Mycoplasma gallisepticum strain R low has been sequenced. The genome is composed of 996 422 bp with an overall G+C content of 31 mol%. It contains 742 putative coding DNA sequences (CDSs), representing a 91 % coding density. Function has been assigned to 469 of the CDSs, while 150 encode conserved hypothetical proteins and 123 remain as unique hypothetical proteins. The genome contains two copies of the rRNA genes and 33 tRNA genes. The origin of replication has been localized based on sequence analysis in the region of the dnaA gene. The vlhA family (previously termed pMGA) contains 43 genes distributed among five loci containing 8, 2, 9, 12 and 12 genes. This family of genes constitutes 10?4 % (103 kb) of the total genome. Two CDSs were identified immediately downstream of gapA and crmA encoding proteins that share homology to cytadhesins GapA and CrmA. Based on motif analysis it is predicted that 80 genes encode lipoproteins and 149 proteins contain multiple transmembrane domains. The authors have identified 75 proteins putatively involved in transport of biomolecules, 12 transposases, and a number of potential virulence factors. The completion of this sequence has spawned multiple projects directed at defining the biological basis of M. gallisepticum.
INTRODUCTIONPhylogenetic analyses indicate that mycoplasmas (class Mollicutes) have undergone a degenerative evolution from related, low G+C content, Gram-positive eubacteria (Rogers et al., 1985;Woese et al., 1980). The reduction of the mycoplasma genome has resulted in the loss of the cell wall and has limited the biosynthetic capabilities of these organisms. As a consequence of this loss of biosynthetic machinery, mycoplasmas are obligate parasites and rely on the uptake of many essential molecules from their environment.Mycoplasmas have long been considered model systems for defining the minimal set of genes required for a living cell (Morowitz, 1984). For this reason, it was not surprising when Mycoplasma genitalium (580 kb) was selected as one of the first targets for complete genome sequencing (Fraser et al., 1995). Since this initial report, the genomes of four additional mycoplasmas have been sequenced, Mycoplasma pneumoniae (816 kb; Dandekar et al., 2000;Himmelreich et al., 1996), Ureaplasma urealyticum (752 kb; Glass et al., 2000), Mycoplasma pulmonis (964 kb; Chambaud et al., 2001) and Mycoplasma penetrans (1358 kb; Sasaki et al., 2000). Theoretical and experimental approaches have estimated the minimum number of essential mycoplasma genes to be between 265 and 350 (Hutchison et al., 1999;Mushegian & Koonin, 1996).Abbreviations: CDS, coding DNA sequence; COGs, conserved orthologous groups.The GenBank accession number for the sequence reported in this paper is AE015450. Mycoplasma gallisepticum is an avian pathogen involved in chronic respiratory disease in chickens resulting in considerable economic losses in poultry production. Infection with this bacterium is spread by aerosol exposure and egg transmission. Outbreaks spread...