Here we report the draft annotated genome sequence of Escherichia coli XH140A, which is used to produce L-threonine in industry. The genome sequence will allow the characterization of the molecular mechanisms underlying its beneficial properties.L-Threonine has been widely used as a supplement in the food, pharmaceutical, and cosmetics industries. Currently L-threonine is produced mainly by microbial fermentation. To date, E. coli mutant strains remain the dominant industrial producers of L-threonine (6). Although there are many papers (8,10,14) and patents (4, 12) reporting L-threonine-producing strains of E. coli, not much is known about the genome and plasmid sequences of these producer strains. Here, we sequenced the genome of Escherichia coli XH140A containing a plasmid, a high-yield L-threonine producer in industry.The genome was sequenced using the Illumina Solexa GA IIx instrument at Beijing Genomics Institute (BGI; Shenzhen, China). A library containing 500-bp inserts was constructed. Sequencing was performed with the paired-end strategy of 75-bp reads to produce 634 Mb of filtered sequences, representing a 136.6-fold coverage of the genome. The sequences were assembled into 184 contigs and 95 scaffolds using the SOAPdenovo package (11). The remaining intrascaffold gaps were closed by Sanger sequencing of PCR products. The complete nucleotide sequences of the plasmid pTHR were determined by Sanger sequencing using the primer walking strategy.Genome annotation was performed at the NCBI Prokaryotic Genomes Automatic Annotation Pipeline. Open reading frames were identified by Glimmer 3.02 (5) and Genemark (2). The resulting translations were used for a BLASTP (1) search against the GenBank NR database as well as the KEGG (7) and COG (15) databases. tRNA and rRNA genes were identified by tRNAscan-SE (13) and RNAmmer (9), respectively. The XH140A chromosome is 4,395,442 bp in length, with an average GϩC content of 50.80%. A total of 4,159 proteincoding genes, 72 tRNA genes, and 3 rRNA fragments were identified. Eighteen pseudogene were also identified, which suggested that Escherichia coli XH140A was selected by genetic engineering treatment. The plasmid pTHR of XH140A is 15,774 bp in length with an average GϩC content of 49.07%, which contains 13 protein-coding genes.In a comparative analysis of Escherichia coli XH140A with the reference genome Escherichia coli K-12 (3), we found an important mutation that may explain the molecular mechanisms for the production of L-threonine. A nonsense mutation in the acetolactate synthase 2 catalytic subunit gene ilvG blocked the pathway of L-threonine to L-isoleucine, which could make XH140A accumulate L-threonine. Interestingly, the plasmid pTHR did not have the threonine operon thrABC, which was different from what has been reported in several public papers and patents about L-threonine-producing strains (4, 6, 12). It is worth noting that the plasmid pTHR contained NAD/NADP transhydrogenase alpha-subunit and beta-subunit genes, which suggested that the balance of NAD...