Streptococcus thermophilus strain ND03 is a Chinese commercial dairy starter used for the manufacture of yogurt. It was isolated from naturally fermented yak milk in Qinghai, China. We present here the complete genome sequence of ND03 and compare it to three other published genomes of Streptococcus thermophilus strains.Streptococcus thermophilus strain ND03 was isolated from naturally fermented yak milk in Qinghai, China (10). It has many excellent processing properties, such as flavor, acidity, viscosity, and water holding. This strain has been implemented in the industrial production of dairy starter cultures by Inner Mongolia Yili Industrial Group Company, Ltd., the largest dairy corporation in China.Whole-genome sequencing of S. thermophilus strain ND03 was performed with a combined strategy of 454 sequencing (9) and Solexa paired-end sequencing technology (1). Genomic libraries containing 3-kb inserts were constructed, and 124,126 paired-end reads and 28,120 singleend reads were generated using the GS FLX system, giving 20.5-fold coverage of the genome. The majority (93.5%) of reads were assembled into seven large scaffolds, including 86 nonredundant contigs, using the 454 Newbler assembler (454 Life Sciences, Branford, CT). A total of 5,647,930 reads (2.5-kb library) were generated to reach a depth of 163-fold coverage with an Illumina Solexa GA IIx (Illumina, San Diego, CA) and mapped to the scaffolds using BurrowsWheeler alignment (BWA) (7). The gaps between scaffolds were filled by sequencing PCR products using an ABI 3730 capillary sequencer. The genome analysis was performed as described previously (4, 5).The complete genome sequence of ND03 contains a circular 1,831,957-bp chromosome with a GC content of 39.1%. There are 2,038 genes in total, including 1,919 coding genes, five rRNA operons, and 56 tRNAs in the ND03 genome.Comparison of the LMG18311 (2), CNRZ1066 (2), LMD-9 (8), and ND03 genomes revealed that they were highly similar, with the exception of 73 encoding genes that are uniquely present in ND03 but not in the other three strains. Some of the unique genes formed six large insertion islands that were comprised by transposase, glutamate decarboxylase, acetyltransferase, glycosyltransferase, polysaccharide biosynthesis protein, and the exopolysaccharide (EPS) biosynthesis gene cluster.Similar to other dairy bacteria, S. thermophilus is able to synthesize EPSs that lead to an improvement in the viscosity and texture of yogurt (3). The ND03 genome carries a unique 23.4-kb EPS gene cluster (STND_1010 to STND_1035), which contains 10 EPS-related genes and six intact or truncated insertions (IS). Four of the EPS-related genes in the cluster, epsA, epsB, epsC, and epsD, were conserved between all four genomes in comparisons. These genes are involved in the regulation, polymerization, and chain length determination and export of the EPS. The remaining six genes (epsE, epsF, epsG, epsI, epsJ, and epsP) in the EPS gene cluster were uniquely present in ND03 and regarded as the key enzymes to determin...