Lactobacillus pentosus is the most prevalent lactic acid bacterium in Spanish-style green olive fermentations. Here we present the draft genome sequence of L. pentosus IG1, a bacteriocin-producing strain with biotechnological and probiotic properties isolated from this food fermentations.Lactobacilli have been largely associated with food fermentations, and more recently, they have been used as probiotics, since they may promote health in humans and animals (6). Lactobacillus pentosus is the lactic acid bacterium most frequently isolated from Spanish-style green olive fermentations, and it has been successfully used as a starter culture for these fermentations (8).In addition, certain strains of L. pentosus have been shown to exert probiotic effects, improving the mucosal immunity and the resistance to bacterial infections (2, 5). The genome sequence of Lactobacillus pentosus IG1, a bile-resistant strain displaying bacteriocin activity against a wide range of spoilage and pathogen bacteria, will allow us to explore its biotechnological and probiotic properties.The whole genome of IG1 was sequenced using a paired-end strategy by pyrosequencing technology with the platform 454 Life Sciences, GS FLX Titanium system. The sequence data consisted of 8-kb paired-end libraries (481,661 reads totaling ϳ175.5 Mb, ϳ45-fold coverage of the genome). The paired-end reads were assembled into 13 scaffolds (3,982,537 nt), of which scaffold 1 (3,687,424 nt) corresponds to the entire chromosome, while scaffolds 2 (125,904 nt), 3 (77,782 nt), 4 (36,648 nt), 5 (21,059 nt), 6 (10,521 nt), 11 (2,654 nt), and 12 (2,509 nt) seem to belong to seven distinct plasmids, according to the plasmid profile obtained for L. pentosus IG1.The draft genome of L. pentosus IG1 includes 3,982,537 nucleotide bases, making it the biggest lactobacillus genome described to date. The GϩC content of the chromosome (scaffold 1) was determined to be 44.9 mol%. There is a single predicted copy of the 16S and 23S rRNAs, 3 copies of the 5S rRNA, and 44 predicted tRNAs. The genome contains 3,133 predicted coding sequences, of which 2,972 belong to scaffold 1.Comparative genomic analysis indicated that the closest genome is that of Lactobacillus plantarum (4, 9, 10). Putative functions could be assigned to 82% of the encoded proteins, and the remaining 18% could not be placed (putative uncharacterized proteins). Remarkably, this strain harbors 16 putative two-component regulatory systems, which may reflect an extensive ability to adapt to changing environmental conditions. L. pentosus IG1 contains several genes that could encode putative probiotic functions involved in adhesion (collagen-binding proteins, exopolysaccharides, lipoteichoic acids), resistance to stress (mannose phosphotransferase systems, bile salt hydrolases), and microbial competition (LuxS-mediated production of AI-2-and AI-3-like molecules and bacteriocin production). The genes responsible for the bacteriocin activity of L. pentosus IG1 are identical to those of the two-component class IIb bacteri...