PRD1 is a broad-host-range virus that infects Escherichia coli cells. It has a linear double-stranded DNA genome that replicates by a protein-primed mechanism. The virus particle is composed of a protein coat enclosing a lipid membrane. On the basis of this structure, PRD1 is being used as a membrane biosynthesis and structure model. In this investigation, we constructed the transcription map of the 15-kb-long phage genome. This was achieved by a computer search of putative promoters, which were then tested for activity by primer extension and for the capability to promote the synthesis of chloramphenicol acetyltransferase.Bacteriophage PRD1 belongs to a group of closely related viruses infecting various gram-negative bacteria harboring plasmids of the P, N, or W incompatibility group (4,7,23 (3,9,33).From genetic and biochemical information, it was evident that at least 25 gene products are encoded from the genome (19,20). Most of them are small membrane-associated proteins. Sequencing of the 14,925-bp-long genome led to the firm assignment of 19 genes and localization of 14 additional open reading frames (6,28,29,32). The early proteins needed for DNA replication and regulation are located at genome ends, whereas the genes coding for structural proteins and assembly factors are located in the central part of the genome. The genes with identified functions are designated by roman numerals, and the corresponding gene products (proteins) are designated by P and the corresponding arabic numeral.On the basis of their temporal expression, the PRD1 proteins were divided into early, middle early, and late classes (20). The early proteins were the genome terminal protein (P8) and the DNA polymerase (P1) from the left end of the genome (33, 34) and a single-stranded-DNA-binding protein, P12, from the right end of the genome (27 (13,14).From the PRD1 genomic sequence data, it was obvious that transcriptional regulation at both the promoter and terminator levels was complex, and only two of the genes are transcribed from right to left. We decided to begin the promoter search with a computer screen. Statistical and neural network programs, trained on a data base of E. coli promoters, were used to search the PRD1 sequence for promoter sites (24, 25). On E. coli targets, these programs would be expected to capture >75% of true promoters while generating one or two false positives per 1,000 bases. The activities of the promoters, among the promoter-like sequences obtained by computer search, were determined by two experimental methods: (i) screening the ability of cloned PRD1 genomic fragments to promote chloramphenicol acetyltransferase (CAT) expression and (ii) mapping the transcriptional start sites by primer extension. Possible terminators were screened by searching for local highly stable hairpin loops from the genomic sequence. Here we present the transcription map of the PRD1 genome, including the previously determined two early promoters.
MATERUILS AND METHODSBacterial strains, plasmids, and phage. Bacteriophage PRD1 ...