Initiation of transcription by RNA polymerase II (RNAP II) onSaccharomyces cerevisiae messenger RNA (mRNA) genes typically occurs at multiple sites 40 -120 bp downstream of the TATA box. The mechanism that accommodates this extended and variable promoter architecture is unknown, but one model suggests that RNAP II forms an open promoter complex near the TATA box and then scans the template DNA strand for start sites. Unlike most proteincoding genes, small nuclear RNA gene transcription starts predominantly at a single position. We identify a highly efficient initiator element as the primary start site determinant for the yeast U4 small nuclear RNA gene, SNR14. Consistent with the scanning model, transcription of an SNR14 allele with tandemly duplicated start sites initiates primarily from the upstream site, yet the downstream site is recognized with equivalent efficiency by the diminished population of RNAP II molecules that encounter it. A quantitative in vivo assay revealed that SNR14 initiator efficiency is nearly perfect (ϳ90%), which explains the precision of U4 RNA 5 end formation. Initiator efficiency was reduced by cis-acting mutations at ؊8, ؊7, ؊1, and ؉1 and trans-acting substitutions in the TFIIB B-finger. These results expand our understanding of RNAP II initiation preferences and provide new support for the scanning model.
Eukaryotes rely on RNA polymerase II (RNAP II)2 to synthesize all messenger RNAs (mRNAs) and most of the small nuclear RNAs (snRNAs) and small nucleolar RNAs (snoRNAs) encoded within their nuclear genomes. Efficient and accurate transcription initiation is vital to ensure the proper expression and function of these RNAs. The recruitment of RNAP II to gene promoters is mediated through the assembly of a pre-initiation complex (PIC). RNAP II accessory proteins provide promoter specificity and the structural core for assembly of the PIC. These accessory proteins include the general transcription factors TFIID, TFIIB, TFIIF, TFIIE, and TFIIH (1-3). Some transcription factors engage in sequence-specific contacts with core promoter elements (4); one of the most fundamental interactions for PIC assembly is between the TATA-binding protein subunit of TFIID and the TATA box (5-6). In a stepwise model for PIC assembly, TATA-binding protein binding is followed by the addition of TFIIB, RNAP II-TFIIF, TFIIE, and TFIIH (7).In metazoans, the assembly of a PIC at the TATA box results in start site selection 25-30 bp downstream (4). The architecture of the PIC is such that the transcription start site is placed precisely within the active center of RNAP II (8 -9). In the yeast Saccharomyces cerevisiae, RNAP II initiation typically occurs at multiple sites at variable distances from the TATA box, with most start sites ranging from 40 to 120 bp downstream of the TATA box (10). The initiation mechanism that accommodates this extended and variable promoter architecture is unknown, but it does not appear to be dependent on assembling the yeast PIC in a manner different from that of metazoans. Yeast p...