A series of BAL 31 deletions were constructed in the upstream region of the Saccharomyces cerevisiae CYC7 gene to determine sequences required for transcriptional initiation. These deletions identified the TATA box as an alternating A-T sequence at -160 and the initiation sequences as well as the spatial relationship between them. The TATA box was necessary for wild-type levels of expression of the CYC7 gene. Decreasing the distance between the TATA sequence and the initiation site did not alter gene expression, but the site of transcription was shifted 3'-ward. In most cases, transcription initiated at a number of sites, the 5'-most of which was the first suitable site greater than 45 base pairs 3' of the TATA sequence, suggesting a spatial relationship between these sequences, Consensus sequences previously proposed for initiation sites were evaluated with respect to the start sites identified in this study as well as the start sites of other yeast genes.The selection of the initiation site for the transcription of class II genes of higher eucaryotes is governed by an A+T-rich sequence called the TATA box which directs RNA polymerase II to begin transcription 30 base pairs (bp) downstream (2). Deletion of a TATA sequence results in either decreased gene expression, initiation at novel sites, or both (1,9,28,30). If the distance between the TATA sequence and the initiation site is altered, transcription begins at a new location 30 bp downstream from the relocated TATA sequence (1). No sequence preference at the start site is evident, although transcription often begins at an A residue.In Saccharomyces cerevisiae, TATA-like sequences have been identified in the upstream region of most class II genes and are thought to play a role in transcriptional initiation (45). However, for several reasons it is difficult to assess their importance by inspection of the sequence alone. For one, the regions upstream of yeast genes are generally A+T-rich, containing many candidates for TATA boxes. Second, many yeast genes have multiple start sites, making the assignments of TATA boxes to start sites difficult. Finally, in those cases in which the TATA sequence has been studied, there is not a clear relationship between the spacing between the TATA box and the initiation site.Recent studies of the CYC1 (17), HIS3-DEDI (6), HIS4 (35), and PH05 (42) genes have defined a range over which TATA sequences can act; deletions altering the spacing between the TATA sequence and the initiation site point to an effective range of 40 to 120 bp over which TATA sequences can direct transcription. Given the variability in the distance between TATA sequences and the initiation site, it follows that signals contained in sequences at the initiation site must play a role in determining where transcription should begin (4,6,13,17,32,35,42). However, the different studies have identified different sequences. Clearly, more data are required to carry out any comprehensive comparative study.The S. cerevisiae CYC7 gene codes for 5% of the cytochrome c pr...