In recent years a great deal of biochemical and genetic research has focused on the C-terminal domain (CTD) of the largest subunit (RPB1) of DNA-dependent RNA polymerase II. This strongly conserved domain of tandemly repeated heptapeptides has been linked functionally to important steps in the initiation and processing of mRNA transcripts in both animals and fungi. Although they are absolutely required for viability in these organisms, C-terminal tandem repeats do not occur in RPB1 sequences from diverse eukaryotic taxa. Here we present phylogenetic analyses of RPB1 sequences showing that canonical CTD heptads are strongly conserved in only a subset of eukaryotic groups, all apparently descended from a single common ancestor. Moreover, eukaryotic groups in which the most complex patterns of ontogenetic development occur are descended from this CTD-containing ancestor. Consistent with the results of genetic and biochemical investigations of CTD function, these analyses suggest that the enhanced control over RNA polymerase II transcription conveyed by acquired CTD͞protein interactions was an important step in the evolution of intricate patterns of gene expression that are a hallmark of large, developmentally complex eukaryotic organisms. development ͉ RPB1 ͉ transcription E ach of the core subunits present in all cellular DNAdependent RNA polymerase (pol) enzymes shares a common evolutionary origin (1, 2). The largest of these subunits contains eight highly conserved domains, designated regions A through H (2), which are common to all prokaryotic and eukaryotic homologues (3). Unlike all other members of this protein family, however, the largest subunit of eukaryotic RNA pol II has an additional C-terminal domain (CTD), comprising a varied number of tandemly repeated heptapeptides with the consensus sequence Tyr-1-Ser-2-Pro-3-Thr-4 -Ser-5-Pro-6 -Ser-7 (4). In animals and yeast, where the mechanics of transcription are understood most clearly, the CTD has become a focal point of investigations into the interactions between RNA pol II and a variety of transcription-related proteins.Reversible phosphorylation of the CTD regulates the cycling of RNA pol II between a hypophosphorylated (IIO) form, which is competent to enter the preinitiation complex, and a hyperphosphorylated (IIA) form capable of processive transcript elongation (5). Throughout this cycle the CTD binds essential transcription-related proteins that help to regulate gene expression (6-9), promote efficient elongation (10), and effectively couple transcription to pre-mRNA processing (11-15). So central is its role in these interactions that the CTD has been called, by one reviewer, ''the tail that wags the dog'' of RNA pol II (16).Tandemly repeated CTD heptads occur in all RNA pol II largest subunits isolated to date from animals, fungi, and green plants. Given the importance of the CTD for pol II function, it is not surprising that it has been conserved so strongly during the evolution of these groups. A typical CTD also is present in certain protists;...