The primary structure and phosphorylation pattern of the tandem Y 3 and the spacing between coding units is flexible; the coding unit must contain two Tyr1 residues and the spacing between consecutive tyrosines is important; Ser5-PO 4 -Pro6 comprises an essential two-letter code "word" that is read by the mRNA capping apparatus; and a threshold number of Ser5-PO 4 -Pro6 words are needed to comprise a readable "sentence" of CTD information. Bypassing the essentiality of the Ser5 and Pro6 letters by fusion of capping enzymes to the CTD helped reveal how CTD phosphorylation circuits are wired in vivo. We found that the Ser2-PO 4 mark is independent of Ser5, Pro6, Ser7, and Thr4, whereas the Ser5-PO 4 mark is independent of Ser2, Ser7, and Thr4. These results provide unique insights to the reading and writing of the CTD code.CTD receptors | mRNA capping enzymes | serine phosphorylation T he carboxyl-terminal domain (CTD) of the Rpb1 subunit of RNA polymerase II (Pol II) consists of tandemly repeated heptapeptides of the consensus sequence Y 1 S 2 P 3 T 4 S 5 P 6 S 7 . The CTD is dispensable for RNA polymerase activity per se but is essential for cell viability because it recruits proteins that regulate transcription, modify chromatin structure, and catalyze or regulate mRNA capping, splicing, and polyadenylation (1, 2). The inherently plastic CTD structure is modulated by phosphorylation of the heptad serine, threonine, and tyrosine residues. Phosphates are added by CTD kinases that have varying positional specificities and act at different stages of the transcription cycle. Phosphates are removed by diverse classes of CTD phosphatase enzymes that also have varying positional specificities and temporal windows of action during the transcription cycle.The combinatorial complexity of the CTD serine-2,5,7, threonine-4, and tyrosine-1 phosphorylation array comprises up to 32 n distinct primary structures, where n is the number of heptad repeats; cis-trans isomerization at CTD residues Pro3 and Pro6 imparts another 4 n conformational complexity. Thus, the total number of potential CTD structures is 128 n . The instantaneous primary structure of the CTD provides informational cues about the state of the transcription machinery-a CTD code-that is "read" by CTD receptor proteins (3). Whereas one may gain important insights to CTD coding principles by probing biochemically and structurally how individual receptor proteins recognize the CTD, it is technically impossible with present crude methods (and perhaps conceptually unattainable à la the uncertainty principle) to know the instantaneous primary structure of the CTD of a transcribing Pol II complex and the ensemble of proteins that are simultaneously engaged on the CTD.These issues notwithstanding, the informational rules that govern the CTD code on a cellular and organismal level can be probed genetically by manipulating the composition and structure of the Rpb1 CTD (4-11). The fission yeast Schizosaccharomyces pombe is an attractive model system for CTD structure-functi...