The ribosome, as a catalyst for protein synthesis, is universal and essential for all organisms. Here we describe the structure of the genes encoding human ribosomal proteins (RPs) and compare this class of genes among several eukaryotes. Using genomic and full-length cDNA sequences, we characterized 73 RP genes and found that (1) transcription starts at a C residue within a characteristic oligopyrimidine tract; (2) the promoter region is GC rich, but often has a TATA box or similar sequence element; (3) the genes are small (4.4 kb), but have as many as 5.6 exons on average; (4) the initiator ATG is in the first or second exon and is within ± 5 bp of the first intron boundaries in about half of cases; and (5) 5Ј-and 3Ј-UTRs are significantly smaller (42 bp and 56 bp, respectively) than the genome average. Comparison of RP genes from humans, Drosophila melanogaster, Caenorhabditis elegans, and Saccharomyces cerevisiae revealed the coding sequences to be highly conserved (63% homology on average), although gene size and the number of exons vary. The positions of the introns are also conserved among these species as follows: 44% of human introns are present at the same position in either D. melanogaster or C. elegans, suggesting RP genes are highly suitable for studying the evolution of introns.[The sequence data described in this paper have been submitted to the DDBJ/EMBL/GenBank databases under accession nos. AB055762-AB055780, AB056456, AB061820-AB061859, AB062066-AB062071, and AB070559.]The ribosome is the cellular organelle responsible for protein synthesis in all cells. Recent analyses of the ribosome's structure using X-ray crystallography have enhanced our understanding of the structural basis of ribosome function (Ban et al. 2000;Schluenzen et al. 2000;Wimberly et al. 2000;Yusupov et al. 2001). In contrast, comparatively little is known about ribosome biogenesis, especially in higher eukaryotes. In mammalian cells, the biogenesis of cytoplasmic ribosomes requires assembly of 4 RNA molecules and 79 different proteins (Wool 1979). With the exception of two proteins, all of these components are present as single copies within the ribosome. Typically, mammalian cells contain ∼4 ן 10 6 cytoplasmic ribosomes, which account for 80% of all cellular RNA and 5%-10% of cellular proteins.Investigation of the mechanism that controls the coordinated expression of these components is a challenge. Three different RNA polymerases are involved in production of these RNAs and proteins, RNA polymerase I (POL I) is involved in production of the 28S, 18S, and 5.8S rRNAs, POL II in production of ribosomal proteins (RPs), and POL III in production of the 5S rRNA. The amino acid sequences of all rat and human RPs have been deduced (Wool et al. 1996), and the nucleotide sequences of thousands of eukaryotic rRNAs are now known (The Ribosome Database Project; Maidak et al. 2001). On the other hand, only a handful of mammalian RP genes have been studied in terms of their genomic structure. Unlike rRNAs, which are encoded by several ...