The centromeric regions of all human chromosomes are characterized by distinct subsets of a diverse tandemly repeated DNA family, alpha satellite. On human chromosome 17, the predominant form of alpha satellite is a 2.7-kilobase-pair higher-order repeat unit consisting of 16 alphoid monomers. We present the complete nucleotide sequence of the 16-monomer repeat, which is present in 500 to 1,000 copies per chromosome 17, as well as that of a less abundant 15-monomer repeat, also from chromosome 17. These repeat units were -98% identical in sequence, differing by the exclusion of precisely 1 monomer from the 15-monomer repeat. Homologous unequal crossing-over is suggested as a probable mechanism by which the different repeat lengths on chromosome 17 were generated, and the putative site of such a recombination event is identified. The monomer organization of the chromosome 17 higher-order repeat unit is based, in part, on tandemly repeated pentamers. A similar pentameric suborganization has been previously demonstrated for alpha satellite of the human X chromosome. Despite the organizational similarities, substantial sequence divergence distinguishes these subsets. Hybridization experiments indicate that the chromosome 17 and X subsets are more similar to each other than to the subsets found on several other human chromosomes. We suggest that the chromosome 17 and X alpha satellite subsets may be related components of a larger alphoid subfamily which have evolved from a common ancestral repeat into the contemporary chromosome-specific subsets.Alpha satellite (alphoid DNA) is a complex family of tandemly repeated DNA found in primate genomes. Long tandem arrays of alpha satellite DNA based on a monomer repeat length of -171 base paiirs (bp) are located principally at the centromeres of primate chromosomes (3,15,16,19,20). In the human genome, these sequences have been identified at the centromeric regions of each human chromosome and constitute some 5% of total human DNA. Human alpha satellite was initially described as a 340-bp EcoRI repeat, comprising two diverged monomer halves of 169 and 171 bp (16,34). However, more recent studies have indicated that this DNA family is substantially more heterogeneous than was once believed. The dimeric EcoRI repeat itself is made up of several distinct subfamilies (11,22), and a number of different molecular configurations in the human genome have been described (7,10,18,28,29,32,33,35). Alpha satellite subsets have been identified on several human chromosomes, and the suggestion has been made that individual human chromosomes may each be characterized by distinct alpha satellite subsets defined by restriction enzyme periodicity and primary sequence (15,18,29).Alpha satellite DNA specific for the human X chromosome (alphaX) has been cloned and extensively characterized (28, 32, 35). The 2.0-kilobase-pair (kb) BamHI higherorder repeat unit from this chromosome consists of 12 tandem but diverged alpha satellite monomers arranged as two adjacent and related pentamer blocks plus ...