A family of short, interspersed repeats in the human genome, designated the Mst II family, is described. The canonical structure of the repeat consists of a 220-base-pair (bp) left arm joined to a 160-bp right arm by a 39-bp junction sequence. The right arm is absent in some isolates. Some homology with the "0" and "THE" (transposon-like element) families of repeats was observed, suggesting that the Mst II elements could be a subgroup of a SINE superfamily. The 39-bp junction sequence is tandemly repeated in one of our clones.The association of tandemly repetitive sequences with Mst H elements or the putative superfamily is probably nonrandom; a search of DNA sequence data bases revealed that approximately 80 bp of the Mst II left arm occurs immediately adjacent to the tandem repeat that comprises the human homologue to the BK virus enhancer. The fortuitous occurrence of a gene duplication event involving an Mst II repeat has allowed us to estimate a mutation rate for human DNA.The structural basis for genetic variation in many highly polymorphic regions of the human genome is the tandem repetition of 14-to 65-base-pair (bp) nucleotide sequences (1-5). The variable tandem repeat (VTR) approximately 1.5 kilobases (kb) downstream from the cellular homologue of the Harvey rat sarcoma virus oncogene, c-Ha-rasl (HRASI in human gene nomenclature) (2), generates at least 25 allelic forms in which a 28-bp consensus sequence is reiterated 30-100 times (6, 7). Many of the rare alleles at this locus are detected only in cancer patients (6, 7). Since such hyperallelism might be exploited in quantitating the degree of genomic instability in human populations, we have sought to identify other regions related to the HRASI VTR.As a result of this search, we identified a highly polymorphic region distinct from the HRASI VTR (7). Sequence analysis of this region, designated 4.1, revealed the head-totail aggregation of a 35-bp monomer (8). When 4.1 was used as a probe for other polymorphic clones, 0.5-1% of the phage in a human DNA library were recognized by in situ plaque hybridization. We report here the characterization of the repeat family responsible for this result, including the observation of its intimate association with VTRs.
MATERIALS AND METHODSGenomic Library, Vectors, and Bacterial Strains. A human genomic library in Charon 4A (9) was kindly provided by Tom Maniatis. It was propagated in Escherichia coli LE392. Phage lysates were also prepared from this strain. Plasmid subclones were propagated in HB101 or JM83 with pBR322 or pUC9, respectively. DNA Transfer Hybridization. DNA was transferred to nitrocellulose filters (Schleicher & Schuell) by methods previously described for phage plaques (10) or agarose gels (11). Low-stringency hybridization was performed at 50'C in 5x SET buffer (1x SET buffer is 0.15 M NaCl/10 mM EDTA/10 mM Tris, pH 7.5) and 0.5% NaDodSO4. Three or four washes were performed at 37°C (unless otherwise stated) in 2x SET buffer/0.5% NaDodSO4 for 30 min each. The radioactive probe was prepar...