Retroviral integration involves two DNA substrates that play different roles. The viral DNA substrate is recognized by virtue of specific nucleotide sequences near the end of a double-stranded DNA molecule. The target DNA substrate is recognized at internal sites with little sequence preference; nucleosomal DNA appears to be preferred for this role. Despite this apparent asymmetry in the sequence, structure, and roles of the DNA substrates in the integration reaction, the existence of distinct binding sites for viral and target DNA substrates has been controversial. In this report, we describe the expression in Escherichia coli and purification of Moloney murine leukemia virus integrase as a fusion protein with glutathione S-transferase, characterization of its activity by using several model DNA substrates, and the initial kinetic characterization of its interactions with a model viral DNA substrate. We provide evidence for functionally and kinetically distinct binding sites for viral and target DNA substrates and describe a cross-linking assay for DNA binding at a site whose specificity is consistent with the target DNA binding site.Integration of the retroviral genome into host cell chromosomal DNA is required for viral replication. Genetic studies have demonstrated that retroviral integration requires two viral functions: the viral IN protein, encoded by the 3Ј end of the pol gene (14,28,38,41), and att sites located at the termini of the viral long terminal repeats (11,12,37; for reviews, see references 2,25,26,48,53). Integration proceeds in three steps. After reverse transcription in the cytoplasm of the infected cell, the 3Ј ends of each of the termini of the linear viral DNA are cleaved to remove the terminal two nucleotides. This step, termed 3Ј-end processing, generates a 2-nucleotide 5Ј-end overhang and 3Ј ends that terminate with the phylogenetically conserved CA dinucleotide (4, 24, 39). In the nucleus, a concerted cleavage and ligation joins the processed 3Ј viral DNA ends to 5Ј staggered sites on opposite strands of the host chromosomal DNA (3,4,16,20,24). This strand transfer reaction results in a gapped intermediate in which the viral DNA 5Ј ends and the host chromosomal DNA 3Ј ends are unjoined. The integration process is completed by repair of the gapped intermediate. This step generates short direct repeats that flank the integrated provirus. The proteins involved in this final step remain to be identified.The end-processing and strand transfer steps in integration require the activity of only a single protein, the virally encoded IN. Purified recombinant IN has been shown both to remove two nucleotides from the 3Ј end of model viral DNA substrates and to join the recessed 3Ј ends to target DNA in vitro (6, 8, 13, 30-32, 35, 36, 43, 46, 49, 50, 52). Both the 3Ј-end processing and strand transfer steps of integration proceed via one-step transesterification reactions (17) that require no exogenous energy source (3,16,20). The strand transfer reaction has been shown to be reversible in vitro b...