Two theories of the origins of the IndoEuropeans currently compete. M. Gimbutas believes that early Indo-Europeans entered southeastern Europe from the Pontic Steppes starting ca. 4500 B.C. and spread from there. C. Renfrew equates early Indo-Europeans with early farmers who entered southeastern Europe from Asia Minor ca. 7000 BC and spread through the continent. We tested genetic distance matrices for each of 25 systems in numerous Indo-Europeanspeaking samples from Europe. To match each of these matrices, we created other distance matrices representing geography, language, time since origin of agriculture, Gimbutas' model, and Renfrew's model. The correlation between genetics and language is signi t. Geography, when held constant, produces a markedly lower, yet still highly significant partial correlation between genetics and language, showing that more remains to be explained. However, none of the remaining three distances-time since origin of agriculture, Gimbutas' model, or Renfrew's model-reduces the partial correlation further. Thus, neither of the two theories appears able to explain the orign of the Indo-Europeans as gauged by the geneticslanguage correlation. Here we examine whether the genetic evidence available from modern European populations favors one of the two hypotheses on IE origins. Our approach is to examine correlations between genetic and linguistic distances in Europe and to estimate the effects ofvarious factors (geography, origin of agriculture) and hypothesized movements (Gimbutas' and Renfrew's models) on the magnitude of these correlations.
MATERIALS AND METHODSWe studied 25 genetic systems (erythrocyte antigens, plasma proteins, enzymes, histocompatibility alleles, immunoglobulins; Table 1) from 2111 IE-speaking samples in Europe. Details are specified elsewhere (10-14). We computed Prevosti's genetic distances (15, 16) (GEN) separately for the 479 to 27 localities (mean = 84) of each genetic system. Linguistic distances (LAN) were subjective estimates furnished by M. Ruhlen, based on his current classification of IE languages (17). A dendrogram (Fig. 1) resulting from UPGMA clustering (18) of the linguistic distance matrix shows the relations between the IE languages in that matrix. We computed great-circle geographic distances (GEO) between pairs of localities. The origin-of-agriculture distances (OOA) between any pair of points were described earlier (8). They sum distances from their respective starting times of agriculture back to their putative common agricultural origins.The Renfrew hypothesis distance (REN) matrix was based on ref. 4 and discussions with Professor Renfrew. In his view, most ofthe introduction and subsequent diversification ofthe IE language families in Europe was concurrent with the spread of agriculture in the continent. Nevertheless, Renfrew explains the final branching into the major language families by a series of 10 so-called transitions illustrated in ref. 4 (figure 7.7). These transitions are associated with specific archaeological assemblage...