“…The 497 population samples were carefully screened for duplications (i.e., the same individuals represented in two datasets), but some populations are represented by more than one sample (e.g., at the HLA-B locus, there are five different samples of the Japanese population.) Each population data file was annotated with a three-letter abbreviation assigning it to one of the 10 world regions shown in Figure 1 [3,13]: Sub-Saharan Africa (SSA), North Africa (NAF), Europe (EUR), Southwest Asia (SWA), Southeast Asia (SEA), Oceania (OCE), Australia (AUS), Northeast Asia (NEA), North America (NAM), South America (SAM) . These regional assignments were based on the origin of the population, in effect ignoring the last 1000 years of known human migration (e.g., people of European descent in the United States are assigned to the region EUR).…”