In both pedigree linkage studies and in population-based association studies there has been much interest in the use of modern dense genetic marker data to infer segments of gene identity by descent (ibd) among individuals not known to be related, to increase power and resolution in localizing genes affecting complex traits. In this article, we present a hidden Markov model (HMM) for ibd among a set of chromosomes and describe methods and software for inference of ibd among the four chromosomes of pairs of individuals, using either phased (haplotypic) or unphased (genotypic) data. The model allows for missing data and typing error, but does not model linkage disequilibrium (LD), because fitting an accurate LD model requires large samples from well-studied populations. However, LD remains a major confounding factor, since LD is itself a reflection of coancestry at the population level. To study the impact of LD, we have developed a novel simulation approach to generate realistic dense marker data for the same set of markers but at varying levels of LD. Using this approach, we present results of a study of the impact of LD on the sensitivity and specificity of our HMM model in estimating segments of ibd among sets of four chromosomes and between genotype pairs. We show that, despite not incorporating LD, our model has been quite successful in detecting segments as small as 10 6 bp (1 Mpb); we present also comparisons with fastIBD which uses an LD model in estimating ibd.
EVEN in large populations, sampled individuals may share genome inherited from a common ancestor on the order of tens of generations ago, at a time depth of up to 2000 years, and the probabilities are increased in samples from small populations, structured populations, or in individuals ascertained for particular traits. Generally, such segments of genome that are shared identically by descent (ibd) in remote relatives are rare but not short (Donnelly 1983). For example, the probability that a pair of human individuals separated by 20 meioses share any of their autosomal genome is 0.001, but, if it exists, an ibd segment is of order 5 · 10 6 bp (5 Mbp). For closer relatives, separated by 12 meioses, the chance of sharing some segment of autosomal genome is 0.148, while the expected length of the segment is of the same order of magnitude (8 Mbp).Modern dense SNP marker data provide information to detect such ibd segments in unknown relatives and thus increase information for genetic linkage mapping (Leutenegger et al. 2003;Albrechtsen et al. 2009). Indeed, the first such method developed by Leutenegger et al. (2003) to detect segments of homozygosity by descent in individuals affected by Taybi-Linder syndrome has recently led to gene discovery (Edery et al. 2011). In population samples also, detection of unknown remote segments of ibd enables association tests either to use (Browning and Browning 2010) or to adjust for (Choi et al. 2009) this coancestry in association methods of gene mapping. Combining inheritance information within the pe...