The background linkage disequilibrium (LD) in genetic isolates is of great interest in human genetics. Although many empirical studies have evaluated the background LD in European isolates, such as the Finnish and Sardinians, few data from other regions, such as Asia, have been reported. To evaluate the extent of background LD in East Asian genetic isolates, we analyzed the X chromosome in the Japanese population and in four Mongolian populations (Khalkh, Khoton, Uriankhai, and Zakhchin), the demographic histories of which are quite different from one another. Fisher's exact test revealed that the Japanese and Khalkh, which are the expanded populations, had the same or a relatively higher level of LD than did the Finnish, European American, and Sardinian populations. In contrast, the Khoton, Uriankhai, and Zakhchin populations, which have kept their population size constant, had a higher background LD. These results were consistent with previous genetic anthropological studies in European isolates and indicate that the Japanese and Khalkh populations could be utilized in the fine mapping of both complex and monogenic diseases, whereas the Khoton, Uriankhai, and Zakhchin populations could play an important role in the initial mapping of complex disease genes.
Diffuse panbronchiolitis (DPB) is an unusual form of bronchiolar disease affecting exclusively East Asians. Strong associations of DPB with the class I human leukocyte antigens HLA-B54 in Japan and China and HLA-A11 in Korea suggest that the susceptible locus for DPB is located between the HLA-B and HLA-A genes. We have previously reported that the susceptibility gene for DPB could be localized within a 200-kb segment between the S and TFIIH loci in the HLA class I region, using refined microsatellite-based association mapping. However, no genes have been recognized in this candidate region to date. In order to identify a novel candidate gene for DPB from this segment, the expressed sequence tag databases were searched using the genomic sequence. As a result, a cDNA clone was isolated from a human lung cDNA library. This gene, designated C6orf37 (Chromosome 6 open reading frame 37), spans approximately 2.5 kb and consists of two exons encoding a 235-amino acid protein, sharing homology with the mucin-like domain of human zonadhesin, which is a sperm multiple-domain transmembrane protein with the sperm zona pellucida binding activity. Unexpectedly, RT-PCR analysis detected transcripts from the anti-sense DNA strand of this C6orf37 locus. The gene designated as C6orf37OS (C6orf37 Opposite Strand) and represented by these anti-sense transcripts contained no open reading frame. The transcripts from C6orf37 and C6orf37OS were observed in numerous tissues, with most-abundant expression in lung, kidney, and testis. Taken together, these results, especially the abundant expression in lung, indicate that C6orf37 and C6orf37OS are excellent candidate genes for DPB.
SummaryThe maximum likelihood estimation (MLE) is one of the most popular ways to estimate haplotype frequencies of a population with genotype data whose linkage phases are unknown. The MLE is commonly implemented in the use of the Expectation-Maximization (EM) algorithm. It is known that the EM algorithm carries the risk that an estimator may converge erroneously to one of the local maxima or saddle points of the likelihood surface, resulting in serious errors in the MLE of haplotype frequencies. In this note, by theoretical treatments we present the necessary and sufficient conditions that the local maxima or saddle points on the likelihood surface appear. As a rule of thumb, that the difference between the coupling and repulsive haplotype frequencies in phase known individuals is 3/2 times larger than the frequency of phase ambiguous individuals is the sufficient condition that the likelihood surface is unimodal. Moreover, we present the analytic solution to the biallelic two-locus problem, and construct a general algorithm to obtain the global maximum.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.