Chromosome 6 is a metacentric chromosome that constitutes about 6% of the human genome. The finished sequence comprises 166,880,988 base pairs, representing the largest chromosome sequenced so far. The entire sequence has been subjected to high-quality manual annotation, resulting in the evidence-supported identification of 1,557 genes and 633 pseudogenes. Here we report that at least 96% of the protein-coding genes have been identified, as assessed by multi-species comparative sequence analysis, and provide evidence for the presence of further, otherwise unsupported exons/genes. Among these are genes directly implicated in cancer, schizophrenia, autoimmunity and many other diseases. Chromosome 6 harbours the largest transfer RNA gene cluster in the genome; we show that this cluster co-localizes with a region of high transcriptional activity. Within the essential immune loci of the major histocompatibility complex, we find HLA-B to be the most polymorphic gene on chromosome 6 and in the human genome.
Olfactory receptor (OR) loci frequently cluster and are present on most human chromosomes. They are members of the seven transmembrane receptor (7-TM) superfamily and, as such, are part of one of the largest mammalian multigene families, with an estimated copy number of up to 1000 ORs per haploid genome. As their name implies, ORs are known to be involved in the perception of odors and possibly also in other, nonolfaction-related, functions. Here, we report the characterization of ORs that are part of the MHC-linked OR clusters in human and mouse (partial sequence only). These clusters are of particular interest because of their possible involvement in olfaction-driven mate selection. In total, we describe 50 novel OR loci (36 human, 14 murine), making the human MHC-linked cluster the largest sequenced OR cluster in any organism so far. Comparative and phylogenetic analyses confirm the cluster to be MHC-linked but divergent in both species and allow the identification of at least one ortholog that will be useful for future regulatory and functional studies. Quantitative feature analysis shows clear evidence of duplications of blocks of OR genes and reveals the entire cluster to have a genomic environment that is very different from its neighboring regions. Based on in silico transcript analysis, we also present evidence of extensive long-distance splicing in the 5Ј-untranslated regions and, for the first time, of alternative splicing within the single coding exon of ORs. Taken together with our previous finding that ORs are also polymorphic, the presented data indicate that the expression, function, and evolution of these interesting genes might be more complex than previously thought.
Human major histocompatibility (human leucocyte antigen (HLA)) complex-linked odorant receptor (OR) genes are among the best characterized OR genes in the human genome. In addition to their functions as odorant receptors in olfactory epithelium, they have been suggested to play a role in the fertilization process. Here, we report the first in-depth analysis of their expression and regulation within testicular tissue. Sixteen HLA-linked OR and three non-HLA-linked OR were analyzed. One OR gene (hs6M1-16, in positive transcriptional orientation) exhibited six different transcriptional start sites combined with extensive alternative splicing within the 5 -untranslated region, the coding exon, and the 3 -untranslated region. Long distance splicing, exon sharing, and premature polyadenylation were features of another three OR loci (hs6M1-18, -21, and -27, all upstream of hs6M1-16, but in negative transcriptional orientation). Determination of the transcriptional start sites of these OR genes identified a region of 81 bp with potential bi-directional transcriptional activity. The results demonstrate that HLA-linked OR genes are subject to unusually complex transcriptional regulatory mechanisms.
The ParaHox genes consist of 3 homeobox gene families, Gsx, Xlox, and Cdx, all of which have fundamental roles in development. Xlox (known as IPF1 or PDX1 in vertebrates), for example, is crucial for development of the vertebrate pancreas and is also involved in regulation of insulin expression. The invertebrate amphioxus has a gene cluster containing one gene from each of the gene families, whereas in all vertebrates examined to date there are additional copies resultant from ParaHox gene cluster duplications at the base of the vertebrate lineage. Extant vertebrates basal to bony and cartilaginous fish are central to the question of when and how these multiple genes arose in the vertebrate genome. Here, we report the mapping of a ParaHox gene cluster in 2 species of hagfishes. Unexpectedly, these basal vertebrates have lost a functional Xlox gene from this cluster, unlike every other vertebrate examined to date. Furthermore, our phylogenetic analyses suggest that hagfishes may have diverged from the vertebrate lineage before the duplications, which created the multiple ParaHox clusters in jawed vertebrates.
We report the 897 kb sequence of a cluster of olfactory receptor (OR) genes located at the distal end of the major histocompatibility complex (MHC) class I region on mouse chromosome 17 of strain 129/SvJ (H2bc). With additional information from the mouse genome draft sequence, we identified 59 OR loci (approximately 20% pseudogenes) in contrast to only 25 OR loci (approximately 50% pseudogenes) in the corresponding centromeric OR cluster that is part of the 'extended MHC class I region' on human chromosome 6. Comparative analysis leads to three major observations: (i) most of the OR subfamilies have evolved independently in the two species, expanding more in the mouse, and resulting in co-orthologs--subfamilies of highly similar paralogs that keep orthologous relationships with their human counterparts; (ii) three of the mouse OR subfamilies have no orthologs in humans; and (iii) MHC class I loci are interspersed in the OR cluster in mouse but not in human, and were subjected to co-duplication with OR genes. Screening of our sequence against the available sequences of other strains/haplotypes revealed that most of the OR loci are polymorphic and that the number of OR loci may vary among strains/haplotypes. Our findings that MHC-linked OR loci share duplication with MHC class I loci, have duplicated extensively and are polymorphic revives questions about potential reciprocal influences acting on the dynamics and evolution of the H2 region and the H2-linked OR loci.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.