The importance of the Gallus gallus (chicken) as a model organism and agricultural animal merits a continuation of sequence assembly improvement efforts. We present a new version of the chicken genome assembly (Gallus_gallus-5.0; GCA_000002315.3), built from combined long single molecule sequencing technology, finished BACs, and improved physical maps. In overall assembled bases, we see a gain of 183 Mb, including 16.4 Mb in placed chromosomes with a corresponding gain in the percentage of intact repeat elements characterized. Of the 1.21 Gb genome, we include three previously missing autosomes, GGA30, 31, and 33, and improve sequence contig length 10-fold over the previous Gallus_gallus-4.0. Despite the significant base representation improvements made, 138 Mb of sequence is not yet located to chromosomes. When annotated for gene content, Gallus_gallus-5.0 shows an increase of 4679 annotated genes (2768 noncoding and 1911 protein-coding) over those in Gallus_gallus-4.0. We also revisited the question of what genes are missing in the avian lineage, as assessed by the highest quality avian genome assembly to date, and found that a large fraction of the original set of missing genes are still absent in sequenced bird species. Finally, our new data support a detailed map of MHC-B, encompassing two segments: one with a highly stable gene copy number and another in which the gene copy number is highly variable. The chicken model has been a critical resource for many other fields of study, and this new reference assembly will substantially further these efforts.
BackgroundThe major histocompatibility complex (MHC) is present within the genomes of all jawed vertebrates. MHC genes are especially important in regulating immune responses, but even after over 80 years of research on the MHC, much remains to be learned about how it influences adaptive and innate immune responses. In most species, the MHC is highly polymorphic and polygenic. Strong and highly reproducible associations are established for chicken MHC-B haplotypes in a number of infectious diseases. Here, we report (1) the development of a high-density SNP (single nucleotide polymorphism) panel for MHC-B typing that encompasses a 209,296 bp region in which 45 MHC-B genes are located, (2) how this panel was used to define chicken MHC-B haplotypes within a large number of lines/breeds and (3) the detection of recombinants which contributes to the observed diversity.MethodsA SNP panel was developed for the MHC-B region between the BG2 and CD1A1 genes. To construct this panel, each SNP was tested in end-point read assays on more than 7500 DNA samples obtained from inbred and commercially used egg-layer lines that carry known and novel MHC-B haplotypes. One hundred and one SNPs were selected for the panel. Additional breeds and experimentally-derived lines, including lines that carry MHC-B recombinant haplotypes, were then genotyped.ResultsMHC-B haplotypes based on SNP genotyping were consistent with the MHC-B haplotypes that were assigned previously in experimental lines that carry B2, B5, B12, B13, B15, B19, B21, and B24 haplotypes. SNP genotyping resulted in the identification of 122 MHC-B haplotypes including a number of recombinant haplotypes, which indicate that crossing-over events at multiple locations within the region lead to the production of new MHC-B haplotypes. Furthermore, evidence of gene duplication and deletion was found.ConclusionsThe chicken MHC-B region is highly polymorphic across the surveyed 209-kb region that contains 45 genes. Our results expand the number of identified haplotypes and provide insights into the contribution of recombination events to MHC-B diversity including the identification of recombination hotspots and an estimation of recombination frequency.Electronic supplementary materialThe online version of this article (doi:10.1186/s12711-015-0181-x) contains supplementary material, which is available to authorized users.
MHC haplotypes have a remarkable influence on whether tumors form following infection of chickens with oncogenic Marek’s disease herpesvirus. Although resistance to tumor formation has been mapped to a subregion of the chicken MHC-B region, the gene or genes responsible have not been identified. A full gene map of the subregion has been lacking. We have expanded the MHC-B region gene map beyond the 92-kb core previously reported for another haplotype revealing the presence of 46 genes within 242 kb in the Red Jungle Fowl haplotype. Even though MHC-B is structured differently, many of the newly revealed genes are related to loci typical of the MHC in other species. Other MHC-B loci are homologs of genes found within MHC paralogous regions (regions thought to be derived from ancient duplications of a primordial immune defense complex where genes have undergone differential silencing over evolutionary time) on other chromosomes. Still others are similar to genes that define the NK complex in mammals. Many of the newly mapped genes display allelic variability and fall within the MHC-B subregion previously shown to affect the formation of Marek’s disease tumors and hence are candidates for genes conferring resistance.
Telomeres are the specialized ends of chromosomes consisting of highly conserved repeat (5′-TTAGGG-3′)n sequences. Lack of information regarding the existence of an in vivo telomere clock function in birds, conflicting data regarding telomere array length in the chicken model, and the paucity of molecular telomere information for other avian species led us to study telomere array organization within and among 18 species and subspecies of birds. Most of the species contained between 2% and 4% telomere sequence per diploid genome. Arrays spanning 0.5–10 kb (Class I) and 10–40 kb (Class II) were observed in all of the species studied. Extremely long arrays, ranging from hundreds of kilobases to 1–2 Mb (Class III) were observed in all except two raptor species, the northern goshawk and American bald eagle. In chicken, there was evidence for shortening of the Class II arrays in vivo, based on intraindividual comparisons of somatic versus germline tissues in birds of different ages; terminally differentiated erythrocyte arrays were, on average, 2.3 kb shorter than sperm (germline) arrays. This study provides the first evidence for the existence of telomere arrays significantly larger than have been described for any vertebrate species to date and for developmentally programmed in vivo telomere shortening in the Aves taxa. The novel finding of megabase-sized telomere arrays may be an important feature of avian karyotypes that contain a large number of very small genetic units, the microchromosomes.
Nearly all genes presently mapped to chicken chromosome 16 (GGA 16) have either a demonstrated role in immune responses or are considered to serve in immunity by reason of sequence homology with immune system genes defined in other species. The genes are best described in regional units. Among these, the best known is the polymorphic major histocompatibility complex-B (MHC-B) region containing genes for classical peptide antigen presentation. Nearby MHC-B is a small region containing two CD1 genes, which encode molecules known to bind lipid antigens and which will likely be found in chickens to present lipids to specialized T cells, as occurs with CD1 molecules in other species. Another region is the MHC-Y region, separated from MHC-B by an intervening region of tandem repeats. Like MHC-B, MHC-Y is polymorphic. It contains specialized class I and class II genes and c-type lectin-like genes. Yet another region, separated from MHC-Y by the single nucleolar organizing region (NOR) in the chicken genome, contains olfactory receptor genes and scavenger receptor genes, which are also thought to contribute to immunity. The structure, distribution, linkages and patterns of polymorphism in these regions, suggest GGA 16 evolves as a microchromosome devoted to immune defense. Many GGA 16 genes are polymorphic and polygenic. At the moment most disease associations are at the haplotype level. Roles of individual MHC genes in disease resistance are documented in only a very few instances. Provided suitable experimental stocks persist, the availability of increasingly detailed maps of GGA 16 genes combined with new means for detecting genetic variability will lead to investigations defining the contributions of individual loci and more applications for immunogenetics in breeding healthy poultry.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.