BackgroundIn this study, a single-trait genomic model (STGM) is compared with a multiple-trait genomic model (MTGM) for genomic prediction using conventional estimated breeding values (EBVs) calculated using a conventional single-trait and multiple-trait linear mixed models as the response variables. Three scenarios with and without missing data were simulated; no missing data, 90% missing data in a trait with high heritability, and 90% missing data in a trait with low heritability. The simulated genome had a length of 500 cM with 5000 equally spaced single nucleotide polymorphism markers and 300 randomly distributed quantitative trait loci (QTL). The true breeding values of each trait were determined using 200 of the QTLs, and the remaining 100 QTLs were assumed to affect both the high (trait I with heritability of 0.3) and the low (trait II with heritability of 0.05) heritability traits. The genetic correlation between traits I and II was 0.5, and the residual correlation was zero.ResultsThe results showed that when there were no missing records, MTGM and STGM gave the same reliability for the genomic predictions for trait I while, for trait II, MTGM performed better that STGM. When there were missing records for one of the two traits, MTGM performed much better than STGM. In general, the difference in reliability of genomic EBVs predicted using the EBV response variables estimated from either the multiple-trait or single-trait models was relatively small for the trait without missing data. However, for the trait with missing data, the EBV response variable obtained from the multiple-trait model gave a more reliable genomic prediction than the EBV response variable from the single-trait model.ConclusionsThese results indicate that MTGM performed better than STGM for the trait with low heritability and for the trait with a limited number of records. Even when the EBV response variable was obtained using the multiple-trait model, the genomic prediction using MTGM was more reliable than the prediction using the STGM.
BackgroundArtificial selection for economically important traits in cattle is expected to have left distinctive selection signatures on the genome. Access to high-density genotypes facilitates the accurate identification of genomic regions that have undergone positive selection. These findings help to better elucidate the mechanisms of selection and to identify candidate genes of interest to breeding programs.ResultsInformation on 705 243 autosomal single nucleotide polymorphisms (SNPs) in 3122 dairy and beef male animals from seven cattle breeds (Angus, Belgian Blue, Charolais, Hereford, Holstein-Friesian, Limousin and Simmental) were used to detect selection signatures by applying two complementary methods, integrated haplotype score (iHS) and global fixation index (FST). To control for false positive results, we used false discovery rate (FDR) adjustment to calculate adjusted iHS within each breed and the genome-wide significance level was about 0.003. Using the iHS method, 83, 92, 91, 101, 85, 101 and 86 significant genomic regions were detected for Angus, Belgian Blue, Charolais, Hereford, Holstein-Friesian, Limousin and Simmental cattle, respectively. None of these regions was common to all seven breeds. Using the FST approach, 704 individual SNPs were detected across breeds. Annotation of the regions of the genome that showed selection signatures revealed several interesting candidate genes i.e. DGAT1, ABCG2, MSTN, CAPN3, FABP3, CHCHD7, PLAG1, JAZF1, PRKG2, ACTC1, TBC1D1, GHR, BMP2, TSG1, LYN, KIT and MC1R that play a role in milk production, reproduction, body size, muscle formation or coat color. Fifty-seven common candidate genes were found by both the iHS and global FST methods across the seven breeds. Moreover, many novel genomic regions and genes were detected within the regions that showed selection signatures; for some candidate genes, signatures of positive selection exist in the human genome. Multilevel bioinformatic analyses of the detected candidate genes suggested that the PPAR pathway may have been subjected to positive selection.ConclusionsThis study provides a high-resolution bovine genomic map of positive selection signatures that are either specific to one breed or common to a subset of the seven breeds analyzed. Our results will contribute to the detection of functional candidate genes that have undergone positive selection in future studies.Electronic supplementary materialThe online version of this article (doi:10.1186/s12711-015-0127-3) contains supplementary material, which is available to authorized users.
BackgroundTraditionally, Chinese indigenous sheep were classified geographically and morphologically into three groups: Mongolian, Kazakh and Tibetan. Herein, we aimed to evaluate the population structure and genome selection among 140 individuals from ten representative Chinese indigenous sheep breeds: Ujimqin, Hu, Tong, Large-Tailed Han and Lop breed (Mongolian group); Duolang and Kazakh (Kazakh group); and Diqing, Plateau-type Tibetan, and Valley-type Tibetan breed (Tibetan group).ResultsWe analyzed the population using principal component analysis (PCA), STRUCTURE and a Neighbor-Joining (NJ)-tree. In PCA plot, the Tibetan and Mongolian groups were clustered as expected; however, Duolang and Kazakh (Kazakh group) were segregated. STRUCTURE analyses suggested two subpopulations: one from North China (Kazakh and Mongolian groups) and the other from the Southwest (Tibetan group). In the NJ-tree, the Tibetan group formed an independent branch and the Kazakh and Mongolian groups were mixed. We then used the di statistic approach to reveal selection in Chinese indigenous sheep breeds. Among the 599 genome sequence windows analyzed, sixteen (2.7%) exhibited signatures of selection in four or more breeds. We detected three strong selection windows involving three functional genes: RXFP2, PPP1CC and PDGFD. PDGFD, one of the four subfamilies of PDGF, which promotes proliferation and inhibits differentiation of preadipocytes, was significantly selected in fat type breeds by the Rsb (across pairs of populations) approach. Two consecutive selection regions in Duolang sheep were obviously different to other breeds. One region was in OAR2 including three genes (NPR2, SPAG8 and HINT2) the influence growth traits. The other region was in OAR 6 including four genes (PKD2, SPP1, MEPE, and IBSP) associated with a milk production quantitative trait locus. We also identified known candidate genes such as BMPR1B, MSRB3, and three genes (KIT, MC1R, and FRY) that influence lambing percentage, ear size and coat phenotypes, respectively.ConclusionsBased on the results presented here, we propose that Chinese native sheep can be divided into two genetic groups: the thin type (Tibetan group), and the fat type (Mongolian and Kazakh group). We also identified important genes that drive valuable phenotypes in Chinese indigenous sheep, especially PDGFD, which may influence fat deposition in fat type sheep.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-015-1384-9) contains supplementary material, which is available to authorized users.
BackgroundGrowth and meat production traits are significant economic traits in sheep. The aim of the study is to identify candidate genes affecting growth and meat production traits at genome level with high throughput single nucleotide polymorphisms (SNP) genotyping technologies.Methodology and ResultsUsing Illumina OvineSNP50 BeadChip, we performed a GWA study in 329 purebred sheep for 11 growth and meat production traits (birth weight, weaning weight, 6-month weight, eye muscle area, fat thickness, pre-weaning gain, post-weaning gain, daily weight gain, height at withers, chest girth, and shin circumference). After quality control, 319 sheep and 48,198 SNPs were analyzed by TASSEL program in a mixed linear model (MLM). 36 significant SNPs were identified for 7 traits, and 10 of them reached genome-wise significance level for post-weaning gain. Gene annotation was implemented with the latest sheep genome Ovis_aries_v3.1 (released October 2012). More than one-third SNPs (14 out of 36) were located within ovine genes, others were located close to ovine genes (878bp-398,165bp apart). The strongest new finding is 5 genes were thought to be the most crucial candidate genes associated with post-weaning gain: s58995.1 was located within the ovine genes MEF2B and RFXANK, OAR3_84073899.1, OAR3_115712045.1 and OAR9_91721507.1 were located within CAMKMT, TRHDE, and RIPK2 respectively. GRM1, POL, MBD5, UBR2, RPL7 and SMC2 were thought to be the important candidate genes affecting post-weaning gain too. Additionally, 25 genes at chromosome-wise significance level were also forecasted to be the promising genes that influencing sheep growth and meat production traits.ConclusionsThe results will contribute to the similar studies and facilitate the potential utilization of genes involved in growth and meat production traits in sheep in future.
Tibetan sheep have lived on the Tibetan Plateau for thousands of years; however, the process and consequences of adaptation to this extreme environment have not been elucidated for important livestock such as sheep. Here, seven sheep breeds, representing both highland and lowland breeds from different areas of China, were genotyped for a genome-wide collection of single-nucleotide polymorphisms (SNPs). The FST and XP-EHH approaches were used to identify regions harbouring local positive selection between these highland and lowland breeds, and 236 genes were identified. We detected selection events spanning genes involved in angiogenesis, energy production and erythropoiesis. In particular, several candidate genes were associated with high-altitude hypoxia, including EPAS1, CRYAA, LONP1, NF1, DPP4, SOD1, PPARG and SOCS2. EPAS1 plays a crucial role in hypoxia adaption; therefore, we investigated the exon sequences of EPAS1 and identified 12 mutations. Analysis of the relationship between blood-related phenotypes and EPAS1 genotypes in additional highland sheep revealed that a homozygous mutation at a relatively conserved site in the EPAS1 3′ untranslated region was associated with increased mean corpuscular haemoglobin concentration and mean corpuscular volume. Taken together, our results provide evidence of the genetic diversity of highland sheep and indicate potential high-altitude hypoxia adaptation mechanisms, including the role of EPAS1 in adaptation.
BackgroundIn recent years, genome-wide association studies have successfully uncovered single-nucleotide polymorphisms (SNPs) associated with complex traits such as diseases and quantitative phenotypes. These variations account for a small proportion of heritability. With the development of high throughput techniques, abundant submicroscopic structural variations have been found in organisms, of which the main variations are copy number variations (CNVs). Therefore, CNVs are increasingly recognized as an important and abundant source of genetic variation and phenotypic diversity.ResultsAnalyses of CNVs in the genomes of three sheep breeds were performed using the Ovine SNP50 BeadChip array. A total of 238 CNV regions (CNVRs) were identified, including 219 losses, 13 gains, and six with both events (losses and gains), which cover 60.35 Mb of the sheep genomic sequence and correspond to 2.27% of the autosomal genome sequence. The length of the CNVRs on autosomes range from 13.66 kb to 1.30 Mb with a mean size of 253.57 kb, and 75 CNVRs events had a frequency > 3%. Among these CNVRs, 47 CNVRs identified by the PennCNV overlapped with the CNVpartition. Functional analysis indicated that most genes in the CNVRs were significantly enriched for involvement in the environmental response. Furthermore, 10 CNVRs were selected for validation and 6 CNVRs were further experimentally confirmed by qPCR. In addition, there were 57 CNVRs overlapped in our new dataset and other published ruminant CNV studies.ConclusionsIn this study, we firstly constructed a sheep CNV map based on the Ovine SNP50 array. Our results demonstrated the differences of two detection tools and integration of multiple algorithms can enhance the detection of sheep genomic structure variations. Furthermore, our findings would be of help for understanding the sheep genome and provide preliminary foundation for carrying out the CNVs association studies with economically important phenotypes of sheep in the future.
Chinese indigenous sheep can be classified into three types based on tail morphology: fat-tailed, fat-rumped, and thin-tailed sheep, of which the typical breeds are large-tailed Han sheep, Altay sheep, and Tibetan sheep, respectively. To unravel the genetic mechanisms underlying the phenotypic differences among Chinese indigenous sheep with tails of three different types, we used ovine high-density 600K SNP arrays to detect genome-wide copy number variation (CNV). In large-tailed Han sheep, Altay sheep, and Tibetan sheep, 371, 301, and 66 CNV regions (CNVRs) with lengths of 71.35 Mb, 51.65 Mb, and 10.56 Mb, respectively, were identified on autosomal chromosomes. Ten CNVRs were randomly chosen for confirmation, of which eight were successfully validated. The detected CNVRs harboured 3130 genes, including genes associated with fat deposition, such as PPARA, RXRA, KLF11, ADD1, FASN, PPP1CA, PDGFA, and PEX6. Moreover, multilevel bioinformatics analyses of the detected candidate genes were significantly enriched for involvement in fat deposition, GTPase regulator, and peptide receptor activities. This is the first high-resolution sheep CNV map for Chinese indigenous sheep breeds with three types of tails. Our results provide valuable information that will support investigations of genomic structural variation underlying traits of interest in sheep.
Whole genome sequencing studies are essential to obtain a comprehensive understanding of the vast pattern of human genomic variations. Here we report the results of a high-coverage whole genome sequencing study for 44 unrelated healthy Caucasian adults, each sequenced to over 50-fold coverage (averaging 65.8×). We identified approximately 11 million single nucleotide polymorphisms (SNPs), 2.8 million short insertions and deletions, and over 500,000 block substitutions. We showed that, although previous studies, including the 1000 Genomes Project Phase 1 study, have catalogued the vast majority of common SNPs, many of the low-frequency and rare variants remain undiscovered. For instance, approximately 1.4 million SNPs and 1.3 million short indels that we found were novel to both the dbSNP and the 1000 Genomes Project Phase 1 data sets, and the majority of which (∼96%) have a minor allele frequency less than 5%. On average, each individual genome carried ∼3.3 million SNPs and ∼492,000 indels/block substitutions, including approximately 179 variants that were predicted to cause loss of function of the gene products. Moreover, each individual genome carried an average of 44 such loss-of-function variants in a homozygous state, which would completely “knock out” the corresponding genes. Across all the 44 genomes, a total of 182 genes were “knocked-out” in at least one individual genome, among which 46 genes were “knocked out” in over 30% of our samples, suggesting that a number of genes are commonly “knocked-out” in general populations. Gene ontology analysis suggested that these commonly “knocked-out” genes are enriched in biological process related to antigen processing and immune response. Our results contribute towards a comprehensive characterization of human genomic variation, especially for less-common and rare variants, and provide an invaluable resource for future genetic studies of human variation and diseases.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.