BackgroundCopy number variation (CNV) has been recently identified in human and other mammalian genomes, and there is a growing awareness of CNV's potential as a major source for heritable variation in complex traits. Genomic selection is a newly developed tool based on the estimation of breeding values for quantitative traits through the use of genome-wide genotyping of SNPs. Over 30,000 Holstein bulls have been genotyped with the Illumina BovineSNP50 BeadChip, which includes 54,001 SNPs (~SNP/50,000 bp), some of which fall within CNV regions.ResultsWe used the BeadChip data obtained for 912 Israeli bulls to investigate the effects of CNV on SNP calls. For each of the SNPs, we estimated the frequencies of occurrence of loss of heterozygosity (LOH) and of gain, based either on deviation from the expected Hardy-Weinberg equilibrium (HWE) or on signal intensity (SI) using the PennCNV "detect" option. Correlations between LOH/CNV frequencies predicted by the two methods were low (up to r = 0.08). Nevertheless, 418 locations displayed significantly high frequencies by both methods. Efficiency of designating large genomic clusters of olfactory receptors as CNVs was 29%. Frequency values for copy loss were distinguishable in non-autosomal regions, indicating misplacement of a region in the current BTA7 map. Analysis of BTA18 placed major quantitative trait loci affecting net merit in the US Holstein population in regions rich in segmental duplications and CNVs. Enrichment of transporters in CNV loci suggested their potential effect on milk-production traits.ConclusionsExpansion of HWE and PennCNV analyses allowed estimating LOH/CNV frequencies, and combining the two methods yielded more sensitive detection of inherited CNVs and better estimation of their possible effects on cattle genetics. Although this approach was more effective than methodologies previously applied in cattle, it has severe limitations. Thus the number of CNVs reported here for the Holstein breed may represent as little as one-tenth of inherited common structural variation.
Strong selection in the Israeli Holstein dairy cattle population over the last three decades should have left clear signatures of selection. Two experimental approaches were applied to detect evidence of contemporary selection based on the 54K BeadChip genotypes of ~1000 Israeli Holstein bulls: (i) the long-range haplotype test, which searches for structural evidence resulting from selective sweep, and (ii) direct analysis of the changes in haplotypes frequencies over time combined with linkage disequilibrium blocks haplotype-based association analysis. Ten traits were analyzed: the PD07 Israeli selection index, milk, milk fat, % fat, milk protein, % protein, somatic cell score, female fertility, milk production persistency and herd life. The long-range haplotype test detected ~15% of the 3288 haplotypes that showed significant positive frequency trends (P < 0.05) and was significantly correlated with the substitution effects of the haplotypes and the selection intensities for the different traits. Thirty signatures of recent selection, which correspond to both approaches and affect the Israeli PD07 selection index, were identified on 17 of the 29 autosomes. The second experimental approach also was used to estimate the selection intensity of the different traits. The correlation between the selection intensities for the traits analyzed, derived from changes in haplotype frequencies in the population of bulls, and those derived from trait-based analysis of the cow population was 0.93 over all traits. Thus, the changes in haplotypes frequencies in the bulls' population accurately estimate genetic trends in the general cow population and can be used to detect signatures of recent selection.
Incorrect paternity assignment in cattle can have a major effect on rates of genetic gain. Of the 576 Israeli Holstein bulls genotyped by the BovineSNP50 BeadChip, there were 204 bulls for which the father was also genotyped. The results of 38 828 valid single nucleotide polymorphisms (SNPs) were used to validate paternity, determine the genotyping error rates and determine criteria enabling deletion of defective SNPs from further analysis. Based on the criterion of >2% conflicts between the genotype of the putative sire and son, paternity was rejected for seven bulls (3.5%). The remaining bulls had fewer conflicts by one or two orders of magnitude. Excluding these seven bulls, all other discrepancies between sire and son genotypes are assumed to be caused by genotyping mistakes. The frequency of discrepancies was >0.07 for nine SNPs, and >0.025 for 81 SNPs. The overall frequency of discrepancies was reduced from 0.00017 to 0.00010 after deletion of these 81 SNPs, and the total expected fraction of genotyping errors was estimated to be 0.05%. Paternity of bulls that are genotyped for genomic selection may be verified or traced against candidate sires at virtually no additional cost.
A quantitative trait locus (QTL) affecting female fertility, scored as the inverse of the number of inseminations to conception, on Bos taurus chromosome 7 was detected by a daughter design analysis of the Israeli Holstein population (P < 0.0003). Sires of five of the 10 families analyzed were heterozygous for the QTL. The 95% confidence interval of the QTL spans 27 cM from the centromere. Seven hundred and four SNP markers on the Illumina BovineSNP50 BeadChip within the QTL confidence interval were tested for concordance. A single SNP, NGS-58779, was heterozygous for all the five QTL heterozygous patriarchs, and homozygous for the remaining five QTL homozygous sires. A significant effect on fertility was associated with this marker in the sample of 900 sires genotyped (P < 10−6). Haplotype phase was the same for four of the five segregating sires. Thus concordance was obtained in nine of the ten families. We identified a common haplotype region associated with the rare and economically favorable allele of the SNP, spanning 270 kbp on BTA7 upstream to 4.72 Mbp. Eleven genes found in the common haplotype region should be considered as positional candidates for the identification of the causative quantitative trait nucleotide. Copy number variation was found in one of these genes, KIAA1683. Four gene variants were identified, but only the number of copies of a specific variant (V1) was significantly associated with breeding values of sires for fertility.
Several studies have shown that computation of genomic estimated breeding values (GEBV) with accuracies significantly greater than parent average (PA) estimated breeding values (EBVs) requires genotyping of at least several thousand progeny-tested bulls. For all published analyses, GEBV computed from the selected samples of markers have lower or equal accuracy than GEBV derived on the basis of all valid single nucleotide polymorphisms (SNPs). In the current study, we report on four new methods for selection of markers. Milk, fat, protein, somatic cell score, fertility, persistency, herd life and the Israeli selection index were analyzed. The 972 Israeli Holstein bulls genotyped with EBV for milk production traits computed from daughter records in 2012 were assigned into a training set of 844 bulls with progeny test EBV in 2008, and a validation set of 128 young bulls. Numbers of bulls in the two sets varied slightly among the nonproduction traits. In EFF 12 , SNPs were first selected for each trait based on the effects of each marker on the bulls' 2012 EBV corrected for effective relationships, as determined by the SNP matrix. EFF 08 was the same as EFF 12 , except that the SNPs were selected on the basis of the 2008 EBV. In DIF max , the SNPs with the greatest differences in allelic frequency between the bulls in the training and validation sets were selected, whereas in DIF min the SNPs with the smallest differences were selected. For all methods, the numbers of SNPs retained varied over the range of 300 to 6000. For each trait, except fertility, an optimum number of markers between 800 and 5000 was obtained for EFF 12 , based on the correlation between the GEBV and current EBV of the validation bulls. For all traits, the difference between the correlation of GEBV and current EBV and the correlation of the PA and current EBV was >0.25. EFF 08 was inferior to EFF 12 , and was generally no better than PA EBV. DIF max always outperformed DIF min and generally outperformed EFF 08 and PA. Furthermore, GEBV based on DIF max were generally less biased than PA. It is likely that other methods of SNP selection could improve upon these results.
A single nucleotide polymorphism in the intergenic region upstream of the ZNF496 gene on Bos taurus chromosome 7 displayed significant population-wide linkage disequilibrium with milk protein percentage in the Israeli Holstein population. The frequency of the allele associated with increased protein concentration was 10%. This single nucleotide polymorphism was located in the promoter region from which a 10-exon transcript of the bovine and the ovine ZNF496 genes are transcribed. The gene architecture was similar to the mouse ortholog Zkscan17. A 5-exon murine antisense transcript was complementary to the 5' untranslated Zkscan17 region that included a sequence domain conserved between mouse and ruminants, suggesting a regulatory function. In the bovine ZNF496 chromosomal region, segregation of a quantitative trait locus (QTL) for milk protein percentage was confirmed in a daughter design sire family. Concordance was not obtained between QTL status of bulls and any of the polymorphisms in the functional elements of ZNF496. This excludes these variations as the causative polymorphism under the assumption of no epigenetic effect for this locus. However, ZNF496 variants were differentially expressed in bovine ovaries, and only the paternal variant was expressed in liver and kidney in a sheep family with polymorphic ZNF496 sequence. Thus, the search for the mutation underlying the minor QTL allele, which is a top economically favorable allele in Israeli Holstein cattle, may be complicated by the presence of an imprinting center in this QTL confidence interval.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.