The objective of this study was to investigate the effect of composition and size of the reference population in imputation efficiency of INDUSCHIP v2 in Indian HF crossbred cattle. Data set consisted of a total of 869 cattle from 14 Indicine breeds, 2 crossbreds (HF and Jersey crossbreds) and 2 exotic breeds (HF, Jersey) genotyped with Illumina BovineHD (Illumina, San Diego, CA) panel. Post QC, 846 animals and 449955 SNPs remained for imputation study. 3 test groups each with randomly selected 25 HFCB animals with subset genotype of INDUSCHIP v2 were created, whereas with HD genotyping data of remaining animals, 3 different categories of reference groups were created namely reference 1 (HF, Jersey, all 14 Indicine breeds, HF and Jersey crossbreds), reference 2 (HF, HF crossbred, Sahiwal, Gir and Kankrej) and reference 3 (pure HF, Sahiwal, Gir and Kankrej). Imputation efficiency of INDUSCHIP v2 was expressed in terms of concordance rate and Dosage R2 (DR2). Reference groups 1 and 2 were found to be better than Reference group 3. Further, the size of the reference population had an impact on imputation efficiency. The concordance rate and DR2 decreased with decline about population size. However, a reference population with 280 animals was found to be sufficient to obtain a concordance rate of around 95% or more and DR2 around 0.93. More number of HF, HF crossbred, Sahiwal, Gir and Kankrej animals need to be HD genotyped and incorporated in the reference population to improve the imputation efficiency of INDUSCHIP v2.
INDUSCHIPan Illumina platform based custom made genotyping chip was designed with 45K polymorphic markers for Indian cattle breeds and 8K base SNPs of Illumina BovineLD chip to genotype indigenous and crossbred cattle in India. Current study was undertaken to assess the genotype imputation efficiency of INDUSCHIP v2 microarray in HF crossbred cattle and compare its efficiency of imputation with that of GGP-35K microarray. HD genotyping data of total 869 cattle from 14 indicine breeds, 2 crossbred (HF and Jersey crossbred) and 2 exotic breeds (HF, Jersey) were used for this study. Post quality control, only 846 animals and 449955 SNPs remained for imputation study. Only 23.65% of 35339 SNPs in GGP-35K chip are found to be common with INDUSCHIP v2 SNP panel. Imputation was carried out with the help of Beagle 5.0 software using subset of both INDUSCHIP v2 and GGP-35K SNP panels. The study revealed higher average concordance rate (CR) and squared correlation (DR 2 ) for INDUSCHIP v2 as compared to GGP-35K in crossbred HF population.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.