Predicting organismal phenotypes from genotype data is important for preventive and personalized medicine as well as plant and animal breeding. Although genome-wide association studies (GWAS) for complex traits have discovered a large number of trait- and disease-associated variants, phenotype prediction based on associated variants is usually in low accuracy even for a high-heritability trait because these variants can typically account for a limited fraction of total genetic variance. In comparison with GWAS, the whole-genome prediction (WGP) methods can increase prediction accuracy by making use of a huge number of variants simultaneously. Among various statistical methods for WGP, multiple-trait model and antedependence model show their respective advantages. To take advantage of both strategies within a unified framework, we proposed a novel multivariate antedependence-based method for joint prediction of multiple quantitative traits using a Bayesian algorithm via modeling a linear relationship of effect vector between each pair of adjacent markers. Through both simulation and real-data analyses, our studies demonstrated that the proposed antedependence-based multiple-trait WGP method is more accurate and robust than corresponding traditional counterparts (Bayes A and multi-trait Bayes A) under various scenarios. Our method can be readily extended to deal with missing phenotypes and resequence data with rare variants, offering a feasible way to jointly predict phenotypes for multiple complex traits in human genetic epidemiology as well as plant and livestock breeding.
Estimation of genomic breeding values is the key step in genomic selection (GS). Many methods have been proposed for continuous traits, but methods for threshold traits are still scarce. Here we introduced threshold model to the framework of GS, and specifically, we extended the three Bayesian methods BayesA, BayesB and BayesCp on the basis of threshold model for estimating genomic breeding values of threshold traits, and the extended methods are correspondingly termed BayesTA, BayesTB and BayesTCp. Computing procedures of the three BayesT methods using Markov Chain Monte Carlo algorithm were derived. A simulation study was performed to investigate the benefit of the presented methods in accuracy with the genomic estimated breeding values (GEBVs) for threshold traits. Factors affecting the performance of the three BayesT methods were addressed. As expected, the three BayesT methods generally performed better than the corresponding normal Bayesian methods, in particular when the number of phenotypic categories was small. In the standard scenario (number of categories ¼ 2, incidence ¼ 30%, number of quantitative trait loci ¼ 50, h 2 ¼ 0.3), the accuracies were improved by 30.4%, 2.4%, and 5.7% points, respectively. In most scenarios, BayesTB and BayesTCp generated similar accuracies and both performed better than BayesTA. In conclusion, our work proved that threshold model fits well for predicting GEBVs of threshold traits, and BayesTCp is supposed to be the method of choice for GS of threshold traits.
Improvement in growth and fatness traits are the main objectives in pig all breeding programs. Tenth rib backfat thickness (10RIBBFT) and days to 100 kg (D100), which are good predictors of carcass lean content and growth rate, respectively, are economically important traits and also main breeding target traits in pigs. To investigate the genetic mechanisms of 10RIBBFT and D100 of pigs, we sampled 1,137 and 888 pigs from 2 Yorkshire populations of American and British origin, respectively, and conducted genome-wide association study (GWAS) through combined analysis and meta-analysis, to identify SNPs associated with 10RIBBFT and D100. A total of 11 and 7 significant SNPs were identified by combined analysis for 10RIBBFT and D100, respectively. And in meta-analysis, 8 and 7 significant SNPs were identified for 10RIBBFT and D100, respectively. Among them, 6 and 5 common significant SNPs in two analysis results were, respectively, identified associated with 10RIBBFT and D100, and correspondingly explained 2.09% and 0.52% of the additive genetic variance of 10RIBBFT and D100. Further bioinformatics analysis revealed 10 genes harboring or close to these common significant SNPs, 5 for 10RIBBFT and 5 for D100. In particular, Gene Ontology analysis highlighted 6 genes, PCK1, ANGPTL3, EEF1A2, TNFAIP8L3, PITX2, and PLA2G12, as promising candidate genes relevant with backfat thickness and growth. PCK1, ANGPTL3, EEF1A2, and TNFAIP8L3 could influence backfat thickness through phospholipid transport, regulation of lipid metabolic process through the glycerophospholipid biosynthesis and metabolism pathway, the metabolism of lipids and lipoproteins pathway. PITX2 has a crucial role in skeletal muscle tissue development and animal organ morphogenesis, and PLA2G12A plays a role in the lipid catabolic and phospholipid catabolic processes, which both are involved in the body weight pathway. All these candidate genes could directly or indirectly influence fat production and growth in Yorkshire pigs. Our findings provide novel insights into the genetic basis of growth and fatness traits in pigs. The candidate genes for D100 and 10RIBBFT are worthy of further investigation.
In this article, we propose a model selection method, the Bayesian composite model space approach, to map quantitative trait loci (QTL) in a half-sib population for continuous and binary traits. In our method, the identity-by-descentbased variance component model is used. To demonstrate the performance of this model, the method was applied to map QTL underlying production traits on BTA6 in a Chinese half-sib dairy cattle population. A total of four QTLs were detected, whereas only one QTL was identified using the traditional least square (LS) method. We also conducted two simulation experiments to validate the efficiency of our method. The results suggest that the proposed method based on a multiple-QTL model is efficient in mapping multiple QTL for an outbred half-sib population and is more powerful than the LS method based on a single-QTL model.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.