2015
DOI: 10.1111/jbg.12155
|View full text |Cite
|
Sign up to set email alerts
|

Combined use of principal component analysis and random forests identify population‐informative single nucleotide polymorphisms: application in cattle breeds

Abstract: The genetic identification of the population of origin of individuals, including animals, has several practical applications in forensics, evolution, conservation genetics, breeding and authentication of animal products. Commercial high-density single nucleotide polymorphism (SNP) genotyping tools that have been recently developed in many species provide information from a large number of polymorphic sites that can be used to identify population-/breed-informative markers. In this study, starting from Illumina… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

5
68
1

Year Published

2016
2016
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 45 publications
(82 citation statements)
references
References 31 publications
5
68
1
Order By: Relevance
“…Sottile, Sardina, Mastrangelo, Di Gerlando, Tolone, Chiodi and Portolano Penalized multinomial regression and stability selection v. principal component analysis and random forest Penalized multinomial regression and stability selection procedure is a new strategy used for assigning animals to a breed. In order to compare our approach with other previously reported strategies and to test its efficiency in assigning individuals, PCA and RF strategy (Bertolini et al, 2015) were also used with the real data. With respect to the two first ranking SNP panels (MDGI and MAD for 48 and 96 SNPs), the OOB errors in the test population were 4.09% and 2.03%, respectively, while the misclassification error rates for the validation population were both 2.86%.…”
Section: Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…Sottile, Sardina, Mastrangelo, Di Gerlando, Tolone, Chiodi and Portolano Penalized multinomial regression and stability selection v. principal component analysis and random forest Penalized multinomial regression and stability selection procedure is a new strategy used for assigning animals to a breed. In order to compare our approach with other previously reported strategies and to test its efficiency in assigning individuals, PCA and RF strategy (Bertolini et al, 2015) were also used with the real data. With respect to the two first ranking SNP panels (MDGI and MAD for 48 and 96 SNPs), the OOB errors in the test population were 4.09% and 2.03%, respectively, while the misclassification error rates for the validation population were both 2.86%.…”
Section: Resultsmentioning
confidence: 99%
“…In order to compare our approach to those previously reported, another mixed strategy was considered (Bertolini et al, 2015). In particular, PCA and RF was used to discover a new SNP panel able to discriminate among the breeds For each autosome, the top 20 SNPs were selected and merged together, leading to a final panel of 520 markers.…”
Section: Datamentioning
confidence: 99%
See 3 more Smart Citations