2008
DOI: 10.1038/sj.ejhg.5201988
|View full text |Cite
|
Sign up to set email alerts
|

Imputing missing genotypic data of single-nucleotide polymorphisms using neural networks

Abstract: With advances in high-throughput single-nucleotide polymorphism (SNP) genotyping, the amount of genotype data available for genetic studies is steadily increasing, and with it comes new abilities to study multigene interactions as well as to develop higher dimensional genetic models that more closely represent the polygenic nature of common disease risk. The combined impact of even small amounts of missing data on a multi-SNP analysis may be considerable. In this study, we present a neural network method for i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
28
0

Year Published

2008
2008
2023
2023

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 34 publications
(28 citation statements)
references
References 31 publications
0
28
0
Order By: Relevance
“…Missing genotypes were computed through the 20-SNP haplotypes with EM convergence tolerance of 0.001 and maximum EM iteration number of 50. In a simulation study, this method achieved imputation accuracy above 95% with missing rates ranging from 1 to 10% [Sun and Kardia, 2008]. The same complete data set was analyzed with Random Forests and RuleFit methods.…”
Section: Missing Data Imputationmentioning
confidence: 99%
“…Missing genotypes were computed through the 20-SNP haplotypes with EM convergence tolerance of 0.001 and maximum EM iteration number of 50. In a simulation study, this method achieved imputation accuracy above 95% with missing rates ranging from 1 to 10% [Sun and Kardia, 2008]. The same complete data set was analyzed with Random Forests and RuleFit methods.…”
Section: Missing Data Imputationmentioning
confidence: 99%
“…As a result of LD, a disease-susceptibility SNP need not be genotyped, as long as it is “tagged” by a SNP or set of SNPs that are genotyped (i.e., SNPs in LD with the disease-susceptibility SNP are genotyped). Recently this concept has been further exploited by the introduction of methods to impute genotypes at untyped markers, based on genotypes at typed markers and information about LD within the region [3], [4], [5], [6], [7], [8], [9], [10], [11], [12]. These methods are particularly useful in the context of failed genotyping and combining data across multiple platforms and recently have been extended to untyped markers using a reference data set [8], [10], [11].…”
Section: Introductionmentioning
confidence: 99%
“…In Sun and Kardia (2008), the authors employ an Artificial Neural Networks based method for imputing the MVs artificially generated over genotype data. Pisoni, Pastor, and Volta (2008) also use Artificial Neural Networks for interpolating missing satellite data.…”
Section: A Short Review On the Use Of Imputation Methods For Neural Nmentioning
confidence: 99%