2012
DOI: 10.1002/gepi.21684
|View full text |Cite
|
Sign up to set email alerts
|

Utilizing Graph Theory to Select the Largest Set of Unrelated Individuals for Genetic Analysis

Abstract: Many statistical analyses of genetic data rely on the assumption of independence among samples. Consequently, relatedness is either modeled in the analysis or samples are removed to “clean” the data of any pairwise relatedness above a tolerated threshold. Current methods do not maximize the number of unrelated individuals retained for further analysis, and this is a needless loss of resources. We report a novel application of graph theory that identifies the maximum set of unrelated samples in any dataset give… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
50
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
8
1

Relationship

2
7

Authors

Journals

citations
Cited by 48 publications
(50 citation statements)
references
References 23 publications
(31 reference statements)
0
50
0
Order By: Relevance
“…37 An individual is only added to a family network if the sample is related to at least one other person in the network given a user-defined minimum coefficient of relatedness. For example, 0.1875, the midpoint between the mean expected IBD proportion for second-and third-degree relatives, is a threshold that will capture connections between most seconddegree relatives or closer.…”
Section: Family-network Identificationmentioning
confidence: 99%
See 1 more Smart Citation
“…37 An individual is only added to a family network if the sample is related to at least one other person in the network given a user-defined minimum coefficient of relatedness. For example, 0.1875, the midpoint between the mean expected IBD proportion for second-and third-degree relatives, is a threshold that will capture connections between most seconddegree relatives or closer.…”
Section: Family-network Identificationmentioning
confidence: 99%
“…37 Our method utilizes the power of SNP arrays or next-generation sequence data to evaluate genome-wide identity-bydescent (IBD) estimates generated by programs such as PLINK 14 or KING (Kinship-Based Inference for Genomewide Association Studies). 16 Our method assigns relationships by using the expected mean and variance for each relationship class and leverages all pairwise relationships within a family (as well as genetically determined sex) to reconstruct the possible pedigree structures in a manner consistent with the observed pairwise sharing.…”
Section: Introductionmentioning
confidence: 99%
“…We next used the PRIMUS suite of analytical tools to evaluate and perform quality controls on this initial data (Staples et al 2013(Staples et al , 2014. The algorithms in PRIMUS that estimate allele sharing were used to infer genealogical relatedness among all pairs.…”
Section: Evaluation Of Initial Data Set Using Primusmentioning
confidence: 99%
“…Moreover, it is often important to be able to verify stated pedigree relationships from the observed data and to check for errors. For example, the results of an association analysis on a case-control study could be seriously biased if some cases and controls were related and linkage analyses are sensitive to undeclared relationships between pedigree founders [11,23,51,56]. More recently, family-based designs have been recommended for association studies concerned with parent-of-origin effects and the control of population stratification [40].…”
Section: Introductionmentioning
confidence: 99%