Challenges of Linkage Analysis in the Era of Whole‐Genome Sequencing

Santorico, Stephanie A.; Edwards, Karen L.

doi:10.1002/gepi.21832

Cited by 5 publications

(7 citation statements)

References 18 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Another potential way to enrich subjects for genetic risk factors is to utilize individuals with a strong family history of the condition, since pedigrees with multiple affected individuals are more likely to have pre-disposing genetic variants than typical sporadic cases. Indeed, the way in which traditional genetic linkage approaches can be effectively used in conjunction with new technological and analytical methods for the identification of rare, disease causing mutations is currently being explored ( Santorico and Edwards, 2014 ).…”

Section: Missing Heritabilitymentioning

confidence: 99%

Identification of rare variants in Alzheimerâ€™s disease

Lord

Cruchaga

2014

Front. Genet.

View full text Add to dashboard Cite

Much progress has been made in recent years in identifying genes involved in the risk of developing Alzheimer’s disease (AD), the most common form of dementia. Yet despite the identification of over 20 disease associated loci, mainly through genome wide association studies (GWAS), a large proportion of the genetic component of the disorder remains unexplained. Recent evidence from the AD field, as with other complex diseases, suggests a large proportion of this “missing heritability” may be due to rare variants of moderate to large effect size, but the methodologies to detect such variants are still in their infancy. The latest studies in the field have been focused on the identification of coding variation associated with AD risk, through whole-exome or whole-genome sequencing. Such variants are expected to have larger effect sizes than GWAS loci, and are easier to functionally characterize, and develop cellular and animal models for. This review explores the issues involved in detecting rare variant associations in the context of AD, highlighting some successful approaches utilized to date.

show abstract

Section: Missing Heritabilitymentioning

confidence: 99%

Identification of rare variants in Alzheimerâ€™s disease

Lord

Cruchaga

2014

Front. Genet.

View full text Add to dashboard Cite

show abstract

“…Today, investigators continue to use pedigrees to determine the heritability and genetic models for traits and disorders, and knowing the exact pedigree structure allows them to correctly identify the genetic mode of disease inheritance and utilize powerful genetic-analysis tools that require, or benefit from, the true pedigree structure. Such tools include linkage, 1 family-based association, 2 pedigree-aware imputation, pedigree-aware phasing, Mendelian error checking, heritability, and pVAAST (Pedigree Variant Annotation, Analysis, and Search Tool). 3 In many instances, knowing the pedigree that is consistent with the generated genetic data is crucial to solving the disease.…”

Section: Introductionmentioning

confidence: 99%

PRIMUS: Rapid Reconstruction of Pedigrees from Genome-wide Estimates of Identity by Descent

Staples

Qiao

Cho

et al. 2014

The American Journal of Human Genetics

142

162

View full text Add to dashboard Cite

Understanding and correctly utilizing relatedness among samples is essential for genetic analysis; however, managing sample records and pedigrees can often be error prone and incomplete. Data sets ascertained by random sampling often harbor cryptic relatedness that can be leveraged in genetic analyses for maximizing power. We have developed a method that uses genome-wide estimates of pairwise identity by descent to identify families and quickly reconstruct and score all possible pedigrees that fit the genetic data by using up to third-degree relatives, and we have included it in the software package PRIMUS (Pedigree Reconstruction and Identification of the Maximally Unrelated Set). Here, we validate its performance on simulated, clinical, and HapMap pedigrees. Among these samples, we demonstrate that PRIMUS can verify reported pedigree structures and identify cryptic relationships. Finally, we show that PRIMUS reconstructed pedigrees, all of which were previously unknown, for 203 families from a cohort collected in Starr County, TX (1,890 samples).

show abstract

“…Third, families who 9 have visited a healthcare system for many years with multiple encounters will have 10 extensive medical records, making them more likely to be included in a study compared 11 to transient residents with brief medical records and fewer encounters. Both family 12 structure and distant cryptic relatedness are more pronounced in populations with low 13 migration rates 5 . Conversely, confounding population substructure may be less of a factor 14 in HPG studies if the sampled healthcare system's population is a single homogenous 15 genetic deme 1 .…”

Section: Cc-by-nc-nd 40 International License Peer-reviewed) Is the mentioning

confidence: 99%

“…5 We show that cryptic family structure in a large sequencing dataset presents an 6 opportunity to harness a valuable, untapped source of genetic insights rather than a 7 nuisance that must be managed during downstream analyses. As we enter the era of 8 genomic-based precision medicine, we see a critical need for additional innovative 9 methods and tools that are capable of effectively mining the familial structure and distant 10 relatedness contained within the ever-growing sequencing cohorts.…”

Section: Cc-by-nc-ndmentioning

confidence: 99%

“…We reconstructed 12,574 pedigrees using these relationships 4 (including 2,192 nuclear families) and leveraged them for multiple applications. The 5 pedigrees substantially improved the phasing accuracy of 20,947 rare, deleterious 6 compound heterozygous mutations. Reconstructed nuclear families were critical for 7 identifying 3,415 de novo mutations in ~1,783 genes.…”

mentioning

confidence: 99%

See 1 more Smart Citation

Profiling and leveraging relatedness in a precision medicine cohort of 92,455 exomes

Staples

Maxwell

Gosalia

et al. 2017

Preprint

View full text Add to dashboard Cite

16Large-scale human genetics studies are ascertaining increasing proportions of 17 populations as they continue growing in both number and scale. As a result, the amount 18 of cryptic relatedness within these study cohorts is growing rapidly and has significant 19 implications on downstream analyses. We demonstrate this growth empirically among 20 the first 92,455 exomes from the DiscovEHR cohort and, via a custom simulation 21 framework we developed called SimProgeny, show that these measures are in-line with 22 expectations given the underlying population and ascertainment approach. For example, 23 we identified ~66,000 close (first-and second-degree) relationships within DiscovEHR 1 involving 55.6% of study participants. Our simulation results project that >70% of the 2 cohort will be involved in these close relationships as DiscovEHR scales to 250,000 3 recruited individuals. We reconstructed 12,574 pedigrees using these relationships 4 (including 2,192 nuclear families) and leveraged them for multiple applications. The 5 pedigrees substantially improved the phasing accuracy of 20,947 rare, deleterious 6 compound heterozygous mutations. Reconstructed nuclear families were critical for 7 identifying 3,415 de novo mutations in ~1,783 genes. Finally, we demonstrate the 8 segregation of known and suspected disease-causing mutations through reconstructed 9 pedigrees, including a tandem duplication in LDLR causing familial hypercholesterolemia. 10In summary, this work highlights the prevalence of cryptic relatedness expected among 11 large healthcare population genomic studies and demonstrates several analyses that are 12 uniquely enabled by large amounts of cryptic relatedness. 13 14 Key words: cryptic relatedness; pedigree reconstruction; relationship inference; identity 15 by decent; compound heterozygous mutation phasing; de novo mutations; precision 16 medicine; healthcare population-based genetic study; exome sequencing; family structure; 17 familial-hypercholesterolemia 18 19 20 21 22

show abstract

Challenges of Linkage Analysis in the Era of Whole‐Genome Sequencing

Cited by 5 publications

References 18 publications

Identification of rare variants in Alzheimerâ€™s disease

Identification of rare variants in Alzheimerâ€™s disease

PRIMUS: Rapid Reconstruction of Pedigrees from Genome-wide Estimates of Identity by Descent

Profiling and leveraging relatedness in a precision medicine cohort of 92,455 exomes

Contact Info

Product

Resources

About