2017
DOI: 10.1038/ng.3968
|View full text |Cite
|
Sign up to set email alerts
|

Analysis commons, a team approach to discovery in a big-data environment for genetic epidemiology

Abstract: Summary paragraph The exploding volume of whole-genome sequence (WGS) and multi-omics data requires new approaches for analysis. As one solution, we have created a cloud-based Analysis Commons, which brings together genotype and phenotype data from multiple studies in a setting that is accessible by multiple investigators. This framework addresses many of the challenges of multi-center WGS analyses, including data sharing mechanisms, phenotype harmonization, integrated multi-omics analyses, annotation, and com… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

2
69
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 95 publications
(77 citation statements)
references
References 11 publications
2
69
0
Order By: Relevance
“…Table 1 shows how reference panel size has increased over the years due to projects such as the International HapMap Project 9 , the 1000 Genomes Project (1000G) 10 , the UK10K Project 11 , and the Haplotype Reference Consortium (HRC) 12 . Soon larger reference panels will become available from the Trans-Omics for Precision Medicine (TOPMed) program 13 and the 100,000 Genomes Project 14 , both of which will exceed 100,000 high-coverage whole genome sequenced samples. Further ahead, as sequencing data on all 500,000 participants of the UK Biobank 2 becomes available, this will be used as an even larger reference panel.…”
Section: Introductionmentioning
confidence: 99%
“…Table 1 shows how reference panel size has increased over the years due to projects such as the International HapMap Project 9 , the 1000 Genomes Project (1000G) 10 , the UK10K Project 11 , and the Haplotype Reference Consortium (HRC) 12 . Soon larger reference panels will become available from the Trans-Omics for Precision Medicine (TOPMed) program 13 and the 100,000 Genomes Project 14 , both of which will exceed 100,000 high-coverage whole genome sequenced samples. Further ahead, as sequencing data on all 500,000 participants of the UK Biobank 2 becomes available, this will be used as an even larger reference panel.…”
Section: Introductionmentioning
confidence: 99%
“…The value of larger reference panels has led to steadily increasing reference panel size, with the largest reference panels to date having thousands or tens of thousands of samples. [6][7][8][9][10][11][12] Ongoing, large-scale projects, such as the Trans-Omics for Precision Medicine (TopMed) program, 13 and the National Human Genome Research Institute's Centers for Common Disease Genomics, 14; 15 are expected to produce reference panels with more than one hundred thousand individuals.…”
Section: Introductionmentioning
confidence: 99%
“…In recent years, large-scale whole-exome sequencing and whole-genome sequencing (WGS) data have been generated, such as those in the Exome Sequencing Project 1 , the UK10K project 2 and the ongoing NIH NHLBI Trans-Omics for Precision Medicine (TOPMed) WGS program 3 , providing unprecedented opportunities to investigate low-frequency variants (minor allele frequency [MAF] between 1% and 5%) and rare variants (RVs; MAF < 1%) in association with complex diseases and traits. However, WGS-based association analysis of complex traits remains a tremendous challenge due to the large number of RVs, many of which are non-traitassociated neutral variants.…”
Section: Introductionmentioning
confidence: 99%