2017
DOI: 10.2119/molmed.2017.00100
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Genome-wide Association in Biobanks Using Topic Modeling Identifies Multiple Novel Disease Loci

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
18
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
6
2

Relationship

2
6

Authors

Journals

citations
Cited by 18 publications
(19 citation statements)
references
References 25 publications
1
18
0
Order By: Relevance
“…After extraction of DNA from buffy coat, samples were genotyped via one of the Illumina Multi-Ethnic Genotyping Arrays, which include content from phase 3 of the 1000 Genomes Project. For details of genotyping, see our prior publication 7 . To address potential batch effects across the four genotyping waves, we cleaned, imputed, and analyzed each one separately.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…After extraction of DNA from buffy coat, samples were genotyped via one of the Illumina Multi-Ethnic Genotyping Arrays, which include content from phase 3 of the 1000 Genomes Project. For details of genotyping, see our prior publication 7 . To address potential batch effects across the four genotyping waves, we cleaned, imputed, and analyzed each one separately.…”
Section: Methodsmentioning
confidence: 99%
“…The remaining PheWAS code count by subject matrix was used to fit a latent Dirichlet allocation (LDA) model with 50 topics; the 50 topic count was selected for consistency with our own prior work and in the absence of well-established methods for optimal topic count selection 23 . As we have described 7 , this unsupervised machine learning method treats each subject’s medical record as if it were a document composed of PheWAS codes reflecting a mixture of underlying topics, or disease categories. The LDA model that results reflects a distribution of all PheWAS codes over each topic, although most codes contribute only a trivial amount.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…To address the contribution of the microbiota in our model, we used a co-housing experiment to homogenise the microbiota between experimental groups. Co-housing of mice has been shown to resolve microbiota drifts due to experimental treatments, resulting in significant clustering of co-housed animals (24,25). Co-housing was simulated by transfer of bedding and faecal pellets between cages.…”
Section: Vnmaa Treatment Severely Disrupts the Gut Microbiota Landscapementioning
confidence: 99%
“…In addition to its wide adoption in the text mining field, topic modeling has achieved many successes in computer vision and biomedical science. Recently, a few groups have used this approach to analyze electronic health records (EHRs) [9,10] and genetic data to capture the characteristic of data [11,12].…”
Section: Introductionmentioning
confidence: 99%