2014
DOI: 10.1093/bioinformatics/btu375
|View full text |Cite
|
Sign up to set email alerts
|

Preserving biological heterogeneity with a permuted surrogate variable analysis for genomics batch correction

Abstract: All analyses were performed using R version 2.15.0. The code and data used to generate the results of this manuscript is available from https://sourceforge.net/projects/psva.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
109
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
4
3
3

Relationship

1
9

Authors

Journals

citations
Cited by 116 publications
(109 citation statements)
references
References 27 publications
0
109
0
Order By: Relevance
“…We then examined the association between the residuals of DNA methylation (independent variable) and mRNA expression (dependent variable) using a linear regression model. In FHS, we removed 25 surrogate variables (SVs) [26] from the gene expression, along with sex, age, and imputed blood cell fractions as fixed effects, and technical covariates, such as batch effects and lab effects as random effects. We also removed 25 separately computed SVs from the methylation data, along with sex, age, and imputed blood cell fractions as fixed effects, and technical covariates, such as batch effects and lab effects as random effects.…”
Section: Methodsmentioning
confidence: 99%
“…We then examined the association between the residuals of DNA methylation (independent variable) and mRNA expression (dependent variable) using a linear regression model. In FHS, we removed 25 surrogate variables (SVs) [26] from the gene expression, along with sex, age, and imputed blood cell fractions as fixed effects, and technical covariates, such as batch effects and lab effects as random effects. We also removed 25 separately computed SVs from the methylation data, along with sex, age, and imputed blood cell fractions as fixed effects, and technical covariates, such as batch effects and lab effects as random effects.…”
Section: Methodsmentioning
confidence: 99%
“…For RNA data, we normalised each tissue separately, using limma-voom 61 to remove heteroscedasticity from scaled TPM values, followed by pSVA 62 and regressing out RNA sequencing batches and clinical batches (Supplementary Methods). For the proteomics data, we regressed out batches from the log2-transformed normalised abundance values, then quantile normalised the residuals as analogous step to the differential expression analysis; all analyses were also done without quantile normalisation, with no appreciable difference in results.…”
Section: Sample Clusteringmentioning
confidence: 99%
“…To correct for batch effects and remove unwanted variation between the first and second fish, we used the ComBat function from R Bioconductor's sva package (Parker et al 2014). After this procedure, variation between individuals was minimal (Supplemental Fig.…”
Section: Single-cell Rna-seq Data Processingmentioning
confidence: 99%