2017
DOI: 10.1101/144519
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A standard operating procedure for outlier removal in large-sample epidemiological transcriptomics datasets

Abstract: Transcriptome measurements and other -omics type data are increasingly more used in epidemiological studies. Most of omics studies to date are small with samples sizes in the tens, or sometimes low hundreds, but this is changing. Our Norwegian Woman and Cancer (NOWAC) datasets are to date one or two orders of magnitude larger. The NOWAC biobank contains about 50000 blood samples from a prospective study. Around 125 breast cancer cases occur in this cohort each year. The large biological variation in gene expre… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
3
2
1

Relationship

4
2

Authors

Journals

citations
Cited by 8 publications
(10 citation statements)
references
References 18 publications
0
10
0
Order By: Relevance
“…We performed preprocessing of the raw microarray data according to the NOWAC standard procedure (17). Broadly this comprises the following steps; see the code linked above and the referenced manuscript for details:…”
Section: Laboratory Analyses and Data Pre-processingmentioning
confidence: 99%
“…We performed preprocessing of the raw microarray data according to the NOWAC standard procedure (17). Broadly this comprises the following steps; see the code linked above and the referenced manuscript for details:…”
Section: Laboratory Analyses and Data Pre-processingmentioning
confidence: 99%
“…Pippeline and a description of our microarray preprocessing pipeline are available at: https://github.com/uitbdps/pippeline A demo dataset from [22] is available at: https://doi.org/10.18710/FGVLKS…”
Section: Discussionmentioning
confidence: 99%
“…Steps with a dashed border are optional, while steps with a solid border are mandatory. More details are in[22]-[26] A screenshot of the user interface in R Studio, viewing the documentation help page for the "Biopsies" dataset in the NOWAC study. The right-hand panel shows the documentation generated by the code in the top left panel.…”
mentioning
confidence: 99%
“…The data were processed according to [6] and [7]. The pre-processed data is a 88 × 12404 fold change matrix, X, on the log 2 scale.…”
Section: Materials and Methods Datamentioning
confidence: 99%