2017
DOI: 10.1186/s12859-017-1925-0
|View full text |Cite
|
Sign up to set email alerts
|

Integration of RNA-Seq data with heterogeneous microarray data for breast cancer profiling

Abstract: BackgroundNowadays, many public repositories containing large microarray gene expression datasets are available. However, the problem lies in the fact that microarray technology are less powerful and accurate than more recent Next Generation Sequencing technologies, such as RNA-Seq. In any case, information from microarrays is truthful and robust, thus it can be exploited through the integration of microarray data with RNA-Seq data. Additionally, information extraction and acquisition of large number of sample… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
30
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 48 publications
(32 citation statements)
references
References 41 publications
2
30
0
Order By: Relevance
“…For example, Pirooznia et al 36 used eight machine learning algorithms in eight microarray gene expression data sets, and they found SVM to have the best performance. In line with this, Castillo et al 37 analysed RNA-sequencing and microarray data sets, finding SVM to be more accurate than RF or nearest-neighbour classification. However, Bienkowska et al 38 used SVM and RF in combination with iterated feature selection in gene expression data.…”
Section: Discussionmentioning
confidence: 85%
“…For example, Pirooznia et al 36 used eight machine learning algorithms in eight microarray gene expression data sets, and they found SVM to have the best performance. In line with this, Castillo et al 37 analysed RNA-sequencing and microarray data sets, finding SVM to be more accurate than RF or nearest-neighbour classification. However, Bienkowska et al 38 used SVM and RF in combination with iterated feature selection in gene expression data.…”
Section: Discussionmentioning
confidence: 85%
“…Computational deconvolution to differentiate cell types is an important topic of transcriptomic data analysis, which can facilitate the elucidation of cell-type specific transcriptomic profiles in future studies [10, 19, 21, 23-31]. These approaches are generally limited to a single platform and usually for a single species, however, even though RNA-Seq/microarray data integration has already been applied in comparison studies [35, 36], tool development [37], and cancer research [38]. Due to the limitations of prior techniques, the vast accumulation of gene expression microarray and RNA-Seq data usually contains a tissue mixture with multiple cell types.…”
Section: Discussionmentioning
confidence: 99%
“…They were then mapped de novo to mm10 genome and annotated depending on Gencode vM15 by STAR 2.5.3a (Dobin et al, 2013). The integration of microarray and RNA-seq data followed Castillo's strategy (Castillo et al, 2017). Specifically, the RNA-seq count matrix was normalized by Limma-voom (Law, Chen, Shi, & Smyth, 2014), then combined with the downloaded microarray matrix and (after removing the genes that were absent from both platforms) further normalized by Limma-normalizeBetweenArrays.…”
Section: Data Acquisition and Processingmentioning
confidence: 99%