2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) 2015
DOI: 10.1109/embc.2015.7319872
|View full text |Cite
|
Sign up to set email alerts
|

Effect of low-expression gene filtering on detection of differentially expressed genes in RNA-seq data

Abstract: We compare methods for filtering RNA-seq lowexpression genes and investigate the effect of filtering on detection of differentially expressed genes (DEGs). Although RNA-seq technology has improved the dynamic range of gene expression quantification, low-expression genes may be indistinguishable from sampling noise. The presence of noisy, low-expression genes can decrease the sensitivity of detecting DEGs. Thus, identification and filtering of these low-expression genes may improve DEG detection sensitivity. Us… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
35
0
2

Year Published

2016
2016
2023
2023

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 68 publications
(43 citation statements)
references
References 19 publications
(28 reference statements)
0
35
0
2
Order By: Relevance
“…Samples were analyzed statistically using the tool Empirical analysis of DGE ( Differential Gene Expression ) in CLC Genomics Workbench as described in Materials and Methods, with original total exon counts from each sample used as input. For each gene, if the average exon counts from all biological replicates of a sample were identified as <10, the gene was defined as not expressed in that sample (see Soneson and Delorenzi, 2013 ; Sha et al, 2015 ). Based on this condition, 14,491, 14,157 and 14,259 genes from Arabidopsis were identified as not expressed in each of the three experimental comparisons (i), (ii) and (iii), respectively, and these were consequently excluded from further analyses (Table 1 ).…”
Section: Resultsmentioning
confidence: 99%
“…Samples were analyzed statistically using the tool Empirical analysis of DGE ( Differential Gene Expression ) in CLC Genomics Workbench as described in Materials and Methods, with original total exon counts from each sample used as input. For each gene, if the average exon counts from all biological replicates of a sample were identified as <10, the gene was defined as not expressed in that sample (see Soneson and Delorenzi, 2013 ; Sha et al, 2015 ). Based on this condition, 14,491, 14,157 and 14,259 genes from Arabidopsis were identified as not expressed in each of the three experimental comparisons (i), (ii) and (iii), respectively, and these were consequently excluded from further analyses (Table 1 ).…”
Section: Resultsmentioning
confidence: 99%
“…For reliable differential expression analysis, a further improvement of eFDR levels is thus needed. Additional filtering steps have been successfully used to that effect [13, 24]. For RNA-seq, unlike for microarrays, beyond filters for small effect size (fold change) also filters for small expression levels are necessary.…”
Section: Resultsmentioning
confidence: 99%
“…Especially for RNA-seq a plethora of new tools is being developed, and the selection of an effective pipeline is not trivial [24]. Going beyond the comparisons of the original SEQC study [2, 3], we here present comprehensive benchmark results covering all known genes and a range of effect sizes typically observed in experiments.…”
Section: Discussionmentioning
confidence: 99%
“…Similarly, the correlation between the lncRNA expression and the closest coding gene was performed for the coding genes that were expressed (RPKM >0.5) in at least 3 samples among the 8 studied. This criteria has been defined in order to remove low counts in the libraries to improve the sensitivity and the precision of the differential genes expression ( 20 ). Moreover, this threshold was selected because, for GRO-Seq, reads are counted throughout the gene body, which represents more total reads per genes than RNA-seq ( 12 , 15 , 21 , 22 ).…”
Section: Methodsmentioning
confidence: 99%