2019
DOI: 10.3389/fgene.2019.00734
|View full text |Cite
|
Sign up to set email alerts
|

A Tool for Visualization and Analysis of Single-Cell RNA-Seq Data Based on Text Mining

Abstract: Gene expression in individual cells can now be measured for thousands of cells in a single experiment thanks to innovative sample-preparation and sequencing technologies. State-of-the-art computational pipelines for single-cell RNA-sequencing data, however, still employ computational methods that were developed for traditional bulk RNA-sequencing data, thus not accounting for the peculiarities of single-cell data, such as sparseness and zero-inflated counts. Here, we present a ready-to-use pipeline named … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
36
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 16 publications
(36 citation statements)
references
References 28 publications
(50 reference statements)
0
36
0
Order By: Relevance
“…The two case studies aimed at the extraction of chemical names from the texts relevant to HIV reverse transcriptase inhibition, proteins and genes from the texts relevant to HIV control allow us to determine the advantages and disadvantages of text mining approaches to new information. The main advantage of text mining approaches is the possibility of covering the huge amount of textual data (Ruusmann and Maran, 2013 ; Capuzzi et al, 2017 , 2018 ; Kandhro et al, 2017 ; Azam et al, 2019 ; Gambardella and di Bernardo, 2019 ; Guin et al, 2019 ; Ivanisenko et al, 2019 ; Alves et al, 2020 ). Text mining approaches allow retrieving the most recent and important information about chemicals, proteins, and genes associated with HIV treatment including their tissue-specific expression level (Ivanisenko et al, 2019 ).…”
Section: Discussionmentioning
confidence: 99%
“…The two case studies aimed at the extraction of chemical names from the texts relevant to HIV reverse transcriptase inhibition, proteins and genes from the texts relevant to HIV control allow us to determine the advantages and disadvantages of text mining approaches to new information. The main advantage of text mining approaches is the possibility of covering the huge amount of textual data (Ruusmann and Maran, 2013 ; Capuzzi et al, 2017 , 2018 ; Kandhro et al, 2017 ; Azam et al, 2019 ; Gambardella and di Bernardo, 2019 ; Guin et al, 2019 ; Ivanisenko et al, 2019 ; Alves et al, 2020 ). Text mining approaches allow retrieving the most recent and important information about chemicals, proteins, and genes associated with HIV treatment including their tissue-specific expression level (Ivanisenko et al, 2019 ).…”
Section: Discussionmentioning
confidence: 99%
“…We next generated an atlas encompassing all 32 cell-lines, as shown in Figure 1A and available online. This was achieved by combining data across cell-lines with the gf-icf pipeline ( 19 ), which performs count normalization, feature selection and dimensionality reduction ( 20 ) of the profiled cells. In the atlas, cell lines derived from the same cancer subtypes tend to cluster together, while being separated from the other subtypes (Figure 1A): luminal BC cell lines form a big “island” with multiple “peninsulas” with intermixing of cells from distinct cell lines (Figure 1A,B); on the contrary, TNBC cells give rise to an “archipelago” with cells from the same cell-line grouped into distinct islands, thus suggesting that TNBC cell-lines represent instances of distinct diseases.…”
Section: Resultsmentioning
confidence: 99%
“…Single cells expression profiles were normalized using GF-ICF (Gene Frequency – Inverse Cell Frequency) normalization using the gficf package 65,66 for R statistical environment (https://github.com/dibbelab/gficf). GF-ICF is based on a data transformation model called term frequency-inverse document frequency (TF-IDF) that has been extensively used in the field of text mining.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations