2020
DOI: 10.3389/fbioe.2020.00350
|View full text |Cite
|
Sign up to set email alerts
|

Identifying Cell-Type Specific Genes and Expression Rules Based on Single-Cell Transcriptomic Atlas Data

Abstract: Single-cell sequencing technologies have emerged to address new and longstanding biological and biomedical questions. Previous studies focused on the analysis of bulk tissue samples composed of millions of cells. However, the genomes within the cells of an individual multicellular organism are not always the same. In this study, we aimed to identify the crucial and characteristically expressed genes that may play functional roles in tissue development and organogenesis, by analyzing a single-cell transcriptomi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
10
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
8

Relationship

3
5

Authors

Journals

citations
Cited by 9 publications
(10 citation statements)
references
References 69 publications
0
10
0
Order By: Relevance
“…Boruta feature filtering ( Kursa and Rudnicki, 2010 ; Pan et al, 2020 ; Yuan et al, 2020 ) is usually used to rapidly select all relevant features to the target labels on the basis of a random forest (RF) classifier. In brief, the calculation of Boruta includes the following steps: (1) shuffled data are created by shuffling the feature values of copies of original data; (2) RF can be trained on the original and shuffled data to measure the feature importance, and the Z score is calculated for each feature by standardizing its importance score from the RF; (3) one original feature is tagged as important when its Z score is greater than the maximum Z score of shadow features; otherwise, it is tagged as unimportant; (4) the above processes are repeated until all features are tagged as important or not.…”
Section: Methodsmentioning
confidence: 99%
“…Boruta feature filtering ( Kursa and Rudnicki, 2010 ; Pan et al, 2020 ; Yuan et al, 2020 ) is usually used to rapidly select all relevant features to the target labels on the basis of a random forest (RF) classifier. In brief, the calculation of Boruta includes the following steps: (1) shuffled data are created by shuffling the feature values of copies of original data; (2) RF can be trained on the original and shuffled data to measure the feature importance, and the Z score is calculated for each feature by standardizing its importance score from the RF; (3) one original feature is tagged as important when its Z score is greater than the maximum Z score of shadow features; otherwise, it is tagged as unimportant; (4) the above processes are repeated until all features are tagged as important or not.…”
Section: Methodsmentioning
confidence: 99%
“…The Boruta method can find out the relevant features and significantly reduce the number of features based on ensemble learning of random forest classifiers. Boruta is a widely used method and has been proven to be an effective method to find all relevant features ( Pan et al, 2020 ; Yuan et al, 2020 ; Zhang et al, 2020 ).…”
Section: Methodsmentioning
confidence: 99%
“…Here, we employed Boruta ( Kursa and Rudnicki, 2010 ) method to quickly select relevant features with particular class labels (e.g., cancer types or non-cancer class). This method has been applied to deal with different biological and medical problems ( Pan et al, 2020 ; Yuan et al, 2020 ; Zhang et al, 2021a ).…”
Section: Methodsmentioning
confidence: 99%