2019
DOI: 10.1093/bioinformatics/btz165
|View full text |Cite
|
Sign up to set email alerts
|

PyFeat: a Python-based effective feature generation tool for DNA, RNA and protein sequences

Abstract: Motivation Extracting useful feature set which contains significant discriminatory information is a critical step in effectively presenting sequence data to predict structural, functional, interaction and expression of proteins, DNAs and RNAs. Also, being able to filter features with significant information and avoid sparsity in the extracted features require the employment of efficient feature selection techniques. Here we present PyFeat as a practical and easy to use toolkit implemented in … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
80
0
1

Year Published

2019
2019
2023
2023

Publication Types

Select...
7
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 91 publications
(81 citation statements)
references
References 12 publications
0
80
0
1
Order By: Relevance
“…Several widely used and convenient bio-sequence feature representation tools have been developed (Mrozek et al, 2013;Liu et al, 2015aLiu et al, , 2019cYu et al, 2015Cheng and Hu, 2018;Hu et al, 2019;Muhammod et al, 2019). The two main tools used in this work were iLearn (Hu et al, 2019) and PyFeat (Muhammod et al, 2019).…”
Section: Feature Representationmentioning
confidence: 99%
See 1 more Smart Citation
“…Several widely used and convenient bio-sequence feature representation tools have been developed (Mrozek et al, 2013;Liu et al, 2015aLiu et al, , 2019cYu et al, 2015Cheng and Hu, 2018;Hu et al, 2019;Muhammod et al, 2019). The two main tools used in this work were iLearn (Hu et al, 2019) and PyFeat (Muhammod et al, 2019).…”
Section: Feature Representationmentioning
confidence: 99%
“…In this study, X, Y, and k were set to 1, 2, or 3; and eight XYK combinations (except for 3mer-kspaced-3mer) were used for encoding. The PyFeat tool developed by Rafsanjani et al (Muhammod et al, 2019) was used to convert RNA sequences into vectors.…”
Section: Xmer K-spaced Ymer Composition Frequencymentioning
confidence: 99%
“…1). The 25 feature encoding algorithms are provided in detail in the Supplementary methods; each feature was generated by using a package of comprehensive protein feature generation tools, including 188D [45], Pse-in-One [73], iFeature [42] and pyFeat [43], or custom Python scripts.…”
Section: B Feature Extraction Strategymentioning
confidence: 99%
“…Similarly, xxKGAP feature is one variation of Kmer method implemented in PyFeat package [36], where the composition of subsequences with k-gaps is used to describe sequences. In this paper, we choose monoMonoKGap (mMKGap), monoDiKGap (mDKGap) and monoTriKGap (mTKGap) features with k = 1∌3 to construct.…”
Section: ) Xxkgapmentioning
confidence: 99%