2022
DOI: 10.1016/j.compbiolchem.2022.107688
|View full text |Cite
|
Sign up to set email alerts
|

A new approach for determining SARS-CoV-2 epitopes using machine learning-based in silico methods

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 15 publications
(6 citation statements)
references
References 61 publications
0
1
0
Order By: Relevance
“…In another study, a Bayesian neural network was used for this task and was able to achieve an accuracy of 85% [63]. It can be seen from Table 9 that the random forest model developed in [43] was able to make good predictions, although there is a need for improvement in the model's performance. In all the methods used previously, to the best of our knowledge, none have attempted the transfer learning strategy that was used in this study.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…In another study, a Bayesian neural network was used for this task and was able to achieve an accuracy of 85% [63]. It can be seen from Table 9 that the random forest model developed in [43] was able to make good predictions, although there is a need for improvement in the model's performance. In all the methods used previously, to the best of our knowledge, none have attempted the transfer learning strategy that was used in this study.…”
Section: Discussionmentioning
confidence: 99%
“…The data consisted of ten features, both structural and chemical. The protein sequences and peptide sequences were in categorical form but were later converted to numerical form using their sequence lengths so that each protein sequence or peptide sequence would have a value that corresponded to the number of its categorical letters, while chou_fasman (beta turn), kolaskar_tongaonkar (antigenicity), Parker (hydrophobicity), Emini (relative surface accessibility), stability, isoelectric_point, aromaticity, and hydrophobicity were numerical, as previously described in [43]; see Figure 1 for the data distribution. The SARS-CoV-2 dataset lacked label information and comprised 20,312 peptides isolated from the virus' spike protein.…”
Section: Datasetsmentioning
confidence: 99%
“…Expanding the dataset to include data from other ICUs within the same institution could help address this limitation. Fourth, balancing a dataset using methods, such as SMOTE (Synthetic Minority Over-sampling Technique) [ 41 ], can improve the performance of a machine learning model in minority class prediction by creating synthetic samples by interpolating between existing minority class samples and then increasing the number of minority class samples in the training dataset. Fifth, the study did not use more complex modeling techniques, such as recurrent neural networks or transformers, which have built-in temporal dynamics functionality, due to the desire to use smaller computational units and maintain interpretability of features.…”
Section: Discussionmentioning
confidence: 99%
“…In this study, accuracy, precision, recall, and F1 score metrics were used to evaluate the classification performance of the RNN methods. The formulas for these metrics are provided below [9,10]:…”
Section: B Performance Metricsmentioning
confidence: 99%