Predicting the multi-label protein subcellular localization through multi-information fusion and MLSI dimensionality reduction based on MLFE classifier

Liu, Yushuang; Jin, Shuping; Gao, Hongli; Wang, Xue; Wang, Congjing; Zhou, Weifeng; Yu, Bin

doi:10.1093/bioinformatics/btab811

Cited by 15 publications

(5 citation statements)

References 63 publications


“…But knowledge data are limited and only applicable to well-curated proteins, which limits the predictive power of this kind of method for novel or newly discovered proteins. In recent studies [75][76][77], different kinds of information are fused together for better model performance, given that computational methods excel with high dimensional data as inputs.…”

Section: Knowledge-based Methods (mentioning)

confidence: 99%

“…The fusion methods can basically be divided into two categories: feature-level fusion [77,104,105] and decision-level fusion [106]. Feature-level fusion is mostly based on average pooling, weighted combination [107], serial combination, or concatenation of selected values.…”

Section: Knowledge-based AI Approaches (mentioning)

confidence: 99%

“…Feature-level fusion is mostly based on average pooling, weighted combination [107], serial combination, or concatenation of selected values. Liu et al [77] utilized the latent semantic index method to represent multi-label information, while Yu et al [49] constructed a novel parallel framework of attribute fusion to avoid the impact of duplicated information. This fusion level enhances the information from multiple sources and allows flexibility in fusion techniques, such as early integration, intermediate integration, and late integration [108].…”

Section: Knowledge-based AI Approaches (mentioning)

confidence: 99%
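The fusion categories named in the statements above can be illustrated with a minimal sketch. It is not the method of Liu et al. or of any cited paper: the function names, the toy vectors, and the 0.5 threshold are assumptions chosen only to show how feature-level fusion (concatenation, weighted combination, average pooling) differs from decision-level fusion.

```python
from typing import List, Sequence

Vector = List[float]


def concat_fusion(features: Sequence[Vector]) -> Vector:
    """Feature-level fusion by serial combination: join vectors end to end."""
    return [x for vec in features for x in vec]


def weighted_fusion(features: Sequence[Vector], weights: Sequence[float]) -> Vector:
    """Feature-level fusion by weighted combination of equal-length vectors."""
    if len(features) != len(weights):
        raise ValueError("need exactly one weight per feature vector")
    dim = len(features[0])
    if any(len(vec) != dim for vec in features):
        raise ValueError("all feature vectors must have the same length")
    total = sum(weights)
    if total == 0:
        raise ValueError("weights must not sum to zero")
    # Normalise by the weight sum so the result stays on the input scale.
    return [
        sum(w * vec[i] for w, vec in zip(weights, features)) / total
        for i in range(dim)
    ]


def average_pooling(features: Sequence[Vector]) -> Vector:
    """Average pooling is the equal-weight special case of weighted fusion."""
    return weighted_fusion(features, [1.0] * len(features))


def decision_fusion(label_scores: Sequence[Vector], threshold: float = 0.5) -> List[int]:
    """Decision-level fusion: average per-label scores from several
    classifiers, then threshold each label independently (multi-label)."""
    fused = average_pooling(label_scores)
    return [1 if score >= threshold else 0 for score in fused]


if __name__ == "__main__":
    # Hypothetical feature vectors from two information sources.
    seq_features = [1.0, 3.0]
    go_features = [3.0, 5.0]
    print(concat_fusion([seq_features, go_features]))
    print(average_pooling([seq_features, go_features]))
    # Hypothetical per-label scores from two classifiers over three sites.
    print(decision_fusion([[0.9, 0.2, 0.6], [0.7, 0.4, 0.3]]))
```

Feature-level fusion combines the inputs before any classifier sees them, while decision-level fusion combines classifier outputs afterwards. In the multi-label setting each localization site is thresholded on its own, so a protein can receive several labels.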


A Review for Artificial Intelligence Based Protein Subcellular Localization

Xiao, Zou, Wang et al., 2024, Biomolecules

Proteins need to be located in appropriate spatiotemporal contexts to carry out their diverse biological functions. Mislocalized proteins may lead to a broad range of diseases, such as cancer and Alzheimer’s disease. Knowing where a target protein resides within a cell will give insights into tailored drug design for a disease. As the gold validation standard, the conventional wet lab uses fluorescent microscopy imaging, immunoelectron microscopy, and fluorescent biomarker tags for protein subcellular location identification. However, the booming era of proteomics and high-throughput sequencing generates tons of newly discovered proteins, making protein subcellular localization by wet-lab experiments a mission impossible. To tackle this concern, in the past decades, artificial intelligence (AI) and machine learning (ML), especially deep learning methods, have made significant progress in this research area. In this article, we review the latest advances in AI-based method development in three typical types of approaches, including sequence-based, knowledge-based, and image-based methods. We also elaborately discuss existing challenges and future directions in AI-based method development in this research field.



A Review for Artificial Intelligence Based Protein Subcellular Localization

Xiao, Zou, Wang et al., 2024, Preprint


“…Most of them use general sequence features rather than hand-crafted features related to specific sorting signals and claim to be able to address the problem of proteins localized at multiple sites (i.e., multi-labeled proteins), though there still remains the problem that their training data do not seem to have been annotated with a uniform criterion (see below). Some of them proposed extensions of existing sequence features, such as the k-mer compositions (Li et al, 2019; Yao et al, 2019; Sahu et al, 2020), while some imported external information, such as Gene Ontology and protein-protein interactions (Chen et al, 2021; Liu et al, 2021; Zhang et al, 2021). One method employed an ensemble approach of multiple classifiers with voting, claiming that the approach is effective in addressing the problem of imbalanced sizes of training data between different localization sites (Wattanapornprom et al, 2021).…”

Section: Miscellaneous Algorithms (mentioning)

confidence: 99%

Recent Advances in the Prediction of Subcellular Localization of Proteins and Related Topics

Nakai, Wei, 2022, Front. Bioinform.

Prediction of subcellular localization of proteins from their amino acid sequences has a long history in bioinformatics and is still actively developing, incorporating the latest advances in machine learning and proteomics. Notably, deep learning-based methods for natural language processing have made great contributions. Here, we review recent advances in the field as well as its related fields, such as subcellular proteomics and the prediction/recognition of subcellular localization from image data.
