Yunxiang Fu scite author profile

Yunxiang Fu

5Publications

35Citation Statements Received

99Citation Statements Given

How they've been cited

How they cite others

236

Affiliations

Karlsruhe Institute of Technology, University of Hong Kong, Beijing University of Technology

Publications

Order By: Most citations

Protein Representation Learning via Knowledge Enhanced Primary Structure Modeling

Zhou

Zhang³

et al. 2023

Preprint

View full text Add to dashboard Cite

Protein representation learning has primarily benefited from the remarkable development of language models (LMs). Accordingly, pre-trained protein models also suffer from a problem in LMs: a lack of factual knowledge. The recent solution models the relationships between protein and associated knowledge terms as the knowledge encoding objective. However, it fails to explore the relationships at a more granular level, i.e., the token level. To mitigate this, we propose Knowledge-exploited Auto-encoder for Protein (KeAP), which performs token-level knowledge graph exploration for protein representation learning. In practice, non-masked amino acids iteratively query the associated knowledge tokens to extract and integrate helpful information for restoring masked amino acids via attention. We show that KeAP can consistently outperform the previous counterpart on 9 representative downstream applications, sometimes surpassing it by large margins. These results suggest that KeAP provides an alternative yet effective way to perform knowledge enhanced protein representation learning.

show abstract

Emerging disposal technologies of harmful phytoextraction biomass (HPB) containing heavy metals: A review

et al. 2022

View full text Add to dashboard Cite

A GNN-based indoor localization method using mobile RFID platform

Xiong

Liu

et al. 2022

View full text Add to dashboard Cite

CAM: A fine-grained vehicle model recognition method based on visual attention model

Zhu³

et al. 2020

Image and Vision Computing

View full text Add to dashboard Cite

Protein Representation Learning via Knowledge Enhanced Primary Structure Modeling

Zhou¹,

Fu²,

Zhang³

et al. 2023

Preprint

View full text Add to dashboard Cite

Protein representation learning has primarily benefited from the remarkable development of language models (LMs). Accordingly, pre-trained protein models also suffer from a problem in LMs: a lack of factual knowledge. The recent solution models the relationships between protein and associated knowledge terms as the knowledge encoding objective. However, it fails to explore the relationships at a more granular level, i.e., the token level. To mitigate this, we propose Knowledge-exploited Auto-encoder for Protein (KeAP), which performs tokenlevel knowledge graph exploration for protein representation learning. In practice, non-masked amino acids iteratively query the associated knowledge tokens to extract and integrate helpful information for restoring masked amino acids via attention. We show that KeAP can consistently outperform the previous counterpart on 9 representative downstream applications, sometimes surpassing it by large margins. These results suggest that KeAP provides an alternative yet effective way to perform knowledge enhanced protein representation learning. Code and models are available at https://github.com/RL4M/KeAP.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yunxiang Fu

Protein Representation Learning via Knowledge Enhanced Primary Structure Modeling

Emerging disposal technologies of harmful phytoextraction biomass (HPB) containing heavy metals: A review

A GNN-based indoor localization method using mobile RFID platform

CAM: A fine-grained vehicle model recognition method based on visual attention model

Protein Representation Learning via Knowledge Enhanced Primary Structure Modeling

Contact Info

Product

Resources

About