Efficient Data Mining Algorithms for Screening Potential Proteins of Drug Target

BioMed Research International

2017

Self Cite

The identification and validation of drug targets are crucial in biomedical research and many studies have been conducted on analyzing drug target features for getting a better understanding on principles of their mechanisms. But most of them are based on either strong biological hypotheses or the chemical and physical properties of those targets separately. In this paper, we investigated three main ways to understand the functional biomolecules based on the topological features of drug targets. There are no significant differences between targets and common proteins in the protein-protein interactions network, indicating the drug targets are neither hub proteins which are dominant nor the bridge proteins. According to some special topological structures of the drug targets, there are significant differences between known targets and other proteins. Furthermore, the drug targets mainly belong to three typical communities based on their modularity. These topological features are helpful to understand how the drug targets work in the PPI network. Particularly, it is an alternative way to predict potential targets or extract nontargets to test a new drug target efficiently and economically. By this way, a drug target's homologue set containing 102 potential target proteins is predicted in the paper.

Section: Discussionmentioning

confidence: 99%

Section: Data Collectionmentioning

confidence: 99%

Drug Target Protein-Protein Interaction Networks: A Systematic Perspective

Feng

BioMed Research International

2017

Self Cite

“…However, with the in-depth research and theoretical development, especially in the 21st century, with the significant improvement of computer ability, learning natural language based on deep learning technology has gradually matured. In 2013, Wang and others proposed the word2vec algorithm and a bag-of-words model was constructed using neural network, and the word vector representation of the target language was calculated according to the context word distribution of the target language in a large-scale corpus [7]. Word2vec algorithm completes the transformation from text representation to static number vector.…”

Section: Literature Reviewmentioning

confidence: 99%

Practical Skills of Business English Correspondence Writing Based on Data Mining Algorithm

Liu

Habil

2022

Scientific Programming

English correspondence writing has become a necessary skill for every scientific researcher and high-tech talents. An English correspondence writing auxiliary writing system can help nonnative English speakers make up for the lack of professional expression. The key factor of business English correspondence writing system is the construction of knowledge base. To improve the business English correspondence writing knowledge base, we need to mine frequent patterns of sentences in each category. The purpose of this topic is to improve and supplement the knowledge base for the business English correspondence writing system and propose frequent pattern mining for sentences in each category, so as to improve the writing knowledge base for the business English correspondence writing system. Firstly, we crawl a large number of business English letters and telegrams from the Internet, extract the relevant summary information, then store it, and preliminarily construct a corpus based on sentences. Then, we do some research on the structure of business English correspondence abstracts, mark the sentences in the corpus and count the relevant information, and have a certain understanding of their writing methods. Finally, we mine frequent patterns for sentences in each category, so as to improve the knowledge base of summary writing for the business English correspondence writing system. In the experiment, we use the classical FP growth algorithm as the mining method. The experiment shows that the frequent patterns between 3 and 6 words have been mined to a certain extent. By gradually improving the mining strategy, the quality of mining results has been improved and the writing effect of business English correspondence of scientific researchers has been improved.

“…Some studies were dedicated to developing a more efficient statistical inference method. With the boom in machine learning methods and high credibility biological database [ 7 ], new methods are in great need to help identify novel pathway regulation relationships of protein interactions.…”

Section: Introductionmentioning

confidence: 99%

PAIRS: Prediction of Activation/Inhibition Regulation Signaling Pathway

Feng

Computational Intelligence and Neuroscience

2017

Self Cite

Uncovering the signaling architecture in protein-protein interaction (PPI) can certainly benefit the understanding of disease mechanisms and promise to facilitate the therapeutic interventions. Therefore, it is important to reveal the signaling relationship from one protein to another in terms of activation and inhibition. In this study, we propose a new measurement to characterize the regulation relationship of a PPI pair. By utilizing both Gene Ontology (GO) functional annotation and protein domain information, we developed a tool called Prediction of Activation/Inhibition Regulation Signaling Pathway (PAIRS) that takes protein interaction pairs as input and gives both known and predicted result of the human protein regulation relationship in terms of activation and inhibition. It helps to give prognostic regulation information for further signaling pathway reconstruction.