Ermal Elbasani scite author profile

Background Compound–protein interaction prediction is necessary to investigate health regulatory functions and promotes drug discovery. Machine learning is becoming increasingly important in bioinformatics for applications such as analyzing protein-related data to achieve successful solutions. Modeling the properties and functions of proteins is important but challenging, especially when dealing with predictions of the sequence type. Result We propose a method to model compounds and proteins for compound–protein interaction prediction. A graph neural network is used to represent the compounds, and a convolutional layer extended with a bidirectional recurrent neural network framework, Long Short-Term Memory, and Gate Recurrent unit is used for protein sequence vectorization. The convolutional layer captures regulatory protein functions, while the recurrent layer captures long-term dependencies between protein functions, thus improving the accuracy of interaction prediction with compounds. A database of 7000 sets of annotated compound protein interaction, containing 1000 base length proteins is taken into consideration for the implementation. The results indicate that the proposed model performs effectively and can yield satisfactory accuracy regarding compound protein interaction prediction. Conclusion The performance of GCRNN is based on the classification accordiong to a binary class of interactions between proteins and compounds The architectural design of GCRNN model comes with the integration of the Bi-Recurrent layer on top of CNN to learn dependencies of motifs on protein sequences and improve the accuracy of the predictions.

show abstract

AMR-CNN: Abstract Meaning Representation with Convolution Neural Network for Toxic Content Detection

Elbasani

Kim

2022

JWE

View full text Add to dashboard Cite

Recognizing the offensive, abusive, and profanity of multimedia content on the web has been a challenge to keep the web environment for user’s freedom of speech. As profanity filtering function has been developed and applied in text, audio, and video context in platforms such as social media, entertainment, and education, the number of methods to trick the web-based application also has been increased and became a new issue to be solved. Compared to commonly developed toxic content detection systems that use lexicon and keyword-based detection, this work tries to embrace a different approach by the meaning of the sentence. Meaning representation is a way to grasp the meaning of linguistic input. This work proposed a data-driven approach utilizing Abstract meaning Representation to extract the meaning of the online text content into a convolutional neural network to detect level profanity. This work implements the proposed model in two kinds of datasets from the Offensive Language Identification Dataset and other datasets from the Offensive Hate dataset merged with the Twitter Sentiment Analysis dataset. The results indicate that the proposed model performs effectively, and can achieve a satisfactory accuracy in recognizing the level of online text content toxicity.

show abstract

Prediction of trehalose-metabolic pathway and comparative analysis of KEGG, MetaCyc, and RAST databases based on complete genome of Variovorax sp. PAMC28711

et al. 2022

View full text Add to dashboard Cite

Background Metabolism including anabolism and catabolism is a prerequisite phenomenon for all living organisms. Anabolism refers to the synthesis of the entire compound needed by a species. Catabolism refers to the breakdown of molecules to obtain energy. Many metabolic pathways are undisclosed and many organism-specific enzymes involved in metabolism are misplaced. When predicting a specific metabolic pathway of a microorganism, the first and foremost steps is to explore available online databases. Among many online databases, KEGG and MetaCyc pathway databases were used to deduce trehalose metabolic network for bacteria Variovorax sp. PAMC28711. Trehalose, a disaccharide, is used by the microorganism as an alternative carbon source. Results While using KEGG and MetaCyc databases, we found that the KEGG pathway database had one missing enzyme (maltooligosyl-trehalose synthase, EC 5.4.99.15). The MetaCyc pathway database also had some enzymes. However, when we used RAST to annotate the entire genome of Variovorax sp. PAMC28711, we found that all enzymes that were missing in KEGG and MetaCyc databases were involved in the trehalose metabolic pathway. Conclusions Findings of this study shed light on bioinformatics tools and raise awareness among researchers about the importance of conducting detailed investigation before proceeding with any further work. While such comparison for databases such as KEGG and MetaCyc has been done before, it has never been done with a specific microbial pathway. Such studies are useful for future improvement of bioinformatics tools to reduce limitations.

show abstract

LLAD: Life-Log Anomaly Detection Based on Recurrent Neural Network LSTM

Elbasani

Kim

2021

Journal of Healthcare Engineering

View full text Add to dashboard Cite

Life-Log is a term used for the daily monitoring of health conditions and recognizing anomalies from data generated by sensor devices. The development of smart sensors enables collection of health data, which can be considered as a solution to risks associated with personal healthcare by raising awareness regarding health conditions and wellness. Therefore, Life-Log analysis methods are important for real-life monitoring and anomaly detection. This study proposes a method for the improvement and combination of previous methods and techniques in similar fields to detect anomalies in health log data generated by various sensors. Recurrent neural networks with long short-term memory units are used for analyzing the Life-Log data. The results indicate that the proposed model performs more effectively than conventional health data analysis methods, and the proposed approach can yield a satisfactory accuracy in anomaly detection.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Ermal Elbasani

A Survey on RFID in Industry 4.0

GCRNN: graph convolutional recurrent neural network for compound–protein interaction prediction

AMR-CNN: Abstract Meaning Representation with Convolution Neural Network for Toxic Content Detection

Prediction of trehalose-metabolic pathway and comparative analysis of KEGG, MetaCyc, and RAST databases based on complete genome of Variovorax sp. PAMC28711

LLAD: Life-Log Anomaly Detection Based on Recurrent Neural Network LSTM

Contact Info

Product

Resources

About