Memory-based collaborative filtering (CF) helps users find their favorite items in recommender systems. This approach has proven successful in recommender systems such as e-commerce platforms. Its underlying idea is that the interests of a particular user tend to be consistent with those of users who share similar preferences. Selecting an appropriate similarity measure is therefore critical for making recommendations. This paper proposes a new similarity measure, the adjusted Euclidean distance (AED) method, which unifies all Euclidean distances between vectors in vector spaces of different dimensions. AED has the advantage of taking both the length of the vectors and the differing dimensionality of the vector spaces into consideration. Using two datasets, MovieLens and Book-Crossing, we conduct experiments comparing AED with two notable existing methods. The experimental results demonstrate that AED improves the accuracy of prediction and recommendation.
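The role a similarity measure plays in memory-based CF can be illustrated with a minimal sketch. The abstract does not give the AED formula, so the code below is not AED: it uses a plain Euclidean distance over co-rated items, divided by the number of co-rated items (a root-mean-square difference) as one simple, assumed way to keep distances comparable when user pairs overlap on different numbers of items. All function names and the toy ratings are hypothetical.

```python
from math import sqrt


def co_rated(ratings_a, ratings_b):
    """Return the items rated by both users, in sorted order."""
    return sorted(set(ratings_a) & set(ratings_b))


def normalized_euclidean_similarity(ratings_a, ratings_b):
    """Similarity in (0, 1] from a dimension-normalized Euclidean distance.

    Assumption: this is an illustrative baseline, NOT the paper's AED.
    Dividing the squared distance by the number of co-rated items keeps
    user pairs with many shared items from looking artificially distant.
    """
    common = co_rated(ratings_a, ratings_b)
    if not common:
        return 0.0  # no shared items, so no evidence of similarity
    squared = sum((ratings_a[i] - ratings_b[i]) ** 2 for i in common)
    rms_difference = sqrt(squared / len(common))
    return 1.0 / (1.0 + rms_difference)


def predict(target, item, all_ratings):
    """Similarity-weighted average of other users' ratings for `item`."""
    weighted_sum = 0.0
    weight_total = 0.0
    for user, ratings in all_ratings.items():
        if user == target or item not in ratings:
            continue
        sim = normalized_euclidean_similarity(all_ratings[target], ratings)
        weighted_sum += sim * ratings[item]
        weight_total += sim
    return weighted_sum / weight_total if weight_total > 0 else None


# Hypothetical toy data: user -> {item: rating}
toy_ratings = {
    "alice": {"a": 5, "b": 3},
    "bob": {"a": 5, "b": 3, "c": 4},
    "carol": {"a": 1, "b": 5, "c": 2},
}

if __name__ == "__main__":
    print(predict("alice", "c", toy_ratings))
```

In this toy example, the prediction for alice on item "c" is pulled toward bob's rating, because bob's ratings on the shared items match hers exactly while carol's differ.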
Background: MicroRNAs (miRNAs) are a class of non-coding RNAs that play an essential role in gene regulation by binding to messenger RNAs (mRNAs). Accurate and rapid identification of miRNA target genes helps reveal the mechanisms of transcriptome regulation, which is of great significance for the study of cancer and other diseases. Many bioinformatics methods have been proposed to solve this problem, but previous research did not further study the encoding of the base sequence. Results: In this study, we developed a novel method that combines word embedding and deep learning for site-level prediction of human miRNA targets, inspired by the similarity between natural language and biological sequences. First, the word2vec model was used to learn distributed representations of miRNAs and mRNAs. Then, features were extracted automatically along the temporal and spatial dimensions by a stacked bidirectional long short-term memory (BiLSTM) network. We compared the effects of different embedding methods on accuracy across different deep learning models, and the results show that using word2vec can improve the accuracy of deep learning models. In addition, we performed a visual analysis of the sequences in their distributed representation and found hidden similarity relationships between bases. Finally, in comparisons with several advanced methods on different data sets, our proposed method performed better on multiple evaluation measures. Conclusions: We present a novel method for predicting miRNA target sites that consists of word2vec and a BiLSTM model, and we demonstrate that this method achieves automatic feature extraction and higher accuracy. Furthermore, we treat miRNA and mRNA as two languages for the first time and explore their biological significance through visual analysis.
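Treating a sequence as a "sentence" requires splitting it into "words" before an embedding model such as word2vec can be trained. The abstract does not say how the sequences are tokenized, so the sketch below assumes overlapping k-mers, with `k=1` reducing to single bases. The function names, the default `k`, and the example sequence are all hypothetical and serve only to illustrate the preprocessing idea, not the authors' pipeline.

```python
def kmer_tokens(sequence, k=3, stride=1):
    """Split a nucleotide sequence into overlapping k-mer "words".

    Assumption: k-mer tokenization is an illustrative choice; the source
    does not specify the tokenization used before word2vec.
    """
    if k <= 0 or stride <= 0:
        raise ValueError("k and stride must be positive")
    seq = sequence.upper()
    # A sequence shorter than k yields no tokens.
    return [seq[i:i + k] for i in range(0, len(seq) - k + 1, stride)]


def build_vocab(token_lists):
    """Map each distinct token to an integer id, in first-seen order."""
    vocab = {}
    for tokens in token_lists:
        for token in tokens:
            if token not in vocab:
                vocab[token] = len(vocab)
    return vocab


if __name__ == "__main__":
    # Hypothetical short RNA fragment used only for demonstration.
    tokens = kmer_tokens("ugagguag", k=3)
    print(tokens)
    print(build_vocab([tokens]))
```

The resulting token lists are the kind of input an embedding model consumes, one list per sequence; the integer ids from `build_vocab` would index the rows of an embedding matrix feeding a downstream network.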