2022
DOI: 10.3389/fmolb.2022.916639
|View full text |Cite
|
Sign up to set email alerts
|

Inferring microRNA regulation: A proteome perspective

Abstract: Post-transcriptional regulation in multicellular organisms is mediated by microRNAs. However, the principles that determine if a gene is regulated by miRNAs are poorly understood. Previous works focused mostly on miRNA seed matches and other features of the 3′-UTR of transcripts. These common approaches rely on knowledge of the miRNA families, and computational approaches still yield poor, inconsistent results, with many false positives. In this work, we present a different paradigm for predicting miRNA-regula… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 9 publications
(10 citation statements)
references
References 58 publications
(64 reference statements)
0
10
0
Order By: Relevance
“…A concern was whether the model was simply identifying surface-level patterns, such as associating all diseases from a specific data-source, such as Orphanet (an orphan disease database) with a subtype. Such hidden confounders are common in many predictive scenarios [35], [36]. To validate our model, we examined the distribution of disease subtypes in our predictions against the known subtypes, focusing on their source database ( Table 1 ).…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…A concern was whether the model was simply identifying surface-level patterns, such as associating all diseases from a specific data-source, such as Orphanet (an orphan disease database) with a subtype. Such hidden confounders are common in many predictive scenarios [35], [36]. To validate our model, we examined the distribution of disease subtypes in our predictions against the known subtypes, focusing on their source database ( Table 1 ).…”
Section: Resultsmentioning
confidence: 99%
“…These features are denoted as “Text Embedding X’’ in Fig 3, where X represents a vector in the embedding. For interpretability, we implemented an automated explanation framework showing exemplars of high, neutral and low values for embedding dimensions, inspired by approaches in other NLP or automated-machine learning works[35], [36], [52]. It is available in our codebase.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Multiple partitions of the data were analyzed separately: the entire dataset (25,117 sequences), a human-only subset (18,418 sequences), and only viruses from genera with a human host (3,915 sequences). Features were extracted using the SparkBeyond autoML framework, as described in (Ofer and Linial, 2022). Inputs included the protein sequence, length, taxonomy, name, UniProt keywords, virus-host species, and Baltimore classification, but not embeddings.…”
Section: Methodsmentioning
confidence: 99%
“…Functionalities in the SparkBeyond Discovery platform (SparkBeyond, Israel) were adopted to provide an AI-driven engine as a tool for extraction of meaningful features and to perform experiments on various machine learning models to learn and solve the time-series problem. The tool has been successfully applied in many research works to gain insight and create efficient models [ 28 , 29 , 30 ], as well as several industrial applications, ranging from banking, and electronic commerce, to insurance.…”
Section: Data Analytics and Prediction Modelsmentioning
confidence: 99%