2022
DOI: 10.1101/2022.05.03.490410
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Translation rate prediction and regulatory motif discovery with multi-task learning

Abstract: BackgroundMany studies have found that sequence in the 5’ untranslated regions (UTRs) impacts the translation rate of an mRNA, but the regulatory grammar that underpins this translation regulation remains elusive. Deep learning methods deployed to analyse massive sequencing datasets offer new solutions to motif discovery. However, existing works focused on extracting sequence motifs in individual datasets, which may not be generalisable to other datasets from the same cell type. We hypothesise that motifs that… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
1

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(14 citation statements)
references
References 53 publications
0
9
1
Order By: Relevance
“…The shared encoder is therefore trained on multiple data sets, while the task-specific tower is trained only on the respective task. In comparison to Optimus5' and Framepool, MTtrans provides an increase in R 2 of 0.015 − 0.06 in prediction accuracy, depending on the data set [17]). Interestingly, training MTtrans on multiple data sets at once rather than in a sequential, task-specific manner, achieved an almost similar effect.…”
Section: Mpra Growth-basedmentioning
confidence: 99%
See 1 more Smart Citation
“…The shared encoder is therefore trained on multiple data sets, while the task-specific tower is trained only on the respective task. In comparison to Optimus5' and Framepool, MTtrans provides an increase in R 2 of 0.015 − 0.06 in prediction accuracy, depending on the data set [17]). Interestingly, training MTtrans on multiple data sets at once rather than in a sequential, task-specific manner, achieved an almost similar effect.…”
Section: Mpra Growth-basedmentioning
confidence: 99%
“…The CNN also recovered some of the important regulatory elements such as uORFs [14]. More recently, a novel experimental design was used to accurately measure the output of yeast reporters driven by natural 5'UTRs [15], while novel DL architectures and training approaches aimed to improve prediction accuracy [16,17]. Potential limitations of DL models built from synthetic sequences is that it is a priori unclear whether the training set contains the regulatory elements that are relevant in vivo and whether the features extracted by the model generalize well across systems such as cell types and readouts of the process of interest.…”
Section: Introductionmentioning
confidence: 99%
“…Chen et al 9 introduced the RNA-FM embedding, which was used in conjunction with a 32-layer residual network (ResNet) to predict MRL. Zheng et al 5 developed a multi-task CNN model for TE prediction using multiple data sources.…”
Section: Data Availabilitymentioning
confidence: 99%
“…Investigation into the role of 5' UTRs encompasses various aspects of translational control. With the growing interest in studying and designing 5' UTRs, various computational tools [5][6][7][8][9]5,7,8 have been developed to study its functions. For example, the ribosome load measures the number of ribosomes engaged in translating a given mRNA at a given time.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation