2021
DOI: 10.1101/2021.02.17.431338
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

RaptGen: A variational autoencoder with profile hidden Markov model for generative aptamer discovery

Abstract: Nucleic acid aptamers are generated by an in vitro molecular evolution method known as systematic evolution of ligands by exponential enrichment (SELEX). A variety of candidates is limited by actual sequencing data from an experiment. Here, we developed RaptGen, which is a variational autoencoder for in silico aptamer generation. RaptGen exploits a profile hidden Markov model decoder to represent motif sequences effectively. We showed that RaptGen embedded simulation sequence data into low-dimension latent spa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
2

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(7 citation statements)
references
References 45 publications
(64 reference statements)
0
6
0
Order By: Relevance
“…This research was continued by Park et al and a similar conclusion was drawn for RNA aptamers specifically [39]. In [40], Iwano et al propose RaptGen which is similar to [37] and [39]. They too utilized DeepBind, however, RaptGen implemented a CNN-LSTM as their encoder and a profile hidden Markov model as their decoder.…”
Section: Aptamer Generationmentioning
confidence: 81%
See 2 more Smart Citations
“…This research was continued by Park et al and a similar conclusion was drawn for RNA aptamers specifically [39]. In [40], Iwano et al propose RaptGen which is similar to [37] and [39]. They too utilized DeepBind, however, RaptGen implemented a CNN-LSTM as their encoder and a profile hidden Markov model as their decoder.…”
Section: Aptamer Generationmentioning
confidence: 81%
“…After obtaining 849 unique known aptamer sequences [13,51,52], only 390 met the condition of being between the minimum (20) and maximum (40)…”
Section: Datamentioning
confidence: 99%
See 1 more Smart Citation
“…Embedding biological sequences using NLP overcomes these difficulties and outperforms existing methods in many tasks, such as function, structure, localization, and disorder prediction (Table 1). In addition to these general biological tasks, representation learning has also been used to solve specific problems, such as RNA aptamer optimization [98], viral mutation prediction [56], and venom toxin prediction [36]. In these studies, representation learning of biological sequences could capture biophysical and biochemical properties of biological systems, and representation learning may reveal the grammar of life.…”
Section: Discussionmentioning
confidence: 99%
“…and a similar conclusion was drawn for RNA aptamers specifically [ 39 ]. In [ 40 ], Iwano et al . propose RaptGen which is similar to [ 37 ] and [ 39 ].…”
Section: Related Workmentioning
confidence: 99%