2020
DOI: 10.1101/2020.03.16.993501
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Explainable Transformer Models for Functional Genomics in Prokaryotes

Abstract: The annotation of transcription start sites with computational methods is an important and unsolved problem in genomics. In recent years, several novel experimental methodologies -named Cappable-seq, SMRT-Cappable-seq and SEnd-seq -have been introduced for the detection of transcription start sites and applied on E. coli. In this study, a comparison is made between these new methodologies and the curated transcription start site data set featured by RegulonDB. The analysis between these data sets is facilitate… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 48 publications
(62 reference statements)
0
1
0
Order By: Relevance
“…Several newly developed techniques have provided a comprehensive view of the E. coli transcriptome by identifying full-length primary transcripts (13)(14)(15)(16)(17). For example, SMRT-Cappable-seq (6) combines the isolation of the full-length bacterial primary transcriptome with PacBio SMRT (Single Molecule, Real-Time) sequencing (6), and simultaneous 5' and 3' end sequencing (SEnd-seq) (7) captures both transcription start sites (TSSs) and transcription termination sites (TTSs) via circularization of transcripts (17). Despite the great progress in experimental techniques, there are still some deficiencies.…”
Section: Introductionmentioning
confidence: 99%
“…Several newly developed techniques have provided a comprehensive view of the E. coli transcriptome by identifying full-length primary transcripts (13)(14)(15)(16)(17). For example, SMRT-Cappable-seq (6) combines the isolation of the full-length bacterial primary transcriptome with PacBio SMRT (Single Molecule, Real-Time) sequencing (6), and simultaneous 5' and 3' end sequencing (SEnd-seq) (7) captures both transcription start sites (TSSs) and transcription termination sites (TTSs) via circularization of transcripts (17). Despite the great progress in experimental techniques, there are still some deficiencies.…”
Section: Introductionmentioning
confidence: 99%