2022
DOI: 10.1038/s41588-022-01102-2
|View full text |Cite
|
Sign up to set email alerts
|

A sequence-based global map of regulatory activity for deciphering human genetics

Abstract: Epigenomic profiling has enabled large-scale identification of regulatory elements, yet we still lack a systematic mapping from any sequence or variant to regulatory activities. We address this challenge with Sei, a framework for integrating human genetics data with sequence information to discover the regulatory basis of traits and diseases. Sei learns a vocabulary of regulatory activities, called sequence classes, using a deep learning model that predicts 21,907 chromatin profiles across >1,300 cell lines… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
131
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 100 publications
(134 citation statements)
references
References 40 publications
(67 reference statements)
3
131
0
Order By: Relevance
“…We also observed similar levels of performance of single and multitask models across metrics (Figure 4bc, Supplementary Figure 4ab), consistent with the successful application of multitask models to the prediction of chromatin state from DNA input in bulk and single cells [13][14][15][16][17][18][19][20][21][22][23] . Using a multitask model offers much less training overhead and faster inference times than training 100s of single task models across tasks.…”
Section: In Vitro Rna Binding Prediction With Deepbindsupporting
confidence: 77%
See 1 more Smart Citation
“…We also observed similar levels of performance of single and multitask models across metrics (Figure 4bc, Supplementary Figure 4ab), consistent with the successful application of multitask models to the prediction of chromatin state from DNA input in bulk and single cells [13][14][15][16][17][18][19][20][21][22][23] . Using a multitask model offers much less training overhead and faster inference times than training 100s of single task models across tasks.…”
Section: In Vitro Rna Binding Prediction With Deepbindsupporting
confidence: 77%
“…This data has in turn powered machine learning methods aimed at predicting the functional readouts of these sequences such as histone marks 5 , chromatin accessibility 6 , 3D conformation 7 , and gene expression 8 . Deep learning (DL) has become especially popular in this space, and has been successfully applied to tasks such as DNA and RNA protein binding motif detection [9][10][11][12] , chromatin state prediction [13][14][15][16][17][18][19][20][21][22][23] , transcriptional activity prediction 16,[24][25][26][27] and 3D contact prediction 28,29 . Recently, complementary models have been developed to predict data from massively parallel reporter assays (MPRAs) that directly test the gene regulatory potential of candidate elements [30][31][32] .…”
Section: Introductionmentioning
confidence: 99%
“…Importantly, each of these tools uses a different subset of available annotation data and thus may come to a different conclusion as to which mutations should be prioritized for follow-up. Recent tools such as GWAVA [ 68 ], DeepSEA [ 69 ], and Sei [ 70 ] use machine learning classification models to prioritize non-coding mutations. For example, based on a modified random forest algorithm, GWAVA prioritized five SNPs inside the 3’UTR of the caveolin 2 gene, CAV2 [ 71 ].…”
Section: Strategies For Resolving the Gap In Functional Interpretatio...mentioning
confidence: 99%
“…TF binding and chromatin accessibility DeepBind (Alipanahi et al, 2015) DeepBind (Alipanahi et al, 2015) FactorNet (Quang and Xie, 2019) BPNet (Avsec et al, 2021b) DeepSEA (Zhou and Troyanskaya, 2015) Basset (Kelley et al, 2016) ChromBPNet (Trevino et al, 2021) Basset (Kelley et al, 2016) FactorNet (Quang and Xie, 2019) DeepMEL2 (Atak et al, 2021) DeepFIGV (Hoffman et al, 2019) DeepFIGV (Hoffman et al, 2019) MPRA-DragoNN (Movva et al, 2019) DeepMEL2 (Atak et al, 2021) DeepFun (Pei et al, 2021) DNABERT (Ji et al, 2021) Gene expression Basenji (Kelley et al, 2018) Enformer (Avsec et al, 2021a) Basenji (Kelley et al, 2018) Xpresso (Agarwal and Shendure, 2020) ExPecto (Zhou et al, 2018) Xpresso (Agarwal and Shendure, 2020) Enformer (Avsec et al, 2021a) Enformer (Avsec et al, 2021a) Chromatin conformation Akita (Fudenberg et al, 2020) DeepC (Schwessinger et al, 2020) Orca (Zhou, 2022) Lan and Corces 10.3389/fnagi.2022.1027224…”
Section: Deeplift or Deepshapmentioning
confidence: 99%