Our system is currently under heavy load due to increased usage. We're actively working on upgrades to improve performance. Thank you for your patience.
2022
DOI: 10.1093/bioinformatics/btac214
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting deep transfer learning for the prediction of functional non-coding variants using genomic sequence

Abstract: Motivation Though genome-wide association studies have identified tens of thousands of variants associated with complex traits and most of them fall within the noncoding regions, they may not the causal ones. The development of high-throughput functional assays leads to the discovery of experimental validated noncoding functional variants. However, these validated variants are rare due to technical difficulty and financial cost. The small sample size of validated variants makes it less reliab… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2

Relationship

2
7

Authors

Journals

citations
Cited by 15 publications
(11 citation statements)
references
References 51 publications
(42 reference statements)
0
8
0
Order By: Relevance
“…Therefore, transfer learning can leverage both labelled data from source task, which usually has a large sample size, and labelled data from the target task with a small sample size to improve the prediction performance for the target task. Particularly, deep transfer learning, which utilizes deep neural network in the transfer learning, has been widely adopted in genomic study ( Chen et al , 2022 ). Practically, the first layers are pretrained using labelled data from the source task, frozen and transferred to the target task, where the last layers or all layers are fine-tuned using the labelled data from the target task.…”
Section: Methodsmentioning
confidence: 99%
“…Therefore, transfer learning can leverage both labelled data from source task, which usually has a large sample size, and labelled data from the target task with a small sample size to improve the prediction performance for the target task. Particularly, deep transfer learning, which utilizes deep neural network in the transfer learning, has been widely adopted in genomic study ( Chen et al , 2022 ). Practically, the first layers are pretrained using labelled data from the source task, frozen and transferred to the target task, where the last layers or all layers are fine-tuned using the labelled data from the target task.…”
Section: Methodsmentioning
confidence: 99%
“…Methods such as Mendelian randomization are providing new insights into causal pathways for risk factors [ 102 ], but can be inadequately powered and suffer from weak instrument and survival bias. Machine learning methods, such as deep learning approaches, could improve the way risk variants are identified and functionally assessed within genome-wide association studies [ 103 105 ], creating stronger genetic instruments to improve causal analysis. Contemporary causal machine learning approaches have the potential to enhance our understanding of the underlying mechanisms which connect multiple risk factors, pathologies and the clinical syndrome of dementia itself.…”
Section: Preventionmentioning
confidence: 99%
“…Therefore, methods pinpointing to such regulatory SNPs (rSNPs) are a topic of current interest. Several methods exist that successfully highlight rSNPs based on epigenetic information like open chromatin data, TF and histone ChIP-seq without taking into account which TF might be affected [30,25,2,12]. In contrast, methods that evaluate the effect of a SNP on a TFBS rely on the ability to describe the binding behaviour of a Transcription Factor (TF) to assess the difference induced by a non-coding SNP.…”
Section: Introductionmentioning
confidence: 99%