2017
DOI: 10.1093/bioinformatics/btx255
|View full text |Cite
|
Sign up to set email alerts
|

Direct AUC optimization of regulatory motifs

Abstract: MotivationThe discovery of transcription factor binding site (TFBS) motifs is essential for untangling the complex mechanism of genetic variation under different developmental and environmental conditions. Among the huge amount of computational approaches for de novo identification of TFBS motifs, discriminative motif learning (DML) methods have been proven to be promising for harnessing the discovery power of accumulated huge amount of high-throughput binding data. However, they have to sacrifice accuracy for… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
4
1

Relationship

1
9

Authors

Journals

citations
Cited by 32 publications
(7 citation statements)
references
References 39 publications
0
7
0
Order By: Relevance
“…At the same time, we plan to use the CNV detection results in the correction of Cancer Cell Fraction (CCF), which will greatly promote a comprehensive understanding of tumor occurrence and development ( Xi et al, 2018 ; Mao et al, 2021 ; Tarabichi et al, 2021 ). We will also try to apply this algorithm to the research of ancient DNA mutation detection, which may be helpful to explore the evolution process of species ( Zhu et al, 2017b ).…”
Section: Discussionmentioning
confidence: 99%
“…At the same time, we plan to use the CNV detection results in the correction of Cancer Cell Fraction (CCF), which will greatly promote a comprehensive understanding of tumor occurrence and development ( Xi et al, 2018 ; Mao et al, 2021 ; Tarabichi et al, 2021 ). We will also try to apply this algorithm to the research of ancient DNA mutation detection, which may be helpful to explore the evolution process of species ( Zhu et al, 2017b ).…”
Section: Discussionmentioning
confidence: 99%
“…One of the most popular genetic feature inputted for study analysis are SNPs (Single Nucleotide Polymorphisms), these are variants in base pairs within the DNA sequence [8] [6]. While a majority of these SNPs will have little to no impact on the biological systems, the consequential causal sequence can lead to imbalances in chemicals, misfolds in protein polypeptide chains and instability in mRNA transcripts [9] [11][12][13][14][15]. The involvement of these SNPs in the genetic analysis for the purpose of finding risk variants is due to the abundance of variation throughout the genome; proving promising and successful in many determined diseases so far [9][10] [16].…”
Section: Background and Related Workmentioning
confidence: 99%
“…Direct optimization of AUC without gradient-based methods Optimizing AUC directly using coordinate descent (i.e. optimizing one model parameter at a time, which is feasible despite the zero gradient) yields good results for certain machine learning methods that were designed specifically for genetics applications [Zhu et al, 2017]. LeDell et al [2016] introduce an ensemble approach based on Super Learner [van der Laan et al, 2007] which adjusts the combination of scores from several individual classifiers in favor of a higher AUC.…”
Section: Difficulties Of Auc Optimization and Related Workmentioning
confidence: 99%