DIVAN: accurate identification of non-coding disease-specific risk variants using multi-omics profiles

Chen, Li; Jin, Peng; Qin, Zhaohui S.

doi:10.1186/s13059-016-1112-z

Cited by 75 publications

(79 citation statements)

References 39 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…However, IW-Scoring still allows for the functional variants associated with specific tissues, cells and features to be identified through the regulatory annotation module. This is currently lacking in many other methods, although some algorithms have chosen to focus on the identification of disease/tissue specific risk variants recently [22, 41]. Compared to most available methods, we believe our approach is optimally balanced between summarised and detailed evidences for the diverse range of users.…”

Section: Discussionmentioning

confidence: 99%

“…Via a vigorous weight learning process, strong weights were assigned to the block of closely correlated scores (Eigen, DeepSEA, FATHMM noncoding, ReMM and CADD), and the derived IW-Scoring significantly outperformed individual constituent scores (including Eigen and Eigen-PC) across various data sets, demonstrating the accuracy and validity of our approach. Such ensemble based approach with different estimated weights has been shown to perform better than any single component classifier [26], and has been widely used in various bioinformatics problems [41, 42]. The weighted integration technique based on the eigendecomposition of the covariance matrix also offers the flexibility to incorporate any other correlated genome-wide functional scores/features into the integrative scores.…”

Section: Discussionmentioning

confidence: 99%

See 1 more Smart Citation

IW-Scoring: an Integrative Weighted Scoring framework for annotating and prioritizing genetic variations in the noncoding genome

Wang

Ullah

Chelala

2017

Preprint

View full text Add to dashboard Cite

IW-Scoring represents a new Integrative Weighted Scoring model to annotate and prioritise functionally relevant noncoding variations. The pipeline integrates 11 popular algorithms and outperforms individual methods in three independent data sets, including variants in ClinVar database and GWAS studies, and cancer mutations.Using IW-Scoring, we located 11 recurrently mutated noncoding regions enriched for at least three functional mutations in 14 follicular lymphoma genomes, and validated 9 clusters (82%) in the International Cancer Genome Consortium cohort (n=36), including promoter and enhancer regions of PAX5. IW-Scoring offers greater versatility to identify trait and disease associated noncoding variants.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Discussionmentioning

confidence: 99%

IW-Scoring: an Integrative Weighted Scoring framework for annotating and prioritizing genetic variations in the noncoding genome

Wang

Ullah

Chelala

2017

Preprint

View full text Add to dashboard Cite

show abstract

“…Such variants do not change the functionality of a gene product but instead are thought to affect transcript levels through distant gene regulation. In support of this concept, many of these genomic sites display transcription regulatory potential in disease-relevant cell-types, as judged from their epigenetic signatures (1000Genomes Project Consortium et al, 2010Chen et al, 2016;Maurano et al, 2012). Gene regulation over distance requires chromatin looping to bring distal regulatory DNA elements in close proximity to target genes.…”

Section: Hi-c To Link Disease Variants To Genesmentioning

confidence: 99%

“…However, the identification of target genes and underlying mechanisms remains challenging. Indeed, more than 90% of disease-associated variation resides in noncoding DNA (1000Genomes Project Consortium et al, 2010Chen et al, 2016;Maurano et al, 2012).…”

Section: Introductionmentioning

confidence: 99%

Detailed Regulatory Interaction Map of the Human Heart Facilitates Gene Discovery for Cardiovascular Disease

Bianchi

Geeven

Tucker

et al. 2019

Preprint

View full text Add to dashboard Cite

SummaryMost disease-associated variants identified by population based genetic studies are non-coding, which compromises finding causative genes and mechanisms. Presumably they interact through looping with nearby genes to modulate transcription. Hi-C provides the most complete and unbiased method for genome-wide identification of potential regulatory interactions, but finding chromatin loops in Hi-C data remains difficult and tissue specific data are limited. We have generated Hi-C data from primary cardiac tissue and developed a method, peakHiC, for sensitive and quantitative loop calling to uncover the human heart regulatory interactome. We identify complex CTCF-dependent and -independent contact networks, with loops between coding and non-coding gene promoters, shared enhancers and repressive sites. Across the genome, enhancer interaction strength correlates with gene transcriptional output and loop dynamics follows CTCF, cohesin and H3K27Ac occupancy levels. Finally, we demonstrate that intersection of the human heart regulatory interactome with cardiovascular disease variants facilitates prioritizing disease-causative genes.

show abstract

“…Feature selection and outlier removal were employed to achieve the best performance. The optimal feature set was selected depend on the largest area under the receiver operating characteristic curve (ROC-AUC) value as described in previous study [27]. Briefly, the confidence of each feature was measured by p values based on Wilcoxon rank sum test.…”

Section: Feature Selection and Outlier Removalmentioning

confidence: 99%

IPEV: a web server for inferring pathogenic enhancers with variants

Zhang¹,

Xie²,

Bai³

et al. 2019

Preprint

View full text Add to dashboard Cite

Background Enhancer has been recognized as an important driver whose genetic alterations contribute to disease progression. However, there is still no easy-to-use tools to identify pathogenic enhancers, allowing for deciphering functional influence of genetic variants on enhancer. Results We developed a user-friendly one-stop shop platform, named inferring pathogenic enhancer with variant (IPEV), only requiring variants as input, to quickly infer the pathogenic enhancers that harbor variants affecting their activities. Results of IPEV are explored in an interactive, user-friendly web environment, which is designed to highlight the most probable pathogenic enhancers and their target genes. Furthermore, IPEV provides intuitive visualizations of how a variant affects the corresponding enhancer activity by mediating TF binding changes. Conclusions IPEV is specially designed to prioritize the potentially pathogenic enhancers with genetic variants, and provides intuitive visualizations how a variant affects the corresponding enhancer activity by mediating which transcription factor binding changes. The use of IPEV does not require any specialized computer skills. We believe that IPEV will be useful in interpreting non-coding variants by the inferring pathogenic enhancers. It is freely available at http://biocc.hrbmu.edu.cn/IPEV/ or http://210.46.80.168/IPEV and supports recent versions of all major browsers.

show abstract

DIVAN: accurate identification of non-coding disease-specific risk variants using multi-omics profiles

Cited by 75 publications

References 39 publications

IW-Scoring: an Integrative Weighted Scoring framework for annotating and prioritizing genetic variations in the noncoding genome

IW-Scoring: an Integrative Weighted Scoring framework for annotating and prioritizing genetic variations in the noncoding genome

Detailed Regulatory Interaction Map of the Human Heart Facilitates Gene Discovery for Cardiovascular Disease

IPEV: a web server for inferring pathogenic enhancers with variants

Contact Info

Product

Resources

About