2021
DOI: 10.1371/journal.pcbi.1009423
|View full text |Cite
|
Sign up to set email alerts
|

Segmentation and genome annotation algorithms for identifying chromatin state and other genomic patterns

Abstract: Segmentation and genome annotation (SAGA) algorithms are widely used to understand genome activity and gene regulation. These algorithms take as input epigenomic datasets, such as chromatin immunoprecipitation-sequencing (ChIP-seq) measurements of histone modifications or transcription factor binding. They partition the genome and assign a label to each segment such that positions with the same label exhibit similar patterns of input data. SAGA algorithms discover categories of activity such as promoters, enha… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
46
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 25 publications
(46 citation statements)
references
References 86 publications
(148 reference statements)
0
46
0
Order By: Relevance
“…To facilitate the use of CSREP, we provide an implementation of CSREP as a snakemake pipeline (Mölder et al ., 2021) with a detailed tutorial that only requires users to modify parameters in a yaml file. The program can be run either on local computers or on computing clusters, in which case snakemake will optimize the workflow for execution.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…To facilitate the use of CSREP, we provide an implementation of CSREP as a snakemake pipeline (Mölder et al ., 2021) with a detailed tutorial that only requires users to modify parameters in a yaml file. The program can be run either on local computers or on computing clusters, in which case snakemake will optimize the workflow for execution.…”
Section: Discussionmentioning
confidence: 99%
“…After CSREP trains the multi-class logistic regression model on training data that constitute 10% of the genome, and l 2-norm penalty. The model is implemented using Python’s sklearn, pybedtools package and snakemake (Dale et al ., 2011; Quinlan and Hall, 2010; Mölder et al ., 2021). CSREP applies the model to generate predictions of genome-wide probabilistic chromatin state map for sample n , which is presented in a matrix of size G * S .…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Notably, approaches have been developed for annotating the genome into 'chromatin states' based on the combinatorial and spatial patterns of epigenomic marks inferred de novo from the data. These different 'chromatin states' can correspond to different classes of genomic elements, including enhancers, promoters, and repressive regions [8][9][10][11]. Annotations from these methods have been used for a diverse range of applications, including understanding gene regulation and genetic variants associated with disease [2,11,12].…”
Section: Introductionmentioning
confidence: 99%
“…These different 'chromatin states' can correspond to different classes of genomic elements, including enhancers, promoters, and repressive regions [8][9][10][11]. Annotations from these methods have been used for a diverse range of applications, including understanding gene regulation and genetic variants associated with disease [2,11,12].…”
Section: Introductionmentioning
confidence: 99%