Practice and Experience in Advanced Research Computing 2020
DOI: 10.1145/3311790.3396621
|View full text |Cite
|
Sign up to set email alerts
|

PEGR: a management platform for ChIP-based next generation sequencing pipelines

Abstract: There has been a rapid development in genome sequencing, including high-throughput next generation sequencing (NGS) technologies, automation in biological experiments, new bioinformatics tools and utilization of high-performance computing and cloud computing.ChIP-based NGS technologies, e.g. ChIP-seq and ChIP-exo, are widely used to detect the binding sites of DNA-interacting proteins in the genome and help us to have a deeper mechanistic understanding of genomic regulation. As sequencing data is generated at … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2
1

Relationship

3
0

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 16 publications
(9 reference statements)
0
4
0
Order By: Relevance
“…Additional previously published ChIP-exo datasets for Hsf1, Msn2, Spt15, Spt16, Ifh1, and Fhl1 were included in data processing and analysis for this study 23 , 36 . Data were managed, quality controlled, and processed through a custom automated workflow control called PEGR (Platform for Epi-Genomic Research) 38 . Sequence reads were aligned to the yeast (sacCer3) genome using bwa-mem (v0.7.17) Aligned reads were filtered using Picard (v2.7.1) 39 and samtools (v0.1.18) 40 to remove PCR duplicates (i.e., where the 5’ coordinates-strand of Read_1 and Read_2 were identical to another read pair), and non-uniquely mapping reads.…”
Section: Methodsmentioning
confidence: 99%
“…Additional previously published ChIP-exo datasets for Hsf1, Msn2, Spt15, Spt16, Ifh1, and Fhl1 were included in data processing and analysis for this study 23 , 36 . Data were managed, quality controlled, and processed through a custom automated workflow control called PEGR (Platform for Epi-Genomic Research) 38 . Sequence reads were aligned to the yeast (sacCer3) genome using bwa-mem (v0.7.17) Aligned reads were filtered using Picard (v2.7.1) 39 and samtools (v0.1.18) 40 to remove PCR duplicates (i.e., where the 5’ coordinates-strand of Read_1 and Read_2 were identical to another read pair), and non-uniquely mapping reads.…”
Section: Methodsmentioning
confidence: 99%
“…Previously, we presented a proof-of-concept vision of PEGR software [23]. We now present a fully functional and open-source version of PEGR software, including streamlined experiment tracking with reagent barcoding, flexibility for multiple heterogeneous bioinformatics workflows, and a project management approach.…”
Section: Pegr Logs Sample Information and Experimental Details It Manages Metadata Produced Bymentioning
confidence: 99%
“…It supports Findable, Accessible, Interoperable and Reusable (FAIR) best practices by tracking the metadata throughout the entire sequencing pipeline to enable experimental reproducibility [22]. Previously, we presented a proof-of-concept vision of PEGR software [23]. We now present a fully functional and open-source version of PEGR software, including streamlined experiment tracking with reagent barcoding, flexibility for multiple heterogeneous bioinformatics workflows, and a project management approach.…”
Section: Introductionmentioning
confidence: 99%
“…It supports Findable, Accessible, Interoperable and Reusable (FAIR) best practices by tracking the metadata throughout the entire sequencing pipeline to enable experimental reproducibility [ 29 ]. Previously, we presented a proof-of-concept vision of PEGR software [ 30 ]. We now present a fully functional and open-source version of PEGR software, including streamlined experiment tracking with reagent barcoding, flexibility for multiple heterogeneous bioinformatics workflows, and a project management approach.…”
Section: Introductionmentioning
confidence: 99%