2010
DOI: 10.1093/bioinformatics/btq460
|View full text |Cite
|
Sign up to set email alerts
|

A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags

Abstract: Motivation: Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) is widely used in biological research. ChIP-seq experiments yield many ambiguous tags that can be mapped with equal probability to multiple genomic sites. Such ambiguous tags are typically eliminated from consideration resulting in a potential loss of important biological information.Results: We have developed a Gibbs sampling-based algorithm for the genomic mapping of ambiguous sequence tags. Our algorithm relies on th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
28
0

Year Published

2011
2011
2018
2018

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 35 publications
(28 citation statements)
references
References 14 publications
(18 reference statements)
0
28
0
Order By: Relevance
“…Both uniquely aligned reads and those aligned to less than 10, 100, 1000 or 10000 locations were analyzed. For reads aligned to multiple regions, a single location was chosen based on a Gibbs sampling algorithm 52 , using 5 or 10 iterations. The sequencing reads count on DR2 Alu was computed using intersectBed tool in BedTools 53 , and were normalized to the sequencing depth.…”
Section: Methodsmentioning
confidence: 99%
“…Both uniquely aligned reads and those aligned to less than 10, 100, 1000 or 10000 locations were analyzed. For reads aligned to multiple regions, a single location was chosen based on a Gibbs sampling algorithm 52 , using 5 or 10 iterations. The sequencing reads count on DR2 Alu was computed using intersectBed tool in BedTools 53 , and were normalized to the sequencing depth.…”
Section: Methodsmentioning
confidence: 99%
“…To deal with this challenge, we used a strategy of assigning a mapping likelihood (mL tag) to reads equal to the inverse of its genome-wide mapping positions; in effect, splitting up a read and mapping an equal portion of it to all of its possible sites of true mapping (Wang et al 2010;Chung et al 2011). By using this alternative mapping strategy, we then build fragment densities to display on enrichment tracks and to call peaks (see Methods).…”
Section: Subtelomere Annotationmentioning
confidence: 99%
“…Sequence tags were mapped to the human genome reference sequence and co-located with genes and TEs as described in the Materials and Methods section. A recently developed algorithm for mapping ambiguous tags was used to ensure maximal coverage of repetitive TE sequences for the short sequence tags used 35 . This algorithm ensures that the best single genomic location for each multi-mapping tag is chosen, thus ensuring deeper coverage of TE sequences than would be achieved if multi-mapping tags were discarded.…”
Section: Resultsmentioning
confidence: 99%
“…RNA sequence reads were mapped to the human genome reference sequence (NCBI36/hg18) using the program Bowtie 47 with a threshold of ≤ 2 mismatches allowed. The most likely mapping locations for reads that mapped to more than one location were rescued using the Gibbs sampling strategy for multi-mapping tags 35 . mRNA and sRNA sequence tags mapped and processed in this way were co-located with human gene and TE loci annotated in the UCSC Genome Browser 48 .…”
Section: Methodsmentioning
confidence: 99%