2017
DOI: 10.1093/bioinformatics/btx655
|View full text |Cite
|
Sign up to set email alerts
|

Bartender: a fast and accurate clustering algorithm to count barcode reads

Abstract: Supplementary data are available at Bioinformatics online.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
76
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 84 publications
(78 citation statements)
references
References 47 publications
0
76
0
Order By: Relevance
“…Raw sequence files, separated by their fraction-associated barcodes, were processed with Cutadapt 47 , outputting the 50 nt UTR and 9 -15 nt of the N-terminal of the CDS. UTRs were clustered and UMIs were counted using Bartender 48 . The eGFP library contained approximately 750,000 unique sequences and the mCherry library contained approximately 500,000 sequences.…”
Section: Sequence Processingmentioning
confidence: 99%
“…Raw sequence files, separated by their fraction-associated barcodes, were processed with Cutadapt 47 , outputting the 50 nt UTR and 9 -15 nt of the N-terminal of the CDS. UTRs were clustered and UMIs were counted using Bartender 48 . The eGFP library contained approximately 750,000 unique sequences and the mCherry library contained approximately 500,000 sequences.…”
Section: Sequence Processingmentioning
confidence: 99%
“…Lineage tracking from barcode sequencing was reconstructed as described in 41 and using https://github.com/Sherlock-Lab/Barcode_seq/blob/master/bartender_BC1_BC2.py with some minor modifications. Briefly, after extraction of the UMI, and both low and high complexity barcodes from the sequencing read, low complexity barcodes were clustered against their expected sequences, whereas the high complexity barcodes were pooled across all libraries and clustered with bartender (v1.1) 45 . The updated reads and the UMIs were used to derive raw barcode counts, which were assembled into the raw count lineage trajectories.…”
Section: Dfe / Mutational Fitness Spectrum U(s) Inferencementioning
confidence: 99%
“…To quantify individual lineages, we isolated the subpopulation containing CNVs from two populations (bc01 and bc02) at multiple timepoints (generations 70, 90, 150, and 270) using fluorescence activated cell sorting (FACS) ( Figure 5A ). We sequenced barcodes from the CNV subpopulation at each time point and determined the number of unique lineages ( [69] and methods ). To account for variation in the purity of the isolated CNV subpopulation, we analyzed individual clones from the CNV subpopulation isolated by FACS to estimate a false positive rate, which we find varies as a function of time point ( Figure S12B and methods ), and applied this correction to barcode counts ( Table S10 ).…”
Section: Glucose-limitation Urea-limitation Glutamine-limitationmentioning
confidence: 99%
“…However, the reverse read failed due to over-clustering, so all analyses were performed only using the forward read. We used the Bartender algorithm with UMI handling to account for PCR duplicates and to cluster sequences with merging decisions based solely on distance except in cases of low coverage (<500 reads/barcode), for which the default cluster merging threshold was used [69] . Clusters with a size less than four or with high entropy (>0.75 quality score) were discarded.…”
Section: Quantifying the Number Of Cnv Lineagesmentioning
confidence: 99%