2016
DOI: 10.1186/s12859-016-1284-2
|View full text |Cite
|
Sign up to set email alerts
|

Je, a versatile suite to handle multiplexed NGS libraries with unique molecular identifiers

Abstract: BackgroundThe yield obtained from next generation sequencers has increased almost exponentially in recent years, making sample multiplexing common practice. While barcodes (known sequences of fixed length) primarily encode the sample identity of sequenced DNA fragments, barcodes made of random sequences (Unique Molecular Identifier or UMIs) are often used to distinguish between PCR duplicates and transcript abundance in, for example, single-cell RNA sequencing (scRNA-seq). In paired-end sequencing, different b… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
102
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 150 publications
(102 citation statements)
references
References 18 publications
0
102
0
Order By: Relevance
“…Read quality was evaluated using FastQC. iCELL8 library barcodes from the first 21 bp reads were assigned to the associated nanowell with the Je demultiplexing suite 30 . Remaining primer sequences, Poly-A/T tails and low-quality ends (<25) were trimmed using Cutadapt.…”
Section: Methodsmentioning
confidence: 99%
“…Read quality was evaluated using FastQC. iCELL8 library barcodes from the first 21 bp reads were assigned to the associated nanowell with the Je demultiplexing suite 30 . Remaining primer sequences, Poly-A/T tails and low-quality ends (<25) were trimmed using Cutadapt.…”
Section: Methodsmentioning
confidence: 99%
“…Briefly, fastq files were uploaded and demultiplex using the "Je-demultiplex" tool (Girardot et al, 2016) . Reads were mapped to the mm10 mouse reference genome (GRCm38) using "Map with BWA" Durbin, 2009, 2010) .…”
Section: Karyotypingmentioning
confidence: 99%
“…Then, the reads at each unique alignment location are independently deduplicated based on the UMI sequences. This is done by tools like UMI-tools [5], zUMIs [7], umis [8], gencore [9], fgbio [10], Picard [11], and Je [12]. It is also possible to skip the alignment step and deduplicate reads directly based on the entire DNA sequence, which includes the UMI (this is done by Calib [13]).…”
Section: Introductionmentioning
confidence: 99%
“…Though there have been many tools proposed for accurately handling UMI data, there have not been many works focused on discussing algorithms that make the UMI deduplication process more efficient. Many previous tools either deduplicate purely based on unique UMIs without being error tolerant [14,3], or require pairwise comparisons between UMIs (i.e., zUMIs [7], Picard [11], Je [12], older versions of UMI-tools [5], gencore [9], and fgbio [10]), which does not efficiently scale to larger datasets. [5] proposed network-based algorithms for estimating the actual number of UMI, which was implemented in their UMI-tools.…”
Section: Introductionmentioning
confidence: 99%