Barcoding strategies are fundamental to droplet-based single-cell sequencing, and understanding the biases and caveats between approaches is essential. Here, we comprehensively evaluated both short and long reads of the cDNA obtained through the two marketed approaches from 10x Genomics, the "3' assay" and the "5' assay", which attach barcodes at different ends of the mRNA molecule. Although the barcode detection, cell-type identification, and gene expression profile are similar in both assays, the 5' assay captured more exonic molecules and fewer intronic molecules compared to the 3' assay. We found that 13.7% of genes sequenced have longer average read lengths and are more complete (spanning both polyA-site and TSS) in the long reads from the 5' assay compared to the 3' assay. These genes are characterized by long average transcript length, high intron number, and low expression overall. Despite these differences, cell-type-specific isoform profiles observed from the two assays remain highly correlated. This study provides a benchmark for choosing the single-cell assay for the intended research question, and insights regarding platform-specific biases to be mindful of when analyzing data, particularly across samples and technologies.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.