In single-cell RNA-sequencing (scRNA-seq) data, stratification of sequencing reads by cellular barcode is necessary to study cell specific features. However, apart from gene expression, the analyses of cell-specific features are not supported by available tools that are designed for bulk RNA-Seq data. We introduce a tool, SCExecute, which executes a user-provided command on barcode-stratified, extracted on-the-fly, single cell binary alignment map (scBAM) files. SCExecute extracts the cell barcode from aligned, pooled single-cell sequencing data. The user-specified command option executes all the commands defined in the session from monolithic programs and multi-command shell-scripts to complex shell-based pipelines. The execution can be further restricted to barcodes or/and genomic regions of interest. We demonstrate SCExecute with two popular variant callers, GATK and Strelka2, combined with modules for bam file manipulation and variant filtering, to detect single cell-specific expressed Single Nucleotide Variants (sceSNVs) from droplet scRNA-seq data (10X Genomics Chromium System). In conclusion, SCExecute facilitates custom cell-level analyses on barcoded scRNA-seq data using currently available tools and provides an effective solution for studying low (cellular) frequency transcriptome features.
Availability: SCExecute is implemented in Python3 using the PySAM package and distributed for Linux and Python environments from https://github.com/HorvathLab/NGS/tree/master/SCExecute.