Murad Chowdhury scite author profile

Visual validation is an important step to minimize false-positive predictions from structural variant (SV) detection. We present Samplot, a tool for creating images that display the read depth and sequence alignments necessary to adjudicate purported SVs across samples and sequencing technologies. These images can be rapidly reviewed to curate large SV call sets. Samplot is applicable to many biological problems such as SV prioritization in disease studies, analysis of inherited variation, or de novo SV review. Samplot includes a machine learning package that dramatically decreases the number of false positives without human review. Samplot is available at https://github.com/ryanlayer/samplot.

show abstract

Samplot: A Platform for Structural Variant Visual Validation and Automated Filtering

Belyeu

Chowdhury

Brown

et al. 2020

Preprint

View full text Add to dashboard Cite

Visual validation is an essential step in structural variant (SV) detection to eliminate false positives. We present Samplot, a tool for quickly creating images that display the read depth and sequence alignments necessary to adjudicate purported SVs across multiple samples and sequencing technologies, including short, long, and phased reads. These simple images can be rapidly reviewed to curate large SV call sets. Samplot is easily applicable to many biological problems such as prioritization of potentially causal variants in disease studies, family-based analysis of inherited variation, or de novo SV review. Samplot also includes a trained machine learning package that dramatically decreases the number of false positives without human review. Samplot is available via the conda package manager or at https://github.com/ryanlayer/samplot.

show abstract

Searching thousands of genomes to classify somatic and novel structural variants using STIX

et al. 2022

View full text Add to dashboard Cite

Structural variants are associated with cancers and developmental disorders, but challenges with estimating population frequency remain a barrier to prioritizing mutations over inherited variants. In particular, variability in variant calling heuristics and filtering limits the use of current structural variant catalogs. We present STIX, a method that, instead of relying on variant calls, indexes and searches the raw alignments from thousands of samples to enable more comprehensive allele frequency estimation.

show abstract

Extracting and Segmenting Container Name from Container Images

Chowdhury¹,

Deb²

2013

IJCA

View full text Add to dashboard Cite

Container name extraction is very important to the modern container management system.Similar techniques have been suggested for vehicle license plate recognition in past decades.Container name extraction has more complexity from license plate extraction because of the severity of nonuniform illumination and invalidation of color information.The main purpose of this paper is to propose a new methodology for text extraction,segmenting text characters and removing non text,background from images. Existing text extraction methods do not work efficiently in case of images with noice and complex background. Documents with only text work efficiently in OCR.The approach used is based on edge detection,close operation,detecting connected components,removing non text regions and character segmentation.

show abstract

ATAC-seq signal processing and recurrent neural networks can identify RNA polymerase activity

Tripodi¹,

Chowdhury²,

Dowell

2019

Preprint

View full text Add to dashboard Cite

Nascent transcription assays are the current gold standard for identifying regions of active transcription, including markers for functional transcription factor (TF) binding. Here we present a signal processingbased model to determine regions of active transcription genome-wide using the simpler assay for transposase-accessible chromatin, followed by high-throughput sequencing (ATAC-seq). The focus of this study is twofold: First, we perform a frequency space analysis of the "signal" generated from ATAC-seq experiments' short reads, at a single-nucleotide resolution, using a discrete wavelet transform. Second, we explore different uses of neural networks to combine this signal with its underlying genome sequence in order to classify ATAC-seq peaks on the presence or absence of bidirectional transcription. We analyze the performance of different data encoding schemes and machine learning architectures, and show how a hybrid signal/sequence representation classified using recurrent neural networks (RNNs) yields the best performance across different cell types.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Murad Chowdhury

Samplot: a platform for structural variant visual validation and automated filtering

Samplot: A Platform for Structural Variant Visual Validation and Automated Filtering

Searching thousands of genomes to classify somatic and novel structural variants using STIX

Extracting and Segmenting Container Name from Container Images

ATAC-seq signal processing and recurrent neural networks can identify RNA polymerase activity

Contact Info

Product

Resources

About