Akash Kumar scite author profile

Motivation: The ability to generate massive amounts of sequencing data continues to overwhelm the processing capability of existing algorithms and compute infrastructures. In this work, we explore the use of hardware/software co-design and hardware acceleration to significantly reduce the execution time of short sequence alignment, a crucial step in analyzing sequenced genomes. We introduce Shouji, a highly parallel and accurate pre-alignment filter that remarkably reduces the need for computationally costly dynamic programming algorithms. The first key idea of our proposed pre-alignment filter is to provide high filtering accuracy by correctly detecting all common subsequences shared between two given sequences. The second key idea is to design a hardware accelerator that adopts modern field-programmable gate array (FPGA) architectures to further boost the performance of our algorithm.

Results: Shouji significantly improves the accuracy of pre-alignment filtering by up to two orders of magnitude compared to the state-of-the-art pre-alignment filters, GateKeeper and SHD. Our FPGA-based accelerator is up to three orders of magnitude faster than the equivalent CPU implementation of Shouji. Using a single FPGA chip, we benchmark the benefits of integrating Shouji with five state-of-the-art sequence aligners, designed for different computing platforms. The addition of Shouji as a pre-alignment step reduces the execution time of the five state-of-the-art sequence aligners by up to 18.8×. Shouji can be adapted for any bioinformatics pipeline that performs sequence alignment for verification. Unlike most existing methods that aim to accelerate sequence alignment, Shouji does not sacrifice any of the aligner capabilities, as it does not modify or replace the alignment step.

Availability and implementation: https://github.com/CMU-SAFARI/Shouji.

Supplementary information: Supplementary data are available at Bioinformatics online.
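The general idea of pre-alignment filtering in the Shouji abstract, cheaply rejecting sequence pairs before running dynamic programming, can be illustrated with a small Python sketch. This is not Shouji's algorithm: the filter below swaps in a simple character-frequency lower bound on edit distance, and every function name here is hypothetical. Because the bound never exceeds the true edit distance, the filter only discards pairs that dynamic programming would also reject.

```python
from collections import Counter


def frequency_lower_bound(a: str, b: str) -> int:
    """Cheap lower bound on edit distance (assumed stand-in filter, not Shouji)."""
    count_a, count_b = Counter(a), Counter(b)
    # Counter subtraction keeps only positive differences.
    surplus_a = sum((count_a - count_b).values())
    surplus_b = sum((count_b - count_a).values())
    return max(surplus_a, surplus_b)


def edit_distance(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance (the costly step)."""
    prev = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, 1):
        cur = [i]
        for j, ch_b in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                    # deletion
                           cur[j - 1] + 1,                 # insertion
                           prev[j - 1] + (ch_a != ch_b)))  # match or substitution
        prev = cur
    return prev[-1]


def filter_then_align(pairs, threshold):
    """Run DP only on pairs that survive the cheap filter."""
    accepted, dp_calls = [], 0
    for read, ref in pairs:
        if frequency_lower_bound(read, ref) > threshold:
            continue  # edit distance must exceed the threshold; skip DP
        dp_calls += 1
        if edit_distance(read, ref) <= threshold:
            accepted.append((read, ref))
    return accepted, dp_calls


pairs = [("ACGT", "ACGA"), ("AAAA", "TTTT"), ("ACGT", "ACGT")]
accepted, dp_calls = filter_then_align(pairs, threshold=1)
```

In this example the second pair is rejected by the filter alone, so dynamic programming runs on two of the three pairs.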

Online Peak Power and Maximum Temperature Management in Multi-core Mixed-Criticality Embedded Systems

Ranjbar, Nguyen, Ejlali et al. 2019

DeMAS: An efficient design methodology for building approximate adders for FPGA-based systems

Prabakaran, Rehman, Hanif et al. 2018

Accelerating throughput-aware runtime mapping for heterogeneous MPSoCs

Singh, Kumar, Srikanthan 2013, ACM Trans. Des. Autom. Electron. Syst.

Modern embedded systems need to support multiple time-constrained multimedia applications that often employ multiprocessor-systems-on-chip (MPSoCs). Such systems need to be optimized for resource usage and energy consumption. It is well understood that a design-time approach cannot provide timing guarantees for all the applications due to its inability to cater for dynamism in applications. However, a runtime approach requires substantial computation at runtime and hence may not lend itself well to constraint-aware mapping. In this article, we present a hybrid approach for efficient mapping of applications in such systems. For each application to be supported in the system, the approach performs extensive design-space exploration (DSE) at design time to derive multiple design points representing throughput and energy consumption at different resource combinations. One of these points is selected efficiently at runtime, depending upon the desired throughput, while optimizing for energy consumption and resource usage. While most of the existing DSE strategies consider a fixed multiprocessor platform architecture, our DSE considers a generic architecture, making DSE results applicable to any target platform. All the compute-intensive analysis is performed during DSE, which leaves minimal computation for runtime. The approach is capable of handling dynamism in applications by considering their runtime aspects and providing timing guarantees. The presented approach is used to carry out a DSE case study for models of real-life multimedia applications: H.263 decoder, H.263 encoder, MPEG-4 decoder, JPEG decoder, sample rate converter, and MP3 decoder. At runtime, the design points are used to map the applications on a heterogeneous MPSoC. Experimental results reveal that the proposed approach provides faster DSE, better design points, and efficient runtime mapping when compared to other approaches. In particular, we show that DSE is faster by 83% and runtime mapping is accelerated by 93% for some cases. Further, we study the scalability of the approach by considering applications with large numbers of tasks.
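The runtime half of the hybrid approach, picking one precomputed design point that meets the desired throughput while optimizing energy and resource usage, can be sketched in a few lines of Python. This is a minimal illustration under assumptions, not the authors' implementation: the `DesignPoint` fields, the single `cores` resource count, and the tie-breaking rule are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class DesignPoint:
    """One design-time result (hypothetical fields, for illustration only)."""
    cores: int         # resources the mapping occupies
    throughput: float  # throughput achieved with those resources
    energy: float      # energy consumption of the mapping


def select_design_point(points: Sequence[DesignPoint],
                        required_throughput: float,
                        free_cores: int) -> Optional[DesignPoint]:
    """Pick the lowest-energy point that meets the throughput and fits the platform."""
    feasible = [p for p in points
                if p.throughput >= required_throughput and p.cores <= free_cores]
    if not feasible:
        return None  # no stored point satisfies the request
    # Assumed tie-break: prefer lower energy, then fewer cores.
    return min(feasible, key=lambda p: (p.energy, p.cores))


points = [DesignPoint(1, 10.0, 5.0),
          DesignPoint(2, 18.0, 8.0),
          DesignPoint(4, 30.0, 15.0)]
chosen = select_design_point(points, required_throughput=15.0, free_cores=4)
```

The selection is a single pass over a small table, which reflects the abstract's point that the compute-intensive analysis is done at design time and little work remains at runtime.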

Energy optimization by exploiting execution slacks in streaming applications on multiprocessor systems

Singh, Das, Kumar 2013

Dynamic voltage and frequency scaling (DVFS) offers great potential for optimizing the energy efficiency of Multiprocessor Systems-on-Chip (MPSoCs). The conventional approaches for processor voltage and frequency adjustment are not suitable for streaming multimedia applications, because the cyclic dependencies among the executing tasks mean that such adjustments can violate the throughput constraints. In this paper, we propose a methodology that applies DVFS to such cyclically dependent tasks. The methodology involves an off-line analysis that assumes worst-case execution times of tasks to identify the executions that can be slowed down, and an on-line analysis that utilizes the slack arising from tasks that finish before their worst-case execution times. Thus, the methodology minimizes energy consumption during both off-line and on-line analysis while satisfying the throughput constraints. Experiments based on models of real-life streaming multimedia applications show that the proposed methodology reduces the overall energy consumption by 43% when compared to existing approaches.
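The on-line part of this idea, reusing slack left by tasks that finish before their worst-case execution time to slow down later tasks, can be illustrated with a simplified Python sketch. It is not the paper's methodology: it assumes a linear chain of tasks rather than cyclic dependencies, assumes execution time scales inversely with frequency, and uses hypothetical function names and normalized frequency levels (1.0 is the maximum).

```python
def lowest_safe_frequency(wcet_at_fmax, time_budget, levels):
    """Return the lowest normalized frequency whose worst case still fits the budget."""
    for f in sorted(levels):
        # Assumption: execution time scales as 1 / frequency.
        if wcet_at_fmax / f <= time_budget:
            return f
    return None  # even the highest level cannot meet the budget


def run_with_slack_reclaiming(tasks, levels):
    """tasks: list of (wcet_at_fmax, actual_time_at_fmax) for a linear chain."""
    slack = 0.0
    chosen = []
    for wcet, actual in tasks:
        budget = wcet + slack  # own worst-case slot plus inherited slack
        f = lowest_safe_frequency(wcet, budget, levels)
        chosen.append(f)
        # Whatever the task did not use is passed on to the next task.
        slack = budget - actual / f
    return chosen


levels = [0.5, 0.75, 1.0]
# The first task finishes in half its worst-case time, freeing slack for the second.
frequencies = run_with_slack_reclaiming([(4.0, 2.0), (4.0, 4.0)], levels)
```

Here the first task must run at full speed because no slack exists yet, while the second can drop to a lower level and still finish within its extended budget in the worst case.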

Geochemistry of mafic dykes in part of Chotanagpur gneissic complex: Petrogenetic and tectonic implications

Kumar, Ahmad 2007, Geochem. J.

The Chotanagpur Gneissic Complex basement rocks of the Eastern Indian shield have been dissected by numerous mafic dykes, now occurring as amphibolitic dykes and gneissic amphibolites. These dykes are subalkaline, ranging in composition from basalt through basaltic-andesite to andesite. These rocks have enriched incompatible trace element patterns. They are particularly enriched in light rare earth elements (LREE) and large ion lithophile elements (LILE), with depleted high field strength element (HFSE; Nb, P, Ti) characteristics. A negative Sr anomaly is conspicuous. Nb/La and Nb/Ce ratios of the dykes are lower than those of the primitive mantle and closer to average crustal values. Incompatible trace element data suggest enriched source characteristics and the influence of crustal contamination in their genesis. Trace element ratios such as Gd/Yb of these dykes indicate at least two different sources. The dykes probably represent Precambrian continental rifting in this region.

Akash Kumar

Architectural-space exploration of approximate multipliers

Communication-aware heuristics for run-time task mapping on NoC-based MPSoC platforms

Shouji: a fast and efficient pre-alignment filter for sequence alignment
