Abstract:Modern data center solid state drives (SSDs) integrate multiple general-purpose embedded cores to manage ash translation layer, garbage collection, wear-leveling, and etc., to improve the performance and the reliability of SSDs. As the performance of these cores steadily improves there are opportunities to repurpose these cores to perform application driven computations on stored data, with the aim of reducing the communication between the host processor and the SSD. Reducing host-SSD bandwidth demand cuts dow… Show more
“…Second, it reduces the overall performance and energy burden of the application from the rest of the system, freeing them up to do other useful work meanwhile. Third, as shown by many prior works (e.g., [88][89][90][91]), ISP can benefit from the SSD's larger internal bandwidth. For example, with 8 ( 16) channels for SSD-C (SSD-P) and the maximum per-channel bandwidth of 1.2 GB/s, the maximum internal bandwidth is calculated to be 9.6 GB/s (19.2 GB/s).…”
Section: Our Goalmentioning
confidence: 95%
“…In-Storage Processing. Several works propose ISP designs as accelerators for different applications [88, (e.g., in machine learning [147,150,151], pattern processing and read mapping [91,149], and graph analytics [141]), generalpurpose [145,146,[152][153][154][155][156][157][158][159][160][161][162][163], bulk-bitwise operations using flash memory [164,165], in close integration with FPGAs [90,[166][167][168][169][170], or GPUs [171]. None of these works perform metagenomic analysis nor address the challenges of ISP for metagenomics.…”
of read mapping processes of reads with different properties and degrees of genetic variation, we meticulously design low-cost hardware accelerators and data/computation flows inside a NAND flashbased solid-state drive (SSD). Our evaluation using a wide range of real genomic datasets shows that GenStore, when implemented in three modern NAND flash-based SSDs, significantly improves the read mapping performance of state-of-the-art software (hardware) baselines by 2.07-6.05× (1.52-3.32×) for read sets with high similarity to the reference genome and 1.45-33.63× (2.70-19.2×) for read sets with low similarity to the reference genome.
CCS CONCEPTS• Computer systems organization → Special purpose systems;• Hardware → External storage.
“…Second, it reduces the overall performance and energy burden of the application from the rest of the system, freeing them up to do other useful work meanwhile. Third, as shown by many prior works (e.g., [88][89][90][91]), ISP can benefit from the SSD's larger internal bandwidth. For example, with 8 ( 16) channels for SSD-C (SSD-P) and the maximum per-channel bandwidth of 1.2 GB/s, the maximum internal bandwidth is calculated to be 9.6 GB/s (19.2 GB/s).…”
Section: Our Goalmentioning
confidence: 95%
“…In-Storage Processing. Several works propose ISP designs as accelerators for different applications [88, (e.g., in machine learning [147,150,151], pattern processing and read mapping [91,149], and graph analytics [141]), generalpurpose [145,146,[152][153][154][155][156][157][158][159][160][161][162][163], bulk-bitwise operations using flash memory [164,165], in close integration with FPGAs [90,[166][167][168][169][170], or GPUs [171]. None of these works perform metagenomic analysis nor address the challenges of ISP for metagenomics.…”
of read mapping processes of reads with different properties and degrees of genetic variation, we meticulously design low-cost hardware accelerators and data/computation flows inside a NAND flashbased solid-state drive (SSD). Our evaluation using a wide range of real genomic datasets shows that GenStore, when implemented in three modern NAND flash-based SSDs, significantly improves the read mapping performance of state-of-the-art software (hardware) baselines by 2.07-6.05× (1.52-3.32×) for read sets with high similarity to the reference genome and 1.45-33.63× (2.70-19.2×) for read sets with low similarity to the reference genome.
CCS CONCEPTS• Computer systems organization → Special purpose systems;• Hardware → External storage.
“…Works that show the general applicability of NDP exist, prominent examples include INSIDER [30], Biscuit [19], Summarizer [22], iSSD [13], and Willow [32]. These works demonstrated that it is possible to reap the benefits of NDP in SSDs by leveraging tailor-made, user-space programming framework.…”
Section: Background and Related Workmentioning
confidence: 99%
“…Although near data processing (NDP) in-storage devices has been proposed in the context of hard disks drives (HDDs) [7] and databases [20] a long time ago, it become technologically and economically feasible only recently with the introduction of solid state drives (SSDs) [11,16,17,19,22,28,30,32,37]. SSDs store data on NAND flash memory media, which have no mechanical parts compared to traditional rotational HDDs.…”
“…The advent of the information era has led to the explosive growth of data, and thus poses a tremendous challenge to the conventional computing paradigm based on von Neumann architecture [1][2][3][4][5]. The continual data shuttling between the memory and CPU dramatically hinder the improvement of speed and energy efficiency which is referred to as the von Neumann bottleneck [6][7][8][9].…”
In-memory computing is highly expected to break the von Neumann bottleneck and memory wall. Memristor with inherent nonvolatile property is considered to be a strong candidate to execute this new computing paradigm. In this work, we have presented a reconfigurable nonvolatile logic method based on one-transistor-two-memristor device structure, inhibiting the sneak path in the large-scale crossbar array. By merely adjusting the applied voltage signals, all 16 binary Boolean logic functions can be achieved in a single cell. More complex computing tasks including one-bit parallel full adder and set–reset latch have also been realized with optimization, showing simple operation process, high flexibility, and low computational complexity. The circuit verification based on cadence PSpice simulation is also provided, proving the feasibility of the proposed design. The work in this paper is intended to make progress in constructing architectures for in-memory computing paradigm.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.