2019
DOI: 10.1186/s40537-019-0265-5
|View full text |Cite
|
Sign up to set email alerts
|

Computational storage: an efficient and scalable platform for big data and HPC applications

Abstract: In the era of big data applications, the demand for more sophisticated data centers and high-performance data processing mechanisms is increasing drastically. Data are originally stored in storage systems. To process data, application servers need to fetch them from storage devices, which imposes the cost of moving data to the system. This cost has a direct relation with the distance of processing engines from the data. This is the key motivation for the emergence of distributed processing platforms such as Ha… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
4
1

Relationship

4
6

Authors

Journals

citations
Cited by 26 publications
(7 citation statements)
references
References 32 publications
(30 reference statements)
0
6
0
Order By: Relevance
“…It presents several advantages, such as scalability, flexibility, cost-effectiveness, organized architecture, and resilience to failure. Recently, several Hadoop-based platforms have been proposed as efficient and flexible solutions for computational storage with the strategy of processing data close to where they reside [53][54][55]. Among them, we cite the lineage-aware data management (LDM) that exploits the data locality to decrease the network footprint [56,57].…”
Section: Mapreduce Model Of Computationmentioning
confidence: 99%
“…It presents several advantages, such as scalability, flexibility, cost-effectiveness, organized architecture, and resilience to failure. Recently, several Hadoop-based platforms have been proposed as efficient and flexible solutions for computational storage with the strategy of processing data close to where they reside [53][54][55]. Among them, we cite the lineage-aware data management (LDM) that exploits the data locality to decrease the network footprint [56,57].…”
Section: Mapreduce Model Of Computationmentioning
confidence: 99%
“…One unique feature with our CSD is that the CBDD supports file-system access by the ISP applications and the host. By the embedded Linux to mount partitions on the storage and present a file API, this makes it not only easier to port applications but also more efficient to access the data [18], [19]. The host CPU and ISP units can also communicate via the same partition, which is mounted by both the host and the ISP.…”
Section: B System Software On the Csdmentioning
confidence: 99%
“…While the first approach is powerand cost-efficient, it provides limited processing power, and also it can negatively impact the performance of the SSD controller. In the previous works focusing on the second approach, researchers have investigated different types of dedicated processing engines such as FPGAs [14] and embedded processors [23,24]. Although FPGAs are power efficient, they have several limitations for being used in ISPs, including the challenging step of implementing an RTL design of the tasks [18], the time-consuming step of reconfiguration of FPGAs for running different tasks, and the lack of supporting file systems to allow tasks to deal with the concept of file when accessing the storage.…”
Section: In-storage Processingmentioning
confidence: 99%