2009
DOI: 10.14778/1687553.1687625

Column-oriented database systems

Abstract: Column-oriented database systems (column-stores) have attracted a lot of attention in the past few years. Column-stores, in a nutshell, store each database table column separately, with attribute values belonging to the same column stored contiguously, compressed, and densely packed, as opposed to traditional database systems that store entire records (rows) one after the other. Reading a subset of a table's columns becomes faster, at the potential expense of excessive disk-head seeking from column to column f…
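
The layout difference the abstract describes can be sketched in a few lines of Python. This is an illustration only, not code from the paper: the table, its column names (`id`, `name`, `age`), and the helper functions are all hypothetical, and in-memory lists stand in for on-disk storage.

```python
# Hypothetical three-column table; names and values are illustrative only.
rows = [
    (1, "alice", 30),
    (2, "bob", 25),
    (3, "carol", 41),
]

# Row-oriented layout: entire records stored one after the other.
row_store = [value for record in rows for value in record]

# Column-oriented layout: each column's values stored contiguously.
column_store = {
    "id": [r[0] for r in rows],
    "name": [r[1] for r in rows],
    "age": [r[2] for r in rows],
}


def sum_age_row_store(store, n_columns=3, age_index=2):
    # Strides across every record, stepping over the attributes it does not need.
    return sum(store[i] for i in range(age_index, len(store), n_columns))


def sum_age_column_store(store):
    # Reads only the single column the query asks for.
    return sum(store["age"])
```

Both functions compute the same sum; the column layout simply reads one contiguous list, whereas the row layout has to step over the unused attributes of every record.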

Cited by 173 publications (81 citation statements)
References 6 publications
“…Locality is important for both disk-based and in-memory scans. Specifically, adding scans to the feature list creates the following locality conflicts: Scan vs. Get/Put Most analytical systems use a columnar storage layout to increase locality [2]. KVS, in contrast, typically favor a row-oriented layout in order to process get/put requests without the need to materialize records [4].…”
Section: Why Is It Difficult? (mentioning)
confidence: 99%
“…Column-major vertically partitions the data and stores a whole column of a table (or set of records) as a contiguous sequence of bytes. Such a column-major layout is beneficial for scans as analytical queries often involve only a subset of the columns [25,3,2,45,9,8]. In addition, column-major supports vector operations (SIMD) to further speed up bulk operations and scans on modern hardware [47,49].…”
Section: How To Arrange Records? (mentioning)
confidence: 99%
“…To improve read efficiency, columnar values are typically densely packed, and using light-weight compression scheme. Especially, column-oriented database facilitates vast improvements in compression, which can result in an additional reduction in storage access while maintaining high performance [4].…”
Section: Motivation (mentioning)
confidence: 99%
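
The statement above does not name a specific compression scheme. As one possible illustration, run-length encoding is a common lightweight scheme that benefits from a column's values being stored together. The sketch below is an assumption-laden example, not the cited authors' method; `rle_encode` and `rle_decode` are hypothetical helper names.

```python
from itertools import groupby


def rle_encode(column):
    # Collapse each run of equal adjacent values into a (value, run_length) pair.
    return [(value, len(list(run))) for value, run in groupby(column)]


def rle_decode(pairs):
    # Expand each (value, run_length) pair back into the original sequence.
    return [value for value, count in pairs for _ in range(count)]
```

For a hypothetical column such as `["DE", "DE", "DE", "FR", "FR", "US"]`, encoding stores three pairs instead of six values, and decoding restores the original column.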
“…As regards the proper data storage solution, Hadoop's ecosystem incorporates HBase [29]. HBase is a distributed column-oriented database (e.g., [30]), which also falls in the category of NoSQL storage systems (e.g., [31]), that founds on the underlying Hadoop Distributed File System (HDFS) (e.g., [32]) for common data processing routines. HBase has been proved to be highly scalable.…”
Section: Hadoop: An Overview (mentioning)
confidence: 99%