Abstract: The explosion of graph data in social and biological networks, recommendation systems, provenance databases, etc. makes graph storage and processing of paramount importance. We present a performance introspection framework for graph databases, PIG, which provides both a toolset and methodology for understanding graph database performance. PIG consists of a hierarchical collection of benchmarks that compose to produce performance models; the models provide a way to illuminate the strengths and weaknesses of a p…
“…For evaluating the proposed analysis technique, an application was implemented to getting the terms used in class name following the CamelCase coding style (predominant style due to its ease of writing and adoption [7,13]), storing terms in a Neo4j database (standard graph database in the industry [26]). The application was executed on twenty projects of the organizations Apache and Eclipse (see table 1).…”
“…A different benchmark comparison between Neo4j and DEX is reported in [28], but they mainly use micro operations like "get vertex" or "get edge" instead of more complex queries. They found that Neo4j scales very well for in-memory graphs, which is the case in our benchmark, but significantly loses performance when reading from disk and especially writing due to guaranteed ACID transactions.…”
“…In contrast, the nonnative graph storage, rely on a mature non-graph backend whose production characteristics are well comprehended by operations teams. Native graph processing (index-free adjacency) benefits traversal [Marek et al 2012, Macko et al 2013] performance, however at the expense of making some non-traversal queries difficult or memory intensive. …”