Pawel Guzewicz scite author profile

Pawel Guzewicz

3 Publications

33 Citation Statements Received

143 Citation Statements Given

Affiliations

Institut Polytechnique de Paris, École Polytechnique, Computer Science Laboratory of the École Polytechnique

Publications

RDF graph summarization for first-sight structure discovery

2020

To help users get familiar with large RDF graphs, RDF summarization techniques can be used. In this work, we study quotient summaries of RDF graphs, that is: graph summaries derived from a notion of equivalence among RDF graph nodes. We make the following contributions: (i) four novel summaries which are often small and easy-to-comprehend, in the style of E-R diagrams; (ii) efficient (amortized linear-time) algorithms for computing these summaries either from scratch, or incrementally, reflecting additions to the graph; (iii) the first formal study of the interplay between RDF graph saturation in the presence of an RDFS ontology, and summarization; we provide a sufficient condition for a highly efficient shortcut method to build the quotient summary of a graph without saturating it; (iv) formal results establishing the shortcut conditions for some of our summaries and others from the literature; (v) experimental validations of our claim within a tool available online.
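
The quotient idea named in the abstract above can be illustrated with a minimal Python sketch. It assumes an RDF graph given as (subject, property, object) tuples and uses a hypothetical equivalence relation (two nodes are equivalent when they have the same set of outgoing properties). This relation and the function names are chosen for illustration only and are not claimed to be one of the four summaries the paper introduces.

```python
from collections import defaultdict

def outgoing_properties(triples):
    """Hypothetical equivalence key: the set of a node's outgoing properties."""
    out = defaultdict(set)
    for s, p, _ in triples:
        out[s].add(p)
    # Nodes with no outgoing edge all map to the empty set.
    return lambda node: frozenset(out.get(node, ()))

def quotient_summary(triples, key):
    """Build a quotient graph: one summary node per equivalence class."""
    # Every edge s -p-> o becomes an edge between the classes of s and o;
    # using a set collapses parallel edges between the same classes.
    return {(key(s), p, key(o)) for s, p, o in triples}

# Toy data (assumed, not taken from the paper).
triples = [
    ("alice", "worksAt", "inria"),
    ("bob", "worksAt", "polytechnique"),
    ("alice", "name", "Alice"),
    ("bob", "name", "Bob"),
]
key = outgoing_properties(triples)
summary = quotient_summary(triples, key)
for s, p, o in sorted(summary, key=lambda edge: edge[1]):
    print(sorted(s), p, sorted(o))
```

In this toy graph, alice and bob fall into one class and the four remaining nodes into another, so four data edges collapse into two summary edges.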

Efficient Exploration of Interesting Aggregates in RDF Graphs

Diao, Guzewicz, Manolescu et al.

2021

As large Open Data are increasingly shared as RDF graphs today, there is a growing demand to help users discover the most interesting facets of a graph, which are often hard to grasp without automatic tools. We consider the problem of automatically identifying the k most interesting aggregate queries that can be evaluated on an RDF graph, given an integer k and a user-specified interestingness function. Our problem departs from analytics in relational data warehouses in that (i) in an RDF graph we are not given but we must identify the facts, dimensions, and measures of candidate aggregates; (ii) the classical approach to efficiently evaluating multiple aggregates breaks in the face of multi-valued dimensions in RDF data. In this work, we propose an extensible end-to-end framework that enables the identification and evaluation of interesting aggregates based on a new RDF-compatible one-pass algorithm for efficiently evaluating a lattice of aggregates and a novel early-stop technique (with probabilistic guarantees) that can prune uninteresting aggregates. Experiments using both real and synthetic graphs demonstrate the ability of our framework to find interesting aggregates in a large search space, the efficiency of our algorithms (with up to 2.9× speedup over a similar pipeline based on existing algorithms), and scalability as the data size and complexity grow.

CCS Concepts: Information systems → Database management system engines; Graph-based database models.

Quotient RDF Summaries Based on Type Hierarchies

Guzewicz, Manolescu

2018

Summarization has been applied to RDF graphs to obtain a compact representation thereof, easier to grasp by human users. We present a new brand of quotient-based RDF graph summaries, whose main novelty is to summarize together RDF nodes belonging to the same type hierarchy. We argue that such summaries bring more useful information to users about the structure and semantics of an RDF graph.
