Wolf Siberski scite author profile

Abstract. Ranking is an important concept to avoid empty or overfull and unordered result sets. However, such scoring can only express total orders, which restricts its usefulness when several factors influence result relevance. A more flexible way to express relevance is the notion of preferences. Users state which kind of answers they 'prefer' by adding soft constraints to their queries. Current approaches in the Semantic Web offer only limited facilities for specification of scoring and result ordering. There is no common language element to express and formalize ranking and preferences. We present a comprehensive extension of SPARQL which directly supports the expression of preferences. This includes formal syntax and semantics of preference expressions for SPARQL. Additionally, we report our implementation of preference query processing, which is based on the ARQ query engine.
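To illustrate why preferences can express more than a single score, the following is a minimal Python sketch of Pareto-style preference filtering, one common way to combine several soft constraints into a partial order. It is not the SPARQL syntax or the ARQ-based implementation described in the abstract; the sample data, the field names `price` and `rating`, and the functions `dominates` and `skyline` are assumptions made up for this illustration.

```python
from typing import Dict, List

Offer = Dict[str, float]


def dominates(a: Offer, b: Offer) -> bool:
    """Return True if `a` is preferred over `b`.

    Assumed preferences (hypothetical): lower price is better,
    higher rating is better. `a` dominates `b` when it is at least
    as good in both dimensions and strictly better in one of them.
    """
    at_least_as_good = a["price"] <= b["price"] and a["rating"] >= b["rating"]
    strictly_better = a["price"] < b["price"] or a["rating"] > b["rating"]
    return at_least_as_good and strictly_better


def skyline(items: List[Offer]) -> List[Offer]:
    """Keep every item that no other item dominates."""
    return [x for x in items if not any(dominates(y, x) for y in items)]


# Hypothetical result set of a query.
offers = [
    {"name": "A", "price": 100, "rating": 4.5},
    {"name": "B", "price": 80, "rating": 4.0},
    {"name": "C", "price": 120, "rating": 4.0},
    {"name": "D", "price": 80, "rating": 3.5},
]

best = skyline(offers)
best_names = [o["name"] for o in best]
```

In this sample, A and B are incomparable (A has the better rating, B the lower price), so both remain in the result. A single numeric score would have to rank one above the other, which is the limitation of total orders that the abstract points out.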


Semantic Overlay Clusters within Super-Peer Networks

Löser

Naumann

Siberski

et al. 2004


Cardinality estimation and dynamic length adaptation for Bloom filters

Papapetrou

Siberski

Nejdl

2010

Distrib Parallel Databases


Bloom filters are extensively used in distributed applications, especially in distributed databases and distributed information systems, to reduce network requirements and to increase performance. In this work, we propose two novel Bloom filter features that are important for distributed databases and information systems. First, we present a new approach to encode a Bloom filter such that its length can be adapted to the cardinality of the set it represents, with negligible overhead with respect to computation and false positive probability. The proposed encoding allows for significant network savings in distributed databases, as it enables the participating nodes to optimize the length of each Bloom filter before sending it over the network, for example, when executing Bloom joins. Second, we show how to estimate the number of distinct elements in a Bloom filter, for situations where the represented set is not materialized. These situations frequently arise in distributed databases, where estimating the cardinality of the represented sets is necessary for constructing an efficient query plan. The estimation is highly accurate and comes with tight probabilistic bounds. For both features we provide a thorough probabilistic analysis and extensive experimental evaluation which confirm the effectiveness of our approaches.

Note: This is a preprint. The final version is available at http://www.springerlink.com/
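The two ideas in this abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the encoding or the estimator from the paper: the cardinality estimate uses the widely known formula n ≈ -(m/k) · ln(1 - X/m), where X is the number of set bits, and the length reduction uses the simple folding trick of OR-ing the two halves of the bit array. The class `BloomFilter`, its method names, and the salted SHA-256 hashing scheme are all hypothetical choices for this sketch.

```python
import hashlib
import math
from typing import Iterator, List


class BloomFilter:
    def __init__(self, m: int, k: int) -> None:
        self.m = m                      # number of bits
        self.k = k                      # number of hash functions
        self.bits: List[bool] = [False] * m

    def _positions(self, item: str) -> Iterator[int]:
        # Assumed hashing scheme: k salted SHA-256 digests, reduced mod m.
        for i in range(self.k):
            digest = hashlib.sha256(f"{i}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.m

    def add(self, item: str) -> None:
        for pos in self._positions(item):
            self.bits[pos] = True

    def might_contain(self, item: str) -> bool:
        return all(self.bits[pos] for pos in self._positions(item))

    def estimate_cardinality(self) -> float:
        """Estimate the number of distinct inserted items from the set bits."""
        set_bits = sum(self.bits)
        if set_bits == self.m:
            return float("inf")         # filter is saturated
        return -(self.m / self.k) * math.log(1 - set_bits / self.m)

    def folded(self) -> "BloomFilter":
        """Return a filter of half the length by OR-ing both halves.

        Because positions are computed as hash mod m, and m/2 divides m,
        the result equals a filter of length m/2 built from the same items.
        """
        if self.m % 2 != 0:
            raise ValueError("length must be even to fold")
        half = self.m // 2
        small = BloomFilter(half, self.k)
        small.bits = [self.bits[i] or self.bits[i + half] for i in range(half)]
        return small


# Usage: 1000 distinct items in a 16384-bit filter with 4 hash functions.
bf = BloomFilter(16384, 4)
for n in range(1000):
    bf.add(f"item-{n}")

estimate = bf.estimate_cardinality()    # close to 1000, without storing the set
smaller = bf.folded()                   # 8192 bits, cheaper to send over the network
```

Folding trades a higher false positive rate for a shorter filter, which is the kind of trade-off a node could make before shipping a filter for a Bloom join. The estimate needs only the bit array, so it also works when the represented set is not materialized.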


Incremental diversification for very large sets

Minack

Siberski

Nejdl

2011


From keywords to semantic queries—Incremental query construction on the semantic web

Zenz

Zhou

Minack

et al. 2009

Journal of Web Semantics

115


Super-peer-based routing and clustering strategies for RDF-based peer-to-peer networks

Nejdl

Wolpers

Siberski

et al. 2003

216


RDF-based P2P networks have a number of advantages compared with simpler P2P networks such as Napster, Gnutella or with approaches based on distributed indices such as CAN and CHORD. RDF-based P2P networks allow complex and extendable descriptions of resources instead of fixed and limited ones, and they provide complex query facilities against these metadata instead of simple keyword-based searches.

In previous papers, we have described the Edutella infrastructure and different kinds of Edutella peers implementing such an RDF-based P2P network. In this paper we will discuss these RDF-based P2P networks as a specific example of a new type of P2P networks, schema-based P2P networks, and describe the use of super-peer based topologies for these networks. Super-peer based networks can provide better scalability than broadcast based networks, and do provide perfect support for inhomogeneous schema-based networks, which support different metadata schemas and ontologies (crucial for the Semantic Web). Furthermore, as we will show in this paper, they are able to support sophisticated routing and clustering strategies based on the metadata schemas, attributes and ontologies used. Especially helpful in this context is the RDF functionality to uniquely identify schemas, attributes and ontologies. The resulting routing indices can be built using dynamic frequency counting algorithms and support local mediation and transformation rules, and we will sketch some first ideas for implementing these advanced functionalities as well.
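As a rough illustration of schema-based routing at a super-peer, the sketch below keeps an index from schema identifiers to the peers that registered them and forwards a query only to peers that cover every schema it uses. This is an assumption-laden toy, not the Edutella implementation: the class `SuperPeer`, its methods `register` and `route`, the peer names, and the `http://example.org/lom#` identifier are hypothetical, and the rule "a peer must support all schemas in the query" is one possible routing policy among several.

```python
from collections import defaultdict
from typing import Dict, Iterable, Set


class SuperPeer:
    """Toy super-peer holding a schema-level routing index."""

    def __init__(self) -> None:
        # schema URI -> ids of peers that announced support for it
        self.schema_index: Dict[str, Set[str]] = defaultdict(set)

    def register(self, peer_id: str, schemas: Iterable[str]) -> None:
        """Record which schemas a connected peer can answer queries about."""
        for schema in schemas:
            self.schema_index[schema].add(peer_id)

    def route(self, query_schemas: Iterable[str]) -> Set[str]:
        """Return the peers that support every schema used in the query."""
        candidate_sets = [self.schema_index.get(s, set()) for s in query_schemas]
        if not candidate_sets:
            return set()
        return set.intersection(*candidate_sets)


DC = "http://purl.org/dc/elements/1.1/"   # Dublin Core element set namespace
LOM = "http://example.org/lom#"           # hypothetical schema identifier

super_peer = SuperPeer()
super_peer.register("peer1", [DC, LOM])
super_peer.register("peer2", [DC])
super_peer.register("peer3", [LOM])

dc_only = super_peer.route([DC])          # peers that can answer DC queries
dc_and_lom = super_peer.route([DC, LOM])  # only peers supporting both schemas
```

Because schemas are identified by unique URIs, the index lookup is a plain dictionary access, and a query is never broadcast to peers that cannot answer it.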



Wolf Siberski

Progressive Distributed Top-k Retrieval in Peer-to-Peer Networks


Querying the Semantic Web with Preferences
