Effect of phenytoin on interleukin‐1β production in human gingival fibroblasts challenged to tumor necrosis factor α<i>in vitro</i>

Denial constraints (DCs) are a generalization of many other integrity constraints (ICs) widely used in databases, such as key constraints, functional dependencies, or order dependencies. Therefore, they can serve as a unified reasoning framework for all of these ICs and express business rules that cannot be expressed by the more restrictive IC types. The process of formulating DCs by hand is difficult, because it requires not only domain expertise but also database knowledge, and due to DCs' inherent complexity, this process is tedious and error-prone. Hence, an automatic DC discovery is highly desirable: we search for all valid denial constraints in a given database instance. However, due to the large search space, the problem of DC discovery is computationally expensive. We propose a new algorithm Hydra, which overcomes the quadratic runtime complexity in the number of tuples of state-of-the-art DC discovery methods. The new algorithm's experimentally determined runtime grows only linearly in the number of tuples. This results in a speedup by orders of magnitude, especially for datasets with a large number of tuples. Hydra can deliver results in a matter of seconds that to date took hours to compute.

show abstract

Approximate Discovery of Functional Dependencies for Large Datasets

Bleifuß

Bülow

Frohnhofen

et al. 2016

View full text Add to dashboard Cite

Data Change Exploration Using Time Series Clustering

et al. 2018

View full text Add to dashboard Cite

Natural Key Discovery in Wikipedia Tables

Bornemann

Bleifuß

Kalashnikov

et al. 2020

View full text Add to dashboard Cite

Enabling Change Exploration

Bleifuß

Johnson

Kalashnikov

et al. 2017

View full text Add to dashboard Cite

Exploring and Analyzing Change

Srivastava

Bleifuß

Bornemann

et al. 2022

View full text Add to dashboard Cite

Matching Roles from Temporal Data: Why Joe Biden is not only President, but also Commander-in-Chief

Bornemann

Bleifuß

Kalashnikov³

et al. 2023

Proc. ACM Manag. Data

View full text Add to dashboard Cite

We present role matching, a novel, fine-grained integrity constraint on temporal fact data, i.e., (subject, predicate, object, timestamp)-quadruples. A role is a combination of subject and predicate and can be associated with different objects as the real world evolves and the data changes over time. A role matching states that the associated object of two or more roles should always match across time. Once discovered, role matchings can serve as integrity constraints to improve data quality, for instance of structured data in Wikipedia[3]. If violated, role matchings can alert data owners or editors and thus allow them to correct the error. Finding all role matchings is challenging due both to the inherent quadratic complexity of the matching problem and the need to identify true matches based on the possibly short history of the facts observed so far. To address the first challenge, we introduce several blocking methods both for clean and dirty input data. For the second challenge, the matching stage, we show how the entity resolution method Ditto[27] can be adapted to achieve satisfactory performance for the role matching task. We evaluate our method on datasets from Wikipedia infoboxes, showing that our blocking approaches can achieve 95% recall, while maintaining a reduction ratio of more than 99.99%, even in the presence of dirty data. In the matching stage, we achieve a macro F1-score of 89% on our datasets, using automatically generated labels.

show abstract

Inclusion Dependency Discovery

Dürsch

Stebner

Windheuser

et al. 2019

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Tobias Bleifuß

Efficient denial constraint discovery with hydra

Approximate Discovery of Functional Dependencies for Large Datasets

Data Change Exploration Using Time Series Clustering

Natural Key Discovery in Wikipedia Tables

Enabling Change Exploration

Exploring and Analyzing Change

Matching Roles from Temporal Data: Why Joe Biden is not only President, but also Commander-in-Chief

Inclusion Dependency Discovery

Contact Info

Product

Resources

About