2021
DOI: 10.1609/aaai.v35i7.16780

A Deep Reinforcement Learning Approach to First-Order Logic Theorem Proving

Abstract: Automated theorem provers have traditionally relied on manually tuned heuristics to guide how they perform proof search. Deep reinforcement learning has been proposed as a way to obviate the need for such heuristics; however, its deployment in automated theorem proving remains a challenge. In this paper we introduce TRAIL, a system that applies deep reinforcement learning to saturation-based theorem proving. TRAIL leverages (a) a novel neural representation of the state of a theorem prover and (b) a novel char…

Cited by 11 publications (10 citation statements)
References 64 publications
“…Their results also include synthetic proof logs to improve performance, while we only trained with human proof logs. The current state-of-the-art for the HOLStep and MIZAR40 benchmarks remains the DAG-LSTM from (Crouse et al 2020). As this architecture cannot compute separate embeddings for goals and premises, which makes prediction computationally intractable for end-to-end systems (Wu et al 2021a; Paliwal et al 2020; Yang et al 2023)…”
Section: Results
confidence: 99%
“…The embedding model is critical, being used to encode ITP expressions for subsequent tactic, premise and goal selection. Current results either use graph-based approaches (Kaliszyk, Chollet, and Szegedy 2017; Paliwal et al 2020; Crouse et al 2020) or treat expressions as a sequence (Lample et al 2022; Polu et al 2023; Han et al 2022), with no thorough comparison between them across ITP systems. INT (Wu et al 2021b) provides the only comparison, in a synthetic proving environment, without directly isolating the embedding architecture.…”
Section: Introduction
confidence: 99%
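The graph-versus-sequence distinction drawn in the statement above can be made concrete with a toy sketch. This is not any of the cited architectures (DAG-LSTM, GNN, or transformer encoders); the names `seq_embed` and `tree_embed` and the hash-based "embeddings" are purely illustrative. The point it demonstrates is the one at stake in the quote: a flat sequence view that pools tokens loses argument order, while a tree/graph view preserves it.

```python
DIM = 8  # toy embedding dimension

def token_vec(tok):
    """Deterministic toy 'embedding' for a single symbol (polynomial hash)."""
    h = 0
    for ch in tok:
        h = (h * 131 + ord(ch)) & 0xFFFFFFFF
    return [(h >> (4 * i)) & 0xF for i in range(DIM)]

def seq_embed(tokens):
    """Sequence view: pool token vectors by averaging, ignoring structure."""
    vecs = [token_vec(t) for t in tokens]
    return [sum(v[i] for v in vecs) / len(vecs) for i in range(DIM)]

def tree_embed(term):
    """Tree/graph view: recursively combine a symbol with its arguments.
    Position-dependent mixing keeps argument order visible, so f(a, b)
    and f(b, a) can receive different embeddings."""
    sym, args = term  # term = (symbol, [subterms])
    vec = token_vec(sym)
    for depth, arg in enumerate(args, start=1):
        child = tree_embed(arg)
        vec = [v + child[i] / (depth + 1) for i, v in enumerate(vec)]
    return vec

f_ab = ("f", [("a", []), ("b", [])])
f_ba = ("f", [("b", []), ("a", [])])
# The flat sequence view collapses the two argument orders...
assert seq_embed(["f", "a", "b"]) == seq_embed(["f", "b", "a"])
# ...while the tree view distinguishes them.
assert tree_embed(f_ab) != tree_embed(f_ba)
```

Real embedding models replace the hash vectors with learned parameters and the averaging/mixing with trained networks, but the structural trade-off sketched here is the same one the comparison in the quote is concerned with.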
“…Secondly, automatic learning should be incorporated. Recently, reinforcement learning for FOL [78] has shown tremendous improvements, with systems learning to perform reasoning from scratch. For instance, in the case of reasoners that use inference-rule-based methods, instead of providing the inferences (output) beforehand and then mapping the input and output sequences using seq2seq translators, an RL agent can be provided with all the ontology axioms, along with the inference rules.…”
Section: Discussion
confidence: 99%
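The idea in the statement above — an agent given only the available inference rules and left to learn which choices pay off from reward, rather than being shown target inferences — can be sketched as a deliberately minimal reward-driven loop. This is not TRAIL or any cited system: the "environment" is a stand-in whose hidden reward table plays the role of a prover, and the rule names and payoffs are invented for illustration.

```python
import random

random.seed(0)

ACTIONS = ["resolve", "factor", "paramodulate"]   # candidate inference rules
TRUE_REWARD = {"resolve": 0.9, "factor": 0.2, "paramodulate": 0.4}  # hidden

def step(action):
    """Environment stand-in: noisy payoff for applying an inference rule."""
    return TRUE_REWARD[action] + random.uniform(-0.05, 0.05)

def train(episodes=500, eps=0.1):
    """Epsilon-greedy value learning: the agent is told only which rules
    exist, never which inference to draw, and learns from reward alone."""
    q = {a: 0.0 for a in ACTIONS}   # estimated value per rule
    n = {a: 0 for a in ACTIONS}     # visit counts
    for _ in range(episodes):
        if random.random() < eps:
            a = random.choice(ACTIONS)          # explore
        else:
            a = max(ACTIONS, key=lambda x: q[x])  # exploit
        r = step(a)
        n[a] += 1
        q[a] += (r - q[a]) / n[a]   # incremental mean update
    return q

q = train()
assert max(q, key=lambda a: q[a]) == "resolve"  # best rule learned from reward
```

Systems like the one cited replace this single fixed decision with a full proof-search state (the processed and unprocessed clauses), a neural state representation, and delayed reward on proof completion, but the supervision signal is the same: reward, not pre-labelled inferences.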
“…Moreover, the run-time variance of a theorem prover is very large: the system can at times solve some "large" problems while having difficulty with some "smaller" problems. Recent developments in the neuro-symbolic area use deep-learning techniques to enhance standard theorem provers (e.g., see Crouse et al 8 ). We are still at the early stages of this research and there is still a lot that can be done.…”
Section: System Limitations and Future Improvements
confidence: 99%
“…Moreover, deriving models from a logical theory using formal reasoning tools is especially difficult when arithmetic and calculus operators are involved (e.g., see the work of Grigoryev et al 7 for the case of inequalities). Machine-learning techniques have been used to improve the performance of ATPs, for example, by using reinforcement learning to guide the search process 8 . This research area has received much attention recently 9–11 .…”
confidence: 99%