Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021)
DOI: 10.18653/v1/2021.acl-long.241

Database reasoning over text

Abstract: Neural models have shown impressive performance gains in answering queries from natural language text. However, existing works are unable to support database queries, such as "List/Count all female athletes who were born in 20th century", which require reasoning over sets of relevant facts with operations such as join, filtering and aggregation. We show that while state-of-the-art transformer models perform very well for small databases, they exhibit limitations in processing noisy data, numerical operations, …
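To make the query class concrete: the example in the abstract is a join, a filter, and an aggregation over facts extracted from text. The following is a minimal Python sketch of that relational reading; the fact tuples, field names, and helper function are invented for illustration and are not from the paper, which answers such queries neurally rather than with explicit relational code.

```python
# Hypothetical facts, as they might be extracted from individual sentences.
occupations = [("Mary Lou Retton", "athlete"), ("Ada Lovelace", "mathematician")]
genders = [("Mary Lou Retton", "female"), ("Ada Lovelace", "female")]
birth_years = [("Mary Lou Retton", 1968), ("Ada Lovelace", 1815)]

def female_athletes_born_20th_century():
    """List all female athletes born in the 20th century (here 1901-2000)."""
    occ = dict(occupations)  # join the fact sets on the entity name
    gen = dict(genders)
    return [
        name
        for name, year in birth_years           # filter over the joined facts
        if occ.get(name) == "athlete"
        and gen.get(name) == "female"
        and 1901 <= year <= 2000
    ]

print(female_athletes_born_20th_century())       # "List"  -> ['Mary Lou Retton']
print(len(female_athletes_born_20th_century()))  # "Count" -> 1
```

The "List" and "Count" variants differ only in the final aggregation step, which is why the abstract groups them as a single query family.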

Cited by 16 publications (8 citation statements) · References 26 publications

Citation statements (ordered by relevance):
“…These models have been evaluated primarily via perplexity and non-NLP benchmarks (Tay et al., 2020b). These metrics may not be ideal (Sun et al., 2021) and may not reflect performance on complex NLP tasks (Arutiunian et al., 2020; Thorne et al., 2021). We argue these metrics have not been sufficient for the development of efficient Transformers and their practical application on long texts, and that existing benchmarks are insufficient guides for architecture selection.…”
Section: Introduction
confidence: 90%
“…Answering Database Queries There has been substantial effort put into converting queries expressed in natural language into SQL queries for databases with known structure [1, 16, 38], and there have also been advancements in adapting this approach to semistructured data and knowledge bases [4, 20]. Recently, Thorne et al. [28, 29] proposed NeuralDB as a way to perform database queries over a collection of textual documents without translating the data or the queries into a predefined database schema, using parallel neural techniques instead. Their approach is very effective, but it (i) requires preprocessing and analysis for the aggregation operator; (ii) is limited to simple queries; and (iii) handles data only in textual format.…”
Section: Related Work
confidence: 99%
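For contrast with the text-to-SQL line of work mentioned in the statement above, here is a rough sketch of what that approach commits to: a fixed, known schema onto which the natural-language query must be mapped. The schema, table, and data are invented for the example and are not taken from any of the cited systems; NeuralDB's point is precisely to avoid this predefined schema.

```python
# Illustrative only: a text-to-SQL system maps the query onto a fixed schema.
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE people (name TEXT, gender TEXT, occupation TEXT, born INTEGER)")
conn.executemany(
    "INSERT INTO people VALUES (?, ?, ?, ?)",
    [("Mary Lou Retton", "female", "athlete", 1968),
     ("Ada Lovelace", "female", "mathematician", 1815)],
)

# "List/Count all female athletes who were born in 20th century"
rows = conn.execute(
    """SELECT name FROM people
       WHERE gender = 'female' AND occupation = 'athlete'
         AND born BETWEEN 1901 AND 2000"""
).fetchall()
print([r[0] for r in rows])  # List  -> ['Mary Lou Retton']
print(len(rows))             # Count -> 1
```

Everything hinges on the `people` table existing in advance; when the facts live only in free text, there is no such table to query, which is the gap NeuralDB targets.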
“…In this perspective paper, we propose to study, design, and build MMNDBs by combining the capabilities of large multimodal models, multimedia information retrieval, and database query processing, as shown in Figure 1. We have been inspired by the work on neural databases [26, 28, 29], which has garnered interest in the NLP, database, and IR communities. However, we differentiate ourselves from that work by positioning our proposal as an evolution of the field of MMIR by means of modern, recently proposed multimodal AI technologies.…”
Section: Introduction
confidence: 99%
“…While the dataset is bilingual, it uses crowdsourced questions and is not designed for compositionality analysis. Recently, Thorne et al. (2021) proposed WIKINLDB, a Wikidata-based English KBQA dataset, focusing on scalability rather than compositionality. Other related datasets include QALM (Kaffee et al., 2019), a dataset for multilingual question answering over a set of popular knowledge graphs, intended to help determine the multilinguality of those knowledge graphs.…”
Section: Related Work
confidence: 99%