2020
DOI: 10.1609/aaai.v34i05.6246

Zero-Shot Text-to-SQL Learning with Auxiliary Task

Abstract: Recent years have seen great success in the use of neural seq2seq models on the text-to-SQL task. However, little work has paid attention to how these models generalize to realistic unseen data, which naturally raises a question: does this impressive performance signify a perfect generalization model, or are there still some limitations? In this paper, we first diagnose the bottleneck of the text-to-SQL task by providing a new testbed, in which we observe that existing models present poor generalization ability…

Cited by 19 publications (16 citation statements) | References 13 publications
“…To address this issue, one line of research is to augment existing datasets with automatically generated data (Su and Yan, 2017; Jia and Liang, 2016; Cai and Yates, 2013). Another line of research is to exploit available resources, such as knowledge bases (Krishnamurthy et al., 2017; Herzig and Berant, 2018; Chang et al., 2019; Lee, 2019; Zhang et al., 2019a; Guo et al., 2019; Wang et al., 2019), semantic features in different domains (Dadashkarimi et al., 2018; Li et al., 2020), or unlabeled data (Kočiskỳ et al., 2016; Sun et al., 2019). Those works are orthogonal to our setting because our approach aims to efficiently exploit a handful of labeled data of new predicates, which are not limited to the ones in knowledge bases.…”
Section: Related Work
confidence: 99%
“…In this case, information from both the NL question and the table schema is encoded into a hidden representation by the encoder. Some of those works encode the question with each column name separately (Xu et al., 2017; Yu et al., 2018; Hwang et al., 2019), while others encode the concatenation of the question with the column names (Zhong et al., 2017; Dong and Lapata, 2018; Chang et al., 2020; Hwang et al., 2019; He et al., 2019).…”
Section: Related Work
confidence: 99%
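The two input-construction strategies described above can be sketched in a few lines. This is my own minimal illustration, not code from any of the cited papers: the `[SEP]` delimiter and the function names are assumptions, standing in for whatever separator and serialization a given model's tokenizer actually uses.

```python
def encode_per_column(question: str, columns: list[str]) -> list[str]:
    # One encoder input per column: the question is paired with each
    # column name separately (the per-column strategy).
    return [f"{question} [SEP] {col}" for col in columns]

def encode_concatenated(question: str, columns: list[str]) -> str:
    # A single encoder input: the question concatenated with all
    # column names at once (the concatenation strategy).
    return f"{question} [SEP] " + " [SEP] ".join(columns)

question = "How many players are older than 30?"
columns = ["player", "age", "team"]

per_column = encode_per_column(question, columns)      # 3 inputs
concatenated = encode_concatenated(question, columns)  # 1 input
```

The per-column form yields one encoder pass per column, which suits models that score each column independently; the concatenated form gives the encoder a joint view of the question and the full schema in a single pass.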
“…Text-to-SQL, as a semantic parsing task, has attracted increasing interest, and multiple large-scale datasets have been released. Zhong et al. (2017) created a large single-table text-to-SQL dataset, WikiSQL, from Wikipedia entries, upon which many semantic parsers have been trained, achieving high accuracies surpassing 80% (Chang et al., 2020; Lyu et al., 2020; Hwang et al., 2019; He et al., 2019). Yu et al. (2018) proposed SPIDER, another large-scale text-to-SQL dataset with multi-table databases, much wider grammar coverage, and more complex queries.…”
Section: Related Work
confidence: 99%