Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data 2020
DOI: 10.1145/3318464.3389776
|View full text |Cite
|
Sign up to set email alerts
|

Duoquest: A Dual-Specification System for Expressive SQL Queries

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 13 publications
(12 citation statements)
references
References 19 publications
0
12
0
Order By: Relevance
“…Zero noise means we sample values from the ground truth columns. In Medium noise we sample 2 3 values from the ground truth columns and 1 3 from a noise column, which is a column with a Jaccard Containment of more than 0.8 with respect to the ground truth column. Finally, in High noise we sample 1 3 values from the ground truth column and 2 3 from the noise columns.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…Zero noise means we sample values from the ground truth columns. In Medium noise we sample 2 3 values from the ground truth columns and 1 3 from a noise column, which is a column with a Jaccard Containment of more than 0.8 with respect to the ground truth column. Finally, in High noise we sample 1 3 values from the ground truth column and 2 3 from the noise columns.…”
Section: Methodsmentioning
confidence: 99%
“…Query-By-Example systems [2,17,39,40,42] and Query-Reverse-Engineering systems [13,31,37,45] are designed to perform on a database with well-defined path information and integrity constraint as indicated in the Supports Pathless column of Table 1. They explore join paths between tables through looking at the primary key/foreign key relationship and all join paths are ensured to be correct.…”
Section: Challenges Of Pathless Table Collectionsmentioning
confidence: 99%
See 1 more Smart Citation
“…We note that our system's input is comprised entirely of weakly supervised data that can be procured without the use of expert annotators. As described in §2, question-answer annotations can be provided by non-experts, unfamiliar with SQL [6,36,61,68]. As for QDMR instances, they can also be crowdsourced to non-experts [63] or automatically generated using a trained ML model [22,49].…”
Section: System Overviewmentioning
confidence: 99%
“…The NL-to-SQL model is then trained on these synthesized examples. 6 5.3.1 Training Data. We experiment with two large NL-to-SQL datasets: Spider [66] and Geo880 [67].…”
Section: Automatically Predicted Decompositionsmentioning
confidence: 99%