2022
DOI: 10.1162/tacl_a_00446
|View full text |Cite
|
Sign up to set email alerts
|

FeTaQA: Free-form Table Question Answering

Abstract: Existing table question answering datasets contain abundant factual questions that primarily evaluate a QA system’s comprehension of query and tabular data. However, restricted by their short-form answers, these datasets fail to include question–answer interactions that represent more advanced and naturally occurring information needs: questions that ask for reasoning and integration of information pieces retrieved from a structured knowledge source. To complement the existing datasets and to reveal the challe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
17
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 24 publications
(17 citation statements)
references
References 35 publications
0
17
0
Order By: Relevance
“…• FetaQA [27] This dataset contains 10,330 clean tables from Wikipedia and 2003 test questions. The questions are generally longer and more complex than those from NQ-Tables.…”
Section: Discussionmentioning
confidence: 99%
“…• FetaQA [27] This dataset contains 10,330 clean tables from Wikipedia and 2003 test questions. The questions are generally longer and more complex than those from NQ-Tables.…”
Section: Discussionmentioning
confidence: 99%
“…SQuAD (Rajpurkar et al, 2016) and CNN/Daily Mail (Hermann et al, 2015) are classic datasets for textual data. Table/KB QA datasets mainly focus on structured tables (Pasupat and Liang, 2015;Zhong et al, 2017;Yu et al, 2018;Nan et al, 2022) and knowledge bases (Berant et al, 2013;Yih et al, 2015;Talmor and Berant, 2018;Xie et al, 2022). And some recent works focus on reasoning over more complex tables including hierarchical tables (Cheng et al, 2021b 2021).…”
Section: Related Workmentioning
confidence: 99%
“…While it is domain-specific, the included tables have a very peculiar structure (with table rows containing entire natural language sentences that have been split into columns), which in our experience is not representative of tables appearing in most domains. Recently, Nan et al (2022) proposed FeTaQA; another Wikipedia-based dataset but with answers that are long free-form sentences (instead of short answers found in prior datasets).…”
Section: Related Workmentioning
confidence: 99%
“…Experimental evaluation of state-of-the-art (Pasupat and Liang, 2015) TabMCQ (Jauhar et al, 2016) 2016 (Science Exam) WikiSQL (Zhong et al, 2017) 2017 FeTaQA (Nan et al, 2022) 2021…”
Section: Introductionmentioning
confidence: 99%