Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
DOI: 10.18653/v1/2020.emnlp-main.589

Neural Conversational QA: Learning to Reason vs Exploiting Patterns

Abstract: Neural Conversational QA tasks like ShARC require systems to answer questions based on the contents of a given passage. On studying recent state-of-the-art models on the ShARC QA task, we found indications that the models learn spurious clues/patterns in the dataset. Furthermore, we show that a heuristic-based program designed to exploit these patterns can have performance comparable to that of the neural models. In this paper we share our findings about four types of patterns found in the ShARC corpus and des…
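To make the abstract's central claim concrete, here is a minimal Python sketch of the kind of pattern-exploiting heuristic the paper describes. The specific cues below are illustrative assumptions, not the actual patterns reported in the paper; only the ShARC decision labels (Yes / No / Irrelevant / More) are taken from the task itself.

    # Toy heuristic for ShARC-style decisions (Yes / No / Irrelevant / More).
    # The cues below are hypothetical examples of surface patterns such a
    # program could exploit; they are not the patterns reported in the paper.
    def heuristic_decision(rule_text, question, history):
        """Predict a decision from surface cues alone, without any reasoning."""
        conditions = [ln for ln in rule_text.splitlines()
                      if ln.lstrip().startswith("*")]   # bulleted rule conditions
        overlap = set(rule_text.lower().split()) & set(question.lower().split())
        if not overlap:
            return "Irrelevant"        # no lexical overlap: treat as off-topic
        if len(history) < len(conditions):
            return "More"              # unanswered conditions: ask a follow-up
        # echo the most recent user answer as the final decision
        return "Yes" if history and history[-1][1].lower() == "yes" else "No"

A program of this shape never consults the meaning of the rule, which is why comparable accuracy from such a baseline signals dataset artifacts rather than reasoning.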

Cited by 9 publications (16 citation statements)
References 7 publications (6 reference statements)
“…The major differences lie on two sides: 1) machines are required to formulate follow-up questions for clarification before they are confident enough to make a decision; 2) machines have to reach a question-related conclusion by interpreting a set of complex decision rules, instead of simply extracting the answer from the text. Existing works (Zhong and Zettlemoyer, 2019; Lawrence et al., 2019; Verma et al., 2020; Gao et al., 2020a,b) have made progress in improving reasoning ability by modeling the interactions between the rule document and other elements. As a widely used approach, existing models commonly extract the rule document into individual rule items and track rule fulfillment across dialogue states.…”
Section: Related Work (mentioning)
confidence: 99%
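As a rough illustration of the pipeline this citation describes — splitting the rule document into individual rule items and tracking their fulfillment against the dialogue state — here is a hedged Python sketch. The word-overlap test stands in for the learned entailment/matching modules real models use, and all names are hypothetical.

    # Sketch: split a rule document into items, then track each item's
    # fulfillment from the dialogue history. A real system replaces the
    # word-overlap test with a learned entailment or matching module.
    def extract_rule_items(rule_text):
        """Treat each non-empty line (e.g., a bullet) as one rule condition."""
        return [line.lstrip("* ").strip()
                for line in rule_text.splitlines() if line.strip()]

    def track_fulfillment(items, history):
        """Map each rule item to 'satisfied', 'unsatisfied', or 'unknown'."""
        state = dict.fromkeys(items, "unknown")
        for followup_q, user_answer in history:
            for item in items:
                if set(item.lower().split()) & set(followup_q.lower().split()):
                    state[item] = ("satisfied" if user_answer.lower() == "yes"
                                   else "unsatisfied")
        return state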
“…The corresponding rule document and the question are marked in the same color in the figure. The major challenges for conversational machine reading include rule-document interpretation and reasoning with background knowledge, e.g., the provided rule document, the user scenario, and the input question. Existing works (Zhong and Zettlemoyer, 2019; Lawrence et al., 2019; Verma et al., 2020; Gao et al., 2020a,b) have made progress in improving reasoning ability by implicitly modeling the interactions among the rule document, the user scenario, and the other elements. As for rule-document interpretation, most existing approaches simply split the rule document into several rule conditions to be satisfied.…”
Section: Introduction (mentioning)
confidence: 99%
“…The left part introduces the retrieval and tagging process for rule documents, whose output is then fed into the encoder together with other necessary information. …(Lawrence et al., 2019; Verma et al., 2020; Gao et al., 2020a,b; Ouyang et al., 2021) have made progress in modeling the matching relationships between the rule document and other elements such as user scenarios and questions. These studies are based on the assumption that the supporting information for answering the question is provided, which does not match real-world applications.…”
Section: Decision Making (mentioning)
confidence: 99%
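The open-retrieval setting this citation points at can be sketched briefly: when the supporting rule document is not given, the system must first retrieve it from a corpus before any decision making. The scoring function below is a simple word-overlap stand-in for whatever retriever such systems actually use (e.g., TF-IDF or a dense retriever); everything named here is an assumption.

    # Sketch of the retrieval step in an open-retrieval setting: score every
    # rule document against the user question and return the best match.
    # Plain word overlap stands in for a real sparse or dense retriever.
    def retrieve_rule_document(question, corpus):
        """Return the corpus document sharing the most words with the question."""
        q_words = set(question.lower().split())
        return max(corpus, key=lambda doc: len(q_words & set(doc.lower().split())))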
“…Existing studies treat decision making and question generation independently (Zhong and Zettlemoyer, 2019; Lawrence et al., 2019; Verma et al., 2020; Gao et al., 2020a,b) and use hard-label decisions to activate question generation. These methods inevitably suffer from error propagation when the model makes a wrong decision.…”
Section: Double-channel Decoder (mentioning)
confidence: 99%
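The error-propagation issue raised here is visible in a small sketch of the hard-label pipeline: question generation runs only when the decision classifier emits "More", so a wrong hard decision silences the generator entirely. The function names below are hypothetical stand-ins for the two modules.

    # Hard-label pipeline: the decision gate either answers or triggers
    # question generation. If `decide` wrongly outputs "Yes"/"No", the
    # generator is never invoked -- the error propagates uncorrected.
    def answer_turn(rule_text, question, history, decide, generate_followup):
        decision = decide(rule_text, question, history)  # "Yes"/"No"/"Irrelevant"/"More"
        if decision == "More":
            return generate_followup(rule_text, history)
        return decision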