2019
DOI: 10.1609/aaai.v33i01.33016932

COALA: A Neural Coverage-Based Approach for Long Answer Selection with Small Data

Abstract: Current neural network based community question answering (cQA) systems fall short of (1) properly handling long answers which are common in cQA; (2) performing under small data conditions, where a large amount of training data is unavailable—i.e., for some domains in English and even more so for a huge number of datasets in other languages; and (3) benefiting from syntactic information in the model—e.g., to differentiate between identical lexemes with different syntactic roles. In this paper, we propose COALA…

Cited by 18 publications (35 citation statements). References 24 publications.

“…To test whether our strategy to train models with title-body pairs is also suitable for answer selection, we use the data and code of Rücklé et al (2019a) and train two different types of models with WS-TB on their five datasets that are based on StackExchange Apple, Aviation, Academia, Cooking, and Travel. We train (1) a siamese BiLSTM, which learns question and answer representations; and (2) their neural relevance matching model COALA.…”
Section: Answer Selection
Mentioning, confidence: 99%
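The siamese BiLSTM referenced in this excerpt is a standard representation-based scorer: one shared encoder embeds the question and the answer separately, and their vector similarity serves as the relevance score. The PyTorch sketch below only illustrates that general structure; the class name, mean-pooling choice, and dimensions are assumptions, not the setup of Rücklé et al (2019a).

```python
# Minimal sketch of a siamese BiLSTM answer-selection scorer.
# Illustrative only: pooling, dimensions, and scoring are assumptions.
import torch.nn as nn
import torch.nn.functional as F

class SiameseBiLSTMScorer(nn.Module):
    def __init__(self, vocab_size, emb_dim=300, hidden_dim=150):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim, padding_idx=0)
        # A single shared encoder is applied to both question and answer ("siamese").
        self.encoder = nn.LSTM(emb_dim, hidden_dim, batch_first=True,
                               bidirectional=True)

    def encode(self, token_ids):
        # token_ids: (batch, seq_len) -> representation of shape (batch, 2 * hidden_dim)
        states, _ = self.encoder(self.embed(token_ids))
        mask = (token_ids != 0).unsqueeze(-1).float()
        # Mean-pool the BiLSTM states over non-padding positions.
        return (states * mask).sum(1) / mask.sum(1).clamp(min=1.0)

    def forward(self, question_ids, answer_ids):
        # Relevance score: cosine similarity between question and answer vectors.
        return F.cosine_similarity(self.encode(question_ids),
                                    self.encode(answer_ids), dim=-1)
```

Such a scorer is typically trained to rank the correct answer above sampled negatives with a margin-based ranking loss; under the WS-TB strategy quoted above, title–body pairs stand in for labeled question–answer pairs during training.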
“…https://github.com/huggingface/pytorch-transformers same as for BiLSTM (e.g., loss calculation). We train the models until they do not improve for at least 20 epochs, and we restore the weights of the epoch that obtained the best development score. For all other datasets (AskUbuntu-Lei and Answer Selection datasets) we add BERT to the experimental software of Rücklé et al (2019a). We do not include it in the software of Lei et al (2016) because it is tightly coupled to the Theano framework, which is not actively maintained.…”
Mentioning, confidence: 99%
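The training regime described in this excerpt, stopping once the development score has not improved for 20 epochs and then restoring the weights of the best epoch, is early stopping with a patience of 20. A minimal sketch, assuming a PyTorch-style model exposing `state_dict`/`load_state_dict` and placeholder `train_one_epoch`/`evaluate_dev` callables (none of these names come from the cited code):

```python
import copy

def train_with_early_stopping(model, train_one_epoch, evaluate_dev,
                              patience=20, max_epochs=1000):
    """Train until the dev score stops improving for `patience` epochs,
    then restore the weights of the best epoch."""
    best_score = float("-inf")
    best_state = copy.deepcopy(model.state_dict())
    stale_epochs = 0

    for epoch in range(max_epochs):
        train_one_epoch(model)
        score = evaluate_dev(model)
        if score > best_score:
            best_score, stale_epochs = score, 0
            best_state = copy.deepcopy(model.state_dict())
        else:
            stale_epochs += 1
            if stale_epochs >= patience:
                break

    # Roll back to the epoch with the best development score.
    model.load_state_dict(best_state)
    return best_score
```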
“…Output matrix $\eta_{nm} \in \mathbb{R}^{|Q| \times |P|}$ contains the relevance scores of all pairs between n-grams in query and m-grams in answer. From $\eta_{nm}$, we conduct a row-wise max-pooling to obtain $A_{nm}$, relaxing the length constraint in interactions (Rücklé et al, 2019).…”
Section: Matching Module
Mentioning, confidence: 99%
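The row-wise max-pooling quoted above keeps, for each query n-gram, only the score of its best-matching answer m-gram, so the pooled result has length |Q| regardless of how long the answer is, which is presumably what "relaxing the length constraint" refers to. A small NumPy sketch of just that pooling step; the random matrix is a placeholder for the model-specific relevance scores:

```python
import numpy as np

# eta_nm: relevance scores between |Q| query n-grams and |P| answer m-grams.
# The scores here are random placeholders; how they are computed is model-specific.
num_query_ngrams, num_answer_mgrams = 7, 42   # example values for |Q| and |P|
eta_nm = np.random.rand(num_query_ngrams, num_answer_mgrams)

# Row-wise max-pooling: one best-match score per query n-gram,
# independent of the answer length.
A_nm = eta_nm.max(axis=1)
assert A_nm.shape == (num_query_ngrams,)
```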
“…[Table: results on InsuranceQA and WikiPassageQA with metrics Accuracy, MAP, MRR, P@5, P@10, nDCG, R@5, R@10, and R@20; rows include IR Based (Rücklé et al, 2019) and COALA variants.] COALA syntax-aware (Rücklé and Gurevych, 2017) is a variant of COALA using dependency parse trees (Schuster and Manning, 2016).…”
Section: Model
Mentioning, confidence: 99%