Daheim, Nico scite author profile

Daheim, Nico

9 Publications

5 Citation Statements Received

191 Citation Statements Given

Affiliations

RWTH Aachen University

Publications

Cascaded Span Extraction and Response Generation for Document-Grounded Dialog

Nico, Thulke, Dugast et al. 2021

This paper summarizes our entries to both subtasks of the first DialDoc shared task which focuses on the agent response prediction task in goal-oriented document-grounded dialogs. The task is split into two subtasks: predicting a span in a document that grounds an agent turn and generating an agent response based on a dialog and grounding document. In the first subtask, we restrict the set of valid spans to the ones defined in the dataset, use a biaffine classifier to model spans, and finally use an ensemble of different models. For the second subtask, we use a cascaded model which grounds the response prediction on the predicted span instead of the full document. With these approaches, we obtain significant improvements in both subtasks compared to the baseline.

Controllable Factuality in Document-Grounded Dialog Systems Using a Noisy Channel Model

Nico, Thulke, Dugast et al. 2022

Opportunities and Challenges in Neural Dialog Tutoring

Jakub, Nico, Wang et al. 2023

Preprint

Designing dialog tutors has been challenging as it involves modeling the diverse and complex pedagogical strategies employed by human tutors. Although there have been significant recent advances in neural conversational systems using large language models and growth in available dialog corpora, dialog tutoring has largely remained unaffected by these advances. In this paper, we rigorously analyze various generative language models on two dialog tutoring datasets for language learning using automatic and human evaluations to understand the new opportunities brought by these advances as well as the challenges we must overcome to build models that would be usable in real educational settings. We find that although current approaches can model tutoring in constrained learning scenarios when the number of concepts to be taught and possible teacher strategies are small, they perform poorly in less constrained scenarios. Our human quality evaluation shows that both models and ground-truth annotations exhibit low performance in terms of equitable tutoring, which measures learning opportunities for students and how engaging the dialog is. To understand the behavior of our models in a real tutoring setting, we conduct a user study using expert annotators and find a significantly large number of model reasoning errors in 45% of conversations. Finally, we connect our findings to outline future work. https://github.com/eth-nlped/dialog-tutoring

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

Gehrmann, Bhattacharjee, Abinaya et al. 2022

Preprint

Controllable Factuality in Document-Grounded Dialog Systems Using a Noisy Channel Model

Nico, Thulke, Dugast et al. 2022

Preprint

In this work, we present a model for document-grounded response generation in dialog that is decomposed into two components according to Bayes' theorem. One component is a traditional ungrounded response generation model and the other component models the reconstruction of the grounding document based on the dialog context and generated response. We propose different approximate decoding schemes and evaluate our approach on multiple open-domain and task-oriented document-grounded dialog datasets. Our experiments show that the model is more factual in terms of automatic factuality metrics than the baseline model. Furthermore, we outline how introducing scaling factors between the components allows for controlling the tradeoff between factuality and fluency in the model output. Finally, we compare our approach to a recently proposed method to control factuality in grounded dialog, CTRL (Rashkin et al., 2021), and show that both approaches can be combined to achieve additional improvements.

Cascaded Span Extraction and Response Generation for Document-Grounded Dialog

Nico, Thulke, Dugast et al. 2021

Preprint

Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference

Zouhar, Dhuliawala, Zhou et al. 2023

Preprint

Machine translation quality estimation (QE) predicts human judgements of a translation hypothesis without seeing the reference. State-of-the-art QE systems based on pretrained language models have been achieving remarkable correlations with human judgements, yet they are computationally heavy and require human annotations, which are slow and expensive to create. To address these limitations, we define the problem of metric estimation (ME) where one predicts the automated metric scores also without the reference. We show that even without access to the reference, our model can estimate automated metrics (ρ=60% for BLEU, ρ=51% for other metrics) at the sentence-level. Because automated metrics correlate with human judgements, we can leverage the ME task for pre-training a QE model. For the QE task, we find that pre-training on TER is better (ρ=23%) than training from scratch (ρ=20%).

Adapting Document-Grounded Dialog Systems to Spoken Conversations using Data Augmentation and a Noisy Channel Model

Thulke, Nico, Dugast et al. 2021

Preprint

This paper summarizes our submission to Task 2 of the second track of the 10th Dialog System Technology Challenge (DSTC10) "Knowledge-grounded Task-oriented Dialogue Modeling on Spoken Conversations". Similar to the previous year's iteration, the task consists of three subtasks: detecting whether a turn is knowledge seeking, selecting the relevant knowledge document and finally generating a grounded response. This year, the focus lies on adapting the system to noisy ASR transcripts. We explore different approaches to make the models more robust to this type of input and to adapt the generated responses to the style of spoken conversations. For the latter, we get the best results with a noisy channel model that additionally reduces the number of short and generic responses. Our best system achieved the 1st rank in the automatic and the 3rd rank in the human evaluation of the challenge.
