Çağlar Tırkaz scite author profile

Recent progress through advanced neural models pushed the performance of task-oriented dialog systems to almost perfect accuracy on existing benchmark datasets for intent classification and slot labeling. However, in evolving real-world dialog systems, where new functionality is regularly added, a major additional challenge is the lack of annotated training data for such new functionality, as the necessary data collection efforts are laborious and time-consuming. A potential solution to reduce the effort is to augment initial seed data by paraphrasing existing utterances automatically. In this paper, we propose a new, data-efficient approach following this idea. Using an interpretation-to-text model for paraphrase generation, we are able to rely on existing dialog system training data, and, in combination with shuffling-based sampling techniques, we can obtain diverse and novel paraphrases from small amounts of seed data. In experiments on a public dataset and with a real-world dialog system, we observe improvements for both intent classification and slot labeling, demonstrating the usefulness of our approach.

show abstract

Leveraging User Paraphrasing Behavior In Dialog Systems To Automatically Collect Annotations For Long-Tail Utterances

Falke¹,

Boese²,

Sorokin³

et al. 2020

View full text Add to dashboard Cite

In large-scale commercial dialog systems, users express the same request in a wide variety of alternative ways with a long tail of less frequent alternatives. Handling the full range of this distribution is challenging, in particular when relying on manual annotations. However, the same users also provide useful implicit feedback as they often paraphrase an utterance if the dialog system failed to understand it. We propose MARUPA, a method to leverage this type of feedback by creating annotated training examples from it. MARUPA creates new data in a fully automatic way, without manual intervention or effort from annotators, and specifically for currently failing utterances. By re-training the dialog system on this new data, accuracy and coverage for longtail utterances can be improved. In experiments, we study the effectiveness of this approach in a commercial dialog system across various domains and three languages.

show abstract

Activity recognition using a hierarchical model

Tırkaz

Bruckner

Yin

et al. 2012

View full text Add to dashboard Cite

Abstract-In this paper, we propose a human daily activity recognition method that is used for Ambient Assisted Living. The proposed system is able to learn a user's activities using the data from motion and door sensors. We extract low level features from the sensor data and feed the features to a model that combines support vector machines (SVMs) and conditional random fields (CRFs) to give accurate recognition results. We propose to combine SVM and CRF classifiers in a hierarchical model which results in better accuracies and can also make use of high level features. We conducted experiments and presented the effectiveness and accuracies of the proposed method.

show abstract

Identifying visual attributes for object recognition from text and taxonomy

Tırkaz

Eisenstein

Sezgin

et al. 2015

Computer Vision and Image Understanding

View full text Add to dashboard Cite

Face recognition using Active Appearance Model

Tırkaz¹,

Albayrak²

2009

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.