Several large cloze-style context-question-answer datasets have been introduced recently: the CNN and Daily Mail news data and the Children's Book Test. Thanks to the size of these datasets, the associated text comprehension task is well suited for deep-learning techniques that currently seem to outperform all alternative approaches. We present a new, simple model that uses attention to directly pick the answer from the context, as opposed to computing the answer using a blended representation of words in the document, as is usual in similar models. This makes the model particularly suitable for question-answering problems where the answer is a single word from the document. An ensemble of our models sets a new state of the art on all evaluated datasets.
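To make the attention-sum mechanism concrete, here is a minimal sketch in Python/NumPy: attention weights over document positions are summed per candidate word, and the highest-scoring word is returned directly, rather than blending token representations into an answer vector. The names (attention_sum_answer, doc_enc, query_vec) are illustrative assumptions; the paper's actual model uses recurrent encoders to produce these representations.

# Minimal sketch of the attention-sum idea; shapes and names are assumed.
import numpy as np

def attention_sum_answer(doc_enc, query_vec, doc_tokens, candidates):
    """Pick the answer word by summing attention over its occurrences.

    doc_enc    : (T, d) contextual embedding of each document token
    query_vec  : (d,)   embedding of the question
    doc_tokens : list of T token ids in the document
    candidates : set of token ids that are valid answers
    """
    scores = doc_enc @ query_vec                  # dot-product attention
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()                      # softmax over positions
    # Sum the attention each candidate receives across all its occurrences,
    # instead of computing a blended representation of the document.
    totals = {c: 0.0 for c in candidates}
    for tok, w in zip(doc_tokens, weights):
        if tok in totals:
            totals[tok] += w
    return max(totals, key=totals.get)            # single-word answer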
Many papers have been published on the knowledge base completion task in the past few years. Most of these introduce novel architectures for relation learning that are evaluated on standard datasets such as FB15k and WN18. This paper shows that the accuracy of almost all models published on FB15k can be outperformed by an appropriately tuned baseline: our reimplementation of the DistMult model. Our findings cast doubt on the claim that the performance improvements of recent models are due to architectural changes rather than hyperparameter tuning or different training objectives. This should prompt future research to reconsider how the performance of models is evaluated and reported.
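For reference, DistMult scores a triple (head, relation, tail) with a trilinear dot product over embeddings. The sketch below in Python/NumPy shows only that scoring function; the variable names are illustrative, and the tuned hyperparameters and training objectives the paper credits for the baseline's strength are not reproduced here.

# Minimal sketch of the DistMult scoring function; names are assumed.
import numpy as np

def distmult_score(e_h, w_r, e_t):
    """DistMult: trilinear product <e_h, w_r, e_t> = sum_i e_h[i]*w_r[i]*e_t[i].

    e_h, e_t : (d,) entity embeddings for head and tail
    w_r      : (d,) diagonal relation embedding
    """
    return np.sum(e_h * w_r * e_t)

# Link prediction on FB15k-style benchmarks then ranks every candidate tail:
# scores over all entities can be computed at once as entity_matrix @ (e_h * w_r).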
This paper presents the results of our experiments on next-utterance ranking on the Ubuntu Dialog Corpus, the largest publicly available multi-turn dialog corpus. First, we use an in-house implementation of previously reported models to perform an independent evaluation on the same data. Second, we evaluate the performance of various LSTM, Bi-LSTM, and CNN architectures on the dataset. Third, we create an ensemble by averaging the predictions of multiple models. The ensemble further improves performance, achieving a state-of-the-art result for next-utterance ranking on this dataset. Finally, we discuss our future plans for this corpus.
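The ensembling step is plain score averaging. The sketch below assumes hypothetical model objects exposing a score(context, candidate) method; it illustrates the averaging and ranking only, not the underlying LSTM, Bi-LSTM, or CNN scorers.

# Minimal sketch of ensembling by averaging predictions; the model interface
# (score method) is an assumption for illustration.
import numpy as np

def ensemble_rank(models, context, candidates):
    """Rank candidate next utterances by the mean of per-model scores."""
    # scores[i, j] = model i's score for candidate j given the context
    scores = np.array([[m.score(context, c) for c in candidates]
                       for m in models])
    mean_scores = scores.mean(axis=0)             # average the predictions
    order = np.argsort(-mean_scores)              # best candidate first
    return [candidates[j] for j in order]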