Anirban Laha scite author profile

Diversity driven attention model for query-based abstractive summarization

Abstractive summarization aims to generate a shorter version of the document covering all the salient points in a compact and coherent fashion. Query-based summarization, on the other hand, highlights those points that are relevant in the context of a given query. The encode-attend-decode paradigm has achieved notable success in machine translation, extractive summarization, dialog systems, etc., but it suffers from the drawback of generating repeated phrases. In this work we propose a model for the query-based summarization task based on the encode-attend-decode paradigm with two key additions: (i) a query attention model (in addition to the document attention model), which learns to focus on different portions of the query at different time steps (instead of using a static representation for the query), and (ii) a new diversity-based attention model, which aims to alleviate the problem of repeating phrases in the summary. To enable testing of this model, we introduce a new query-based summarization dataset built on Debatepedia. Our experiments show that with these two additions the proposed model clearly outperforms vanilla encode-attend-decode models, with a gain of 28% (absolute) in ROUGE-L scores.

Unsupervised Neural Text Simplification

Surya, Mishra, Laha et al. 2019

The paper presents a first attempt at unsupervised neural text simplification that relies only on unlabeled text corpora. The core framework is composed of a shared encoder and a pair of attentional decoders, crucially assisted by discrimination-based losses and denoising. The framework is trained using unlabeled text collected from the en-Wikipedia dump. Our analysis (both quantitative and qualitative, involving human evaluators) on public test data shows that the proposed model can perform text simplification at both lexical and syntactic levels, and is competitive with existing supervised methods. It also outperforms viable unsupervised baselines. Adding a few labeled pairs improves performance further.

Diversity driven Attention Model for Query-based Abstractive Summarization

Nema, Khapra, Laha et al. 2017

Preprint

Joint Learning of Correlated Sequence Labeling Tasks Using Bidirectional Recurrent Neural Networks

Pahuja, Laha, Mirkin et al. 2017

The stream of words produced by Automatic Speech Recognition (ASR) systems is typically devoid of punctuation and formatting. Most natural language processing applications expect segmented and well-formatted text as input, which ASR output does not provide. This paper proposes a novel technique for jointly modeling multiple correlated tasks, such as punctuation and capitalization, using bidirectional recurrent neural networks, which leads to improved performance on each of these tasks. This method could be extended to joint modeling of any other correlated sequence labeling tasks.

Generating Descriptions from Structured Data Using a Bifocal Attention Mechanism and Gated Orthogonalization

Nema, Shetty, Jain et al. 2018
