Dhiraj Madan scite author profile

Dhiraj Madan

3Publications

16Citation Statements Received

47Citation Statements Given

How they've been cited

How they cite others

Affiliations

IBM (United States), University of Sydney

Publications

Order By: Most citations

Neural Conversational QA: Learning to Reason vs Exploiting Patterns

Verma¹,

Sharma²,

Madan

et al. 2020

View full text Add to dashboard Cite

Neural Conversational QA tasks like ShARC require systems to answer questions based on the contents of a given passage. On studying recent state-of-the-art models on the ShARC QA task, we found indications that the models learn spurious clues/patterns in the dataset. Furthermore, we show that a heuristic-based program designed to exploit these patterns can have performance comparable to that of the neural models. In this paper we share our findings about four types of patterns found in the ShARC corpus and describe how neural models exploit them. Motivated by the aforementioned findings, we create and share a modified dataset that has fewer spurious patterns, consequently allowing models to learn better.

show abstract

Depth Optimized Ansatz Circuit in QAOA for Max-Cut

Majumdar¹,

Bhoumik²,

Madan³

et al. 2021

Preprint

View full text Add to dashboard Cite

While a Quantum Approximate Optimization Algorithm (QAOA) is intended to provide a quantum advantage in finding approximate solutions to combinatorial optimization problems, noise in the system is a hurdle in exploiting its full potential. Several error mitigation techniques have been studied to lessen the effect of noise on this algorithm. Recently, Majumdar et al. proposed a Depth First Search (DFS) based method to reduce n − 1 CNOT gates in the ansatz design of QAOA for finding Max-Cut in a graph G = (V, E), |V | = n. However, this method tends to increase the depth of the circuit, making it more prone to relaxation error. The depth of the circuit is proportional to the height of the DFS tree, which can be n − 1 in the worst case. In this paper, we propose an O(∆ • n 2 ) greedy heuristic algorithm, where ∆ is the maximum degree of the graph, that finds a spanning tree of lower height, thus reducing the overall depth of the circuit while still retaining the n − 1 reduction in the number of CNOT gates needed in the ansatz. We numerically show that this algorithm achieves 10 times increase in the probability of success for each iteration of QAOA for Max-Cut. We further show that although the average depth of the circuit produced by this heuristic algorithm still grows linearly with n, our algorithm reduces the slope of the linear increase from 1 to 0.11.

show abstract

Unsupervised Learning of Interpretable Dialog Models

Madan¹,

Raghu²,

Pandey³

et al. 2018

Preprint

View full text Add to dashboard Cite

Recently several deep learning based models have been proposed for end-to-end learning of dialogs. While these models can be trained from data without the need for any additional annotations, it is hard to interpret them. On the other hand, there exist traditional state based dialog systems, where the states of the dialog are discrete and hence easy to interpret. However these states need to be handcrafted and annotated in the data. To achieve the best of both worlds, we propose Latent State Tracking Network (LSTN) using which we learn an interpretable model in unsupervised manner. The model defines a discrete latent variable at each turn of the conversation which can take a finite set of values. Since these discrete variables are not present in the training data, we use EM algorithm to train our model in unsupervised manner. In the experiments, we show that LSTN can help achieve interpretability in dialog models without much decrease in performance compared to end-to-end approaches.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Dhiraj Madan

Neural Conversational QA: Learning to Reason vs Exploiting Patterns

Depth Optimized Ansatz Circuit in QAOA for Max-Cut

Unsupervised Learning of Interpretable Dialog Models

Contact Info

Product

Resources

About