Yushu Yang scite author profile

Yushu Yang

5 Publications

1 Citation Statement Received

92 Citation Statements Given

Affiliations

Meizu (China), Beijing University of Posts and Telecommunications

Publications

Answer-Driven Visual State Estimator for Goal-Oriented Visual Dialogue

Feng, Wang, et al. 2020

A goal-oriented visual dialogue involves multi-turn interactions between two agents, Questioner and Oracle. The answer given by Oracle is of great significance, as it provides a golden response to Questioner's query. Based on the answer, Questioner updates its belief about the target visual content and raises another question. Notably, different answers lead to different visual beliefs and future questions. However, existing methods always indiscriminately encode answers after much longer questions, resulting in weak utilization of answers. In this paper, we propose an Answer-Driven Visual State Estimator (ADVSE) to impose the effects of different answers on visual states. First, we propose Answer-Driven Focusing Attention (ADFA) to capture the answer-driven effect on visual attention by sharpening question-related attention and adjusting it with an answer-based logical operation at each turn. Then, based on the focusing attention, we obtain the visual state estimate through Conditional Visual Information Fusion (CVIF), where overall information and difference information are fused conditioned on the question-answer state. We evaluate the proposed ADVSE on both the question generator and guesser tasks on the large-scale GuessWhat?! dataset and achieve state-of-the-art performance on both tasks. The qualitative results indicate that ADVSE helps the agent generate highly efficient questions and obtains reliable visual attention during reasonable question generation and guessing.

CCS Concepts: Computing methodologies → Computer vision tasks; Discourse, dialogue and pragmatics; Natural language generation; Computer vision representations.

Syntactic Graph Convolutional Network for Spoken Language Understanding

He, Lei, Yang, et al. 2020

Slot filling and intent detection are two major tasks for spoken language understanding. In most existing work, these two tasks are built as joint models with multi-task learning, without considering prior linguistic knowledge. In this paper, we propose a novel joint model that applies a graph convolutional network over dependency trees to integrate syntactic structure into the joint learning of slot filling and intent detection. Experimental results show that our proposed model outperforms existing work and achieves state-of-the-art performance on two public benchmark datasets. Finally, we apply the BERT model to further improve performance on both slot filling and intent detection.

* The work was done when the first author was an intern at Meituan Group. The first two authors contributed equally.
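As a rough illustration of what applying a graph convolutional network over a dependency tree can look like, here is a minimal sketch of a single graph-convolution layer in plain Python. It is not the authors' model: the example sentence, the head indices, the feature vectors, the identity weight matrix, and the choice of undirected edges with row normalization are all assumptions made for illustration.

```python
# Hypothetical sketch: one graph-convolution layer over a dependency tree.
# The sentence, head indices, features, and weights are invented for illustration.

def normalized_adjacency(heads):
    """Build a row-normalized adjacency matrix with self-loops.

    heads[i] is the index of token i's syntactic head, or -1 for the root.
    Edges are treated as undirected, which is a simplifying assumption.
    """
    n = len(heads)
    adj = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for child, head in enumerate(heads):
        if head >= 0:
            adj[child][head] = 1.0
            adj[head][child] = 1.0
    return [[value / sum(row) for value in row] for row in adj]


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def gcn_layer(adj, features, weight):
    """Compute ReLU(A_hat @ H @ W) for one layer."""
    mixed = matmul(matmul(adj, features), weight)
    return [[max(0.0, value) for value in row] for row in mixed]


# "book a flight": "book" is the root, "a" attaches to "flight",
# and "flight" attaches to "book".
heads = [-1, 2, 0]
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # one toy vector per token
weight = [[1.0, 0.0], [0.0, 1.0]]  # identity, so only neighbor mixing is visible

adj = normalized_adjacency(heads)
hidden = gcn_layer(adj, features, weight)
```

Each row of `hidden` is a token representation averaged with its syntactic neighbors. In a joint model, such per-token vectors could feed a slot tagger while a pooled vector feeds an intent classifier, though how the paper wires these together is not stated in the abstract.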

Converse, Focus and Guess - Towards Multi-Document Driven Dialogue

Liu, Yuan, Wang, et al. 2021

AAAI

We propose a novel task, Multi-Document Driven Dialogue (MD3), in which an agent guesses the target document that the user is interested in by leading a dialogue. To benchmark progress, we introduce a new dataset, GuessMovie, which contains 16,881 documents, each describing a movie, and 13,434 associated dialogues. Further, we propose the MD3 model. Keeping the goal of guessing the target document in mind, it converses with the user conditioned on both document engagement and user feedback. To incorporate large-scale external documents into the dialogue, it pretrains a document representation that is sensitive to the attributes with which a document describes an object. It then tracks the dialogue state by detecting the evolution of document belief and attribute belief, and finally optimizes the dialogue policy on the principle of decreasing entropy and increasing reward, which is expected to guess the user's target in a minimum number of turns. Experiments show that our method significantly outperforms several strong baseline methods and is very close to human performance.
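The entropy-decreasing idea in this abstract can be pictured with a small sketch. This is a simple greedy heuristic swapped in for illustration, not the MD3 model or its learned policy, and it ignores the reward term entirely. The toy documents, the attribute names, and the assumption of noise-free yes/no feedback are all hypothetical.

```python
import math

# Hypothetical sketch: choose the attribute to ask about so that the expected
# entropy of the belief over documents is as low as possible after the answer.


def entropy(belief):
    """Shannon entropy (in bits) of a belief over documents."""
    return -sum(p * math.log2(p) for p in belief.values() if p > 0)


def update_belief(belief, docs, attribute, confirmed):
    """Keep only documents consistent with the user's yes/no feedback."""
    posterior = {
        doc: (p if (attribute in docs[doc]) == confirmed else 0.0)
        for doc, p in belief.items()
    }
    total = sum(posterior.values())
    if total == 0:
        return dict(belief)  # contradictory feedback: leave the belief unchanged
    return {doc: p / total for doc, p in posterior.items()}


def expected_entropy(belief, docs, attribute):
    """Average posterior entropy over the two possible answers."""
    p_yes = sum(p for doc, p in belief.items() if attribute in docs[doc])
    result = 0.0
    for confirmed, prob in ((True, p_yes), (False, 1.0 - p_yes)):
        if prob > 0:
            result += prob * entropy(update_belief(belief, docs, attribute, confirmed))
    return result


def pick_attribute(belief, docs, attributes):
    """Greedy choice: the attribute with the lowest expected entropy."""
    return min(attributes, key=lambda a: expected_entropy(belief, docs, a))


# Toy movie "documents", each reduced to a set of attributes.
docs = {
    "m1": {"comedy", "1990s", "sequel"},
    "m2": {"comedy", "2000s"},
    "m3": {"drama", "1990s"},
    "m4": {"drama", "2000s"},
}
belief = {doc: 0.25 for doc in docs}

best = pick_attribute(belief, docs, ["sequel", "comedy"])
after_yes = update_belief(belief, docs, best, confirmed=True)
```

In this toy setup, asking about an attribute that splits the candidates evenly is preferred over one that singles out a single document, because the even split removes more uncertainty on average.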

Converse, Focus and Guess -- Towards Multi-Document Driven Dialogue

Liu, Yuan, Wang, et al. 2021

Preprint

Chinese sentiment classification based on semantic structure of sentences

Yang, Zhou 2011

