Yimai Fang scite author profile

Yimai Fang

5Publications

77Citation Statements Received

79Citation Statements Given

How they've been cited

How they cite others

120

Affiliations

University of Cambridge

Publications

Order By: Most citations

A Generative Model for Joint Natural Language Understanding and Generation

Tseng¹,

Cheng²,

Fang³

et al. 2020

View full text Add to dashboard Cite

Natural language understanding (NLU) and natural language generation (NLG) are two fundamental and related tasks in building task-oriented dialogue systems with opposite objectives: NLU tackles the transformation from natural language to formal representations, whereas NLG does the reverse. A key to success in either task is parallel training data which is expensive to obtain at a large scale. In this work, we propose a generative model which couples NLU and NLG through a shared latent variable. This approach allows us to explore both spaces of natural language and formal representations, and facilitates information sharing through the latent space to eventually benefit NLU and NLG. Our model achieves state-of-the-art performance on two dialogue datasets with both flat and tree-structured formal representations. We also show that the model can be trained in a semi-supervised fashion by utilising unlabelled data to boost its performance.

show abstract

Plan-then-Generate: Controlled Data-to-Text Generation via Planning

Su¹,

Vandyke²,

Wang³

et al. 2021

View full text Add to dashboard Cite

Recent developments in neural networks have led to the advance in data-to-text generation. However, the lack of ability of neural models to control the structure of generated output can be limiting in certain real-world applications. In this study, we propose a novel Plan-then-Generate (PlanGen) framework to improve the controllability of neural data-totext models. Extensive experiments and analyses are conducted on two benchmark datasets, ToTTo and WebNLG. The results show that our model is able to control both the intrasentence and inter-sentence structure of the generated output. Furthermore, empirical comparisons against previous state-of-the-art methods show that our model improves the generation quality as well as the output diversity as judged by human and automatic evaluations.

show abstract

A Summariser based on Human Memory Limitations and Lexical Competition

Fang

Teufel

2014

View full text Add to dashboard Cite

Kintsch and van Dijk proposed a model of human comprehension and summarisation which is based on the idea of processing propositions on a sentence-bysentence basis, detecting argument overlap, and creating a summary on the basis of the best connected propositions. We present an implementation of that model, which gets around the problem of identifying concepts in text by applying coreference resolution, named entity detection, and semantic similarity detection, implemented as a two-step competition. We evaluate the resulting summariser against two commonly used extractive summarisers using ROUGE, with encouraging results.

show abstract

Improving Argument Overlap for Proposition-Based Summarisation

Fang¹,

Teufel²

2016

View full text Add to dashboard Cite

We present improvements to our incremental proposition-based summariser, which is inspired by Kintsch and van Dijk's (1978) text comprehension model. Argument overlap is a central concept in this summariser. Our new model replaces the old overlap method based on distributional similarity with one based on lexical chains. We evaluate on a new corpus of 124 summaries of educational texts, and show that our new system outperforms the old method and several stateof-the-art non-proposition-based summarisers. The experiment also verifies that the incremental nature of memory cycles is beneficial in itself, by comparing it to a non-incremental algorithm using the same underlying information.

show abstract

A Generative Model for Joint Natural Language Understanding and Generation

Tseng¹,

Cheng²,

Fang³

et al. 2020

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yimai Fang

A Generative Model for Joint Natural Language Understanding and Generation

Plan-then-Generate: Controlled Data-to-Text Generation via Planning

A Summariser based on Human Memory Limitations and Lexical Competition

Improving Argument Overlap for Proposition-Based Summarisation

A Generative Model for Joint Natural Language Understanding and Generation

Contact Info

Product

Resources

About