2014
DOI: 10.1162/coli_a_00167

Learning Representations for Weakly Supervised Natural Language Processing Tasks

Abstract: Finding the right representations for words is critical for building accurate NLP systems when domain-specific labeled data for the task is scarce. This article investigates novel techniques for extracting features from n-gram models, Hidden Markov Models, and other statistical language models, including a novel Partial Lattice Markov Random Field model. Experiments on part-of-speech tagging and information extraction, among other tasks, indicate that features taken from statistical language models, in combina…
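As a rough illustration of the feature-extraction idea sketched in the abstract (a minimal sketch only, not the article's actual implementation), the snippet below computes per-token posterior distributions over the latent states of a Hidden Markov Model via the forward-backward algorithm; such posteriors can serve as dense word features for a supervised tagger. The HMM parameters here are hypothetical toy values rather than anything trained in the paper.

```python
import numpy as np

# Toy HMM parameters (hypothetical); in practice the HMM would be trained on a
# large unlabeled corpus and its state posteriors appended to a tagger's features.
pi = np.array([0.6, 0.4])                 # initial state distribution
A = np.array([[0.7, 0.3],                 # state transition matrix
              [0.4, 0.6]])
B = np.array([[0.5, 0.4, 0.1],            # emission probabilities per state
              [0.1, 0.3, 0.6]])           # (3-word toy vocabulary)

def state_posteriors(obs):
    """Return a (T, K) matrix of P(state_t = k | obs) via forward-backward."""
    T, K = len(obs), len(pi)
    alpha = np.zeros((T, K))
    beta = np.zeros((T, K))
    alpha[0] = pi * B[:, obs[0]]
    for t in range(1, T):
        alpha[t] = (alpha[t - 1] @ A) * B[:, obs[t]]
    beta[-1] = 1.0
    for t in range(T - 2, -1, -1):
        beta[t] = A @ (B[:, obs[t + 1]] * beta[t + 1])
    gamma = alpha * beta
    return gamma / gamma.sum(axis=1, keepdims=True)

# Each row is a dense, low-dimensional representation of a token in context.
features = state_posteriors([0, 2, 1])
print(features)
```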

Cited by 49 publications (34 citation statements) · References 56 publications
“…They have recently been shown to capture both semantic and syntactic information about words very well, setting performance records in several word similarity tasks (Mikolov et al., 2013; Pennington et al., 2014). Using word embeddings that have been trained a priori has become common practice for enhancing many other NLP tasks (Parikh et al., 2014; Huang et al., 2014). A common method of training a neural network is to randomly initialize all parameters and then optimize them using an optimization algorithm.…”
Section: Word Embeddings
confidence: 99%
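The excerpt contrasts random parameter initialization with embeddings trained a priori. The following PyTorch sketch (hypothetical vocabulary size and dimensionality, not code from the cited papers) shows both ways of populating an embedding lookup table.

```python
import torch
import torch.nn as nn

vocab_size, embed_dim = 10_000, 300

# Strategy 1: random initialization, learned from scratch during training.
random_embeddings = nn.Embedding(vocab_size, embed_dim)

# Strategy 2: start from pre-trained vectors (e.g. word2vec/GloVe loaded into a
# tensor elsewhere) and optionally keep fine-tuning them on the target task.
pretrained = torch.randn(vocab_size, embed_dim)   # stand-in for loaded vectors
pretrained_embeddings = nn.Embedding.from_pretrained(pretrained, freeze=False)

token_ids = torch.tensor([[3, 17, 42]])
print(pretrained_embeddings(token_ids).shape)     # torch.Size([1, 3, 300])
```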
“…Fortunately, there is some indication that other typical measures of extraction performance, such as precision and recall of extracted relations, correlate with the standard perplexity metric used in language modeling. In Figure 1, we show experiments from previous work [29] that demonstrate how the perplexity of a Hidden Markov Model correlates strongly with the model's accuracy in a standard "set expansion" WIE task.…”
Section: The NL Objective
confidence: 98%
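The perplexity metric referenced in this excerpt is the exponential of the negative mean per-token log-likelihood, PPL = exp(-(1/T) Σ_t log p(w_t | context)). A minimal sketch, with made-up token probabilities standing in for an HMM's held-out predictions:

```python
import numpy as np

# Hypothetical log P(w_t | context) values assigned by a language model
# (an HMM or otherwise) to a held-out corpus of T tokens.
token_log_probs = np.log(np.array([0.05, 0.20, 0.01, 0.10]))

# Perplexity: exponential of the negative average per-token log-likelihood.
perplexity = np.exp(-token_log_probs.mean())
print(f"perplexity = {perplexity:.1f}")  # lower is better
```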
“…This objective has … ([29]). Number labels indicate the number of latent states in the HMM, and performance is shown for three training corpus sizes (the full corpus consists of approximately 60 million tokens).…”
Section: The NL Objective
confidence: 99%
“…This method aims to extract the knowledge of previously trained models from source domains and use it to facilitate the training of learning tasks in target domains where labeled data may be limited. To date, transfer learning has been widely applied in image recognition [14,15,16], natural language processing [17,18,19], and robotics [20], and has achieved considerable success. Yet its applications in marketing campaign analysis remain few.…”
Section: Introduction
confidence: 99%
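As a generic illustration of the transfer-learning recipe this excerpt describes (reuse a model trained on a source domain, then adapt it with limited target-domain labels), here is a minimal PyTorch sketch; the layer sizes and data are assumptions, not details of any cited system.

```python
import torch
import torch.nn as nn

# Stand-in for a feature extractor pre-trained on a source domain.
source_model = nn.Sequential(
    nn.Linear(128, 64), nn.ReLU(),
    nn.Linear(64, 32), nn.ReLU(),
)
for param in source_model.parameters():
    param.requires_grad = False        # keep source-domain knowledge fixed

target_head = nn.Linear(32, 2)         # new classifier for the target task
optimizer = torch.optim.Adam(target_head.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

# One training step on a small labeled target-domain batch.
x = torch.randn(16, 128)
y = torch.randint(0, 2, (16,))
logits = target_head(source_model(x))
loss = loss_fn(logits, y)
loss.backward()
optimizer.step()
```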