Tim Van de Cruys scite author profile

Tim Van de Cruys

5Publications

159Citation Statements Received

170Citation Statements Given

How they've been cited

How they cite others

Affiliations

KU Leuven, Université Toulouse III - Paul Sabatier, University of Groningen

Publications

Order By: Most citations

A Neural Network Approach to Selectional Preference Acquisition

Cruys¹

2014

View full text Add to dashboard Cite

This paper investigates the use of neural networks for the acquisition of selectional preferences. Inspired by recent advances of neural network models for NLP applications, we propose a neural network model that learns to discriminate between felicitous and infelicitous arguments for a particular predicate. The model is entirely unsupervised -preferences are learned from unannotated corpus data. We propose two neural network architectures: one that handles standard two-way selectional preferences and one that is able to deal with multi-way selectional preferences. The model's performance is evaluated on a pseudo-disambiguation task, on which it is shown to achieve state of the art performance.

show abstract

Semantics-based multiword expression extraction

Cruys

Moirón

2007

View full text Add to dashboard Cite

This paper describes a fully unsupervised and automated method for large-scale extraction of multiword expressions (MWEs) from large corpora. The method aims at capturing the non-compositionality of MWEs; the intuition is that a noun within a MWE cannot easily be replaced by a semantically similar noun. To implement this intuition, a noun clustering is automatically extracted (using distributional similarity measures), which gives us clusters of semantically related nouns. Next, a number of statistical measures -based on selectional preferences -is developed that formalize the intuition of non-compositionality. Our approach has been tested on Dutch, and automatically evaluated using Dutch lexical resources.

show abstract

Mining Discourse Markers for Unsupervised Sentence Representation Learning

Sileo¹,

Cruys²,

Pradel³

et al. 2019

View full text Add to dashboard Cite

Current state of the art systems in NLP heavily rely on manually annotated datasets, which are expensive to construct. Very little work adequately exploits unannotated data -such as discourse markers between sentences -mainly because of data sparseness and ineffective extraction methods. In the present work, we propose a method to automatically discover sentence pairs with relevant discourse markers, and apply it to massive amounts of data. Our resulting dataset contains 174 discourse markers with at least 10K examples each, even for rare markers such as coincidentally or amazingly. We use the resulting data as supervision for learning transferable sentence embeddings. In addition, we show that even though sentence representation learning through prediction of discourse markers yields state of the art results across different transfer tasks, it is not clear that our models made use of the semantic relation between sentences, thus leaving room for further improvements. Our datasets are publicly available 1

show abstract

Integrating Type Theory and Distributional Semantics: A Case Study on Adjective–Noun Compositions

Asher

Cruys

Bride

et al. 2016

Computational Linguistics

View full text Add to dashboard Cite

show abstract

A non-negative tensor factorization model for selectional preference induction

Cruys¹

2010

Nat. Lang. Eng.

View full text Add to dashboard Cite

Link to this article: http://journals.cambridge.org/abstract_S1351324910000148How to cite this article: TIM VAN DE CRUYS (2010). A non-negative tensor factorization model for selectional preference induction. Natural Language Engineering, 16, pp 417-437 AbstractThe distributional similarity methods have proven to be a valuable tool for the induction of semantic similarity. Until now, most algorithms use two-way co-occurrence data to compute the meaning of words. Co-occurrence frequencies, however, need not be pairwise. One can easily imagine situations where it is desirable to investigate co-occurrence frequencies of three modes and beyond. This paper will investigate tensor factorization methods to build a model of three-way co-occurrences. The approach is applied to the problem of selectional preference induction, and automatically evaluated in a pseudo-disambiguation task. The results show that tensor factorization, and non-negative tensor factorization in particular, is a promising tool for Natural Language Processing (nlp).

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Tim Van de Cruys

A Neural Network Approach to Selectional Preference Acquisition

Semantics-based multiword expression extraction

Mining Discourse Markers for Unsupervised Sentence Representation Learning

Integrating Type Theory and Distributional Semantics: A Case Study on Adjective–Noun Compositions

A non-negative tensor factorization model for selectional preference induction

Contact Info

Product

Resources

About