Armand Joulin scite author profile

Enriching Word Vectors with Subword Information

Continuous word representations trained on large unlabeled corpora are useful for many natural language processing tasks. Popular models that learn such representations ignore the morphology of words by assigning a distinct vector to each word. This is a limitation, especially for languages with large vocabularies and many rare words. In this paper, we propose a new approach based on the skipgram model, where each word is represented as a bag of character n-grams. A vector representation is associated with each character n-gram, and words are represented as the sum of these representations. Our method is fast, allowing models to be trained on large corpora quickly, and it allows us to compute word representations for words that did not appear in the training data. We evaluate our word representations on nine different languages, on both word similarity and analogy tasks. By comparing to recently proposed morphological word representations, we show that our vectors achieve state-of-the-art performance on these tasks.
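The bag-of-character-n-grams idea in this abstract can be illustrated with a minimal Python sketch. The boundary markers `<` and `>`, the n-gram range of 3 to 6, and the helper names are assumptions made for illustration and are not stated in the abstract. The `ngram_vector` function is a toy stand-in that returns a deterministic pseudo-random vector; a trained model would learn these vectors instead.

```python
import random

def char_ngrams(word, n_min=3, n_max=6):
    """Return the character n-grams of a word wrapped in boundary markers.

    The markers and the default range are illustrative assumptions.
    """
    wrapped = f"<{word}>"
    grams = []
    for n in range(n_min, n_max + 1):
        for i in range(len(wrapped) - n + 1):
            grams.append(wrapped[i:i + n])
    return grams

def ngram_vector(gram, dim=8):
    """Hypothetical stand-in for a learned n-gram embedding.

    Seeding with the n-gram string makes the vector deterministic.
    """
    rng = random.Random(gram)
    return [rng.uniform(-1.0, 1.0) for _ in range(dim)]

def word_vector(word, dim=8):
    """Represent a word as the sum of its character n-gram vectors."""
    vec = [0.0] * dim
    for gram in char_ngrams(word):
        for k, value in enumerate(ngram_vector(gram, dim)):
            vec[k] += value
    return vec

# Any string yields n-grams, so a word absent from training still gets a vector.
print(char_ngrams("where", 3, 3))
print(len(word_vector("unseenword")))
```

Because the vector is built from n-grams rather than looked up by whole word, the same function applies to words that were never seen, which is the property the abstract highlights.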

Deep Clustering for Unsupervised Learning of Visual Features

Caron, Bojanowski, Joulin et al. 2018

1,798

1,779

Clustering is a class of unsupervised learning methods that has been extensively applied and studied in computer vision. Little work has been done to adapt it to the end-to-end training of visual features on large-scale datasets. In this work, we present DeepCluster, a clustering method that jointly learns the parameters of a neural network and the cluster assignments of the resulting features. DeepCluster iteratively groups the features with a standard clustering algorithm, k-means, and uses the subsequent assignments as supervision to update the weights of the network. We apply DeepCluster to the unsupervised training of convolutional neural networks on large datasets like ImageNet and YFCC100M. The resulting model outperforms the current state of the art by a significant margin on all the standard benchmarks.
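The alternation this abstract describes, clustering the features and then using the assignments as supervision, can be sketched in plain Python. This is a toy illustration under loud assumptions: a single linear map `W` stands in for the convolutional network, a linear softmax head `V` is trained on the pseudo-labels, and all function names and hyperparameters are hypothetical. It omits safeguards a full method would need and is not the authors' implementation.

```python
import math
import random

def matvec(M, v):
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def kmeans(points, k, iters=10, seed=0):
    """Plain k-means; returns one cluster index per point."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    assign = [0] * len(points)
    for _ in range(iters):
        assign = [
            min(range(k), key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
            for p in points
        ]
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:  # keep the old center if a cluster is empty
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return assign

def softmax(z):
    m = max(z)
    e = [math.exp(x - m) for x in z]
    s = sum(e)
    return [x / s for x in e]

def train_epoch(data, W, V, labels, lr=0.05):
    """One SGD pass on cross-entropy against the pseudo-labels (updates W, V in place)."""
    for x, y in zip(data, labels):
        f = matvec(W, x)                      # features from the toy "network"
        p = softmax(matvec(V, f))             # class probabilities from the head
        g = [pi - (1.0 if c == y else 0.0) for c, pi in enumerate(p)]
        # Gradient with respect to the features, computed before V changes.
        gf = [sum(g[c] * V[c][j] for c in range(len(V))) for j in range(len(f))]
        for c in range(len(V)):
            for j in range(len(f)):
                V[c][j] -= lr * g[c] * f[j]
        for j in range(len(W)):
            for i in range(len(x)):
                W[j][i] -= lr * gf[j] * x[i]

def deepcluster_sketch(data, k=2, epochs=5, seed=0):
    rng = random.Random(seed)
    dim = len(data[0])
    W = [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in range(dim)]
    V = [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in range(k)]
    labels = []
    for _ in range(epochs):
        feats = [matvec(W, x) for x in data]  # step 1: compute current features
        labels = kmeans(feats, k, seed=seed)  # step 2: cluster them
        train_epoch(data, W, V, labels)       # step 3: train on the assignments
    return labels, W, V

data = [[-1.0, -1.1], [-0.9, -1.0], [-1.1, -0.9], [1.0, 1.1], [0.9, 1.0], [1.1, 0.9]]
labels, W, V = deepcluster_sketch(data)
print(labels)
```

The point of the sketch is the loop in `deepcluster_sketch`: no ground-truth labels are used anywhere, and the only training signal is the cluster index assigned in the previous step.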

Bag of Tricks for Efficient Text Classification

et al. 2017

This paper explores a simple and efficient baseline for text classification. Our experiments show that our fast text classifier fastText is often on par with deep learning classifiers in terms of accuracy, and many orders of magnitude faster for training and evaluation. We can train fastText on more than one billion words in less than ten minutes using a standard multicore CPU, and classify half a million sentences among 312K classes in less than a minute.

Emerging Properties in Self-Supervised Vision Transformers

et al. 2021

Discriminative clustering for image co-segmentation

2010

Bag of Tricks for Efficient Text Classification

Joulin, Grave, Bojanowski et al. 2016

Preprint

352

494

Unsupervised Joint Object Discovery and Segmentation in Internet Images

Rubinstein, Joulin, Kopf et al. 2013

346

482

We present a new unsupervised algorithm to discover and segment out common objects from large and diverse image collections. In contrast to previous co-segmentation methods, our algorithm performs well even in the presence of significant amounts of noise images (images not containing a common object), as is typical for datasets collected from Internet search. The key insight of our algorithm is that common object patterns should be salient within each image, while being sparse with respect to smooth transformations across images. We propose to use dense correspondences between images to capture the sparsity and visual variability of the common object over the entire database, which enables us to ignore noise objects that may be salient within their own images but do not commonly occur in others. We performed extensive numerical evaluation on established co-segmentation datasets, as well as several new datasets generated using Internet search. Our approach is able to effectively segment out the common object for diverse object categories, while naturally identifying images where the common object is not present.

Enriching Word Vectors with Subword Information

Bojanowski, Grave, Joulin et al. 2016

Preprint

333

468

