2021
DOI: 10.1145/3434237
|View full text |Cite
|
Sign up to set email alerts
|

A Comprehensive Survey on Word Representation Models: From Classical to State-of-the-Art Word Representation Language Models

Abstract: Word representation has always been an important research area in the history of natural language processing (NLP). Understanding such complex text data is imperative, given that it is rich in information and can be used widely across various applications. In this survey, we explore different word representation models and its power of expression, from the classical to modern-day state-of-the-art word representation language models (LMS). We describe a variety of text representation methods, and model designs … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
39
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 83 publications
(40 citation statements)
references
References 95 publications
0
39
0
1
Order By: Relevance
“…Natural language processing (NLP) has flourished over the past couple of years as a result of advancements in deep learning [263]. One important component of NLP is building language models [252]. Today, deep neural networks using transformers [325] are utilized to construct language models [111,131] for a variety of tasks.…”
Section: Natural Language Modelingmentioning
confidence: 99%
“…Natural language processing (NLP) has flourished over the past couple of years as a result of advancements in deep learning [263]. One important component of NLP is building language models [252]. Today, deep neural networks using transformers [325] are utilized to construct language models [111,131] for a variety of tasks.…”
Section: Natural Language Modelingmentioning
confidence: 99%
“…For addressing this problem, many works have used feature selection techniques [25,26] applying various machine learning approaches. Of the various machine learning classification methods used to classify users' sentiments from a text, decision tree, LDA, Naive Bayes, Support Vector Machine (SVM), and artificial neural networks are the most common and have achieved a higher performance [9,22,27,28]. However, these methods need massive training data and are often slow.…”
Section: Sentiment Analysismentioning
confidence: 99%
“…In most natural language processing applications, words are used as features. The most popular word vector representations are distributed representation and one-hot representation [27,47]. However, the one-hot representation has various problems, such as the too-large vector dimension, the sparsity of the word vector, and ignoring the word semantic association.…”
Section: Embedding (Word Representation)mentioning
confidence: 99%
“…The authors of [5] conducted a rich survey of language representation models, including many models from 2020 and 2021. They additionally provided useful insights about pre-processing, classification techniques, evaluation metrics, and applications.…”
Section: Related Workmentioning
confidence: 99%