2019
DOI: 10.48550/arxiv.1901.10125
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Glyce: Glyph-vectors for Chinese Character Representations

Abstract: It is intuitive that NLP tasks for logographic languages like Chinese should benefit from the use of the glyph information in those languages. However, due to the lack of rich pictographic evidence in glyphs and the weak generalization ability of standard computer vision models on character data, an effective way to utilize the glyph information remains to be found. In this paper, we address this gap by presenting Glyce, the glyph-vectors for Chinese character representations. We make three major innovations: … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(10 citation statements)
references
References 53 publications
0
8
0
Order By: Relevance
“…Experiment results show that ERNIE 3.0 also outperforms the current SoTA system by a great margin. , SKEP [80], RoBERTa-wwm-ext-large [81] (marked as RoBERTa*), ALBERT [82], MacBERT [83], Zen 2.0 [84], Glyce [85] and crossed BERT siamese BiGRU [86] (marked as BERT_BiGRU*).…”
Section: Fine-tuning On Natural Language Understanding Tasksmentioning
confidence: 99%
“…Experiment results show that ERNIE 3.0 also outperforms the current SoTA system by a great margin. , SKEP [80], RoBERTa-wwm-ext-large [81] (marked as RoBERTa*), ALBERT [82], MacBERT [83], Zen 2.0 [84], Glyce [85] and crossed BERT siamese BiGRU [86] (marked as BERT_BiGRU*).…”
Section: Fine-tuning On Natural Language Understanding Tasksmentioning
confidence: 99%
“…In addition, some researchers extract the glyph features of Chinese characters from their graphic aspects. Meng Chinese characters as images and used CNNs to obtain their representations [27]. FGN was proposed by Xuan et al On the one hand, a new CNN structure was proposed, called CGS-CNN.…”
Section: Related Workmentioning
confidence: 99%
“…Compared with recent research, we pay more attention to the radical meaning and the frame structure of Chinese characters. The Glyce-Bert encoder [19] which utilized a Tianzige-CNN structure is adopted as the visual information encoder. To some extent, it fits the origin of Chinese characters better than other methods, such as stroke sequence [8,17] or object detection [12].…”
Section: Embedding Layermentioning
confidence: 99%
“…where MLP is a linear layer, ℎ 𝑖 ∈ R 𝑑 𝑡 and 𝑑 𝑡 is the output dimension of the Transformer encoder. Following Meng et al [19], we combine the loss of token classification task and glyph classification task as the final training objective. The training objective L is given as follows:…”
Section: Outputmentioning
confidence: 99%
See 1 more Smart Citation