2022
DOI: 10.21203/rs.3.rs-2299197/v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A New Evaluation Method: Evaluation Data and Metrics for Chinese Grammatical Error Correction

Abstract: As a fundamental task in natural language processing (NLP), Chinese Grammatical Error Correction (CGEC) [1–3] has gradually received widespread attention and become a research hotspot. However, one obvious deficiency of the existing CGEC evaluation systems is that the evaluation values of the same error correction models are signif- icantly influenced by the Chinese word segmentation (CWS) results or different language models. However, it is expected that these met- rics should be independent of the CWS result… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 8 publications
(11 reference statements)
0
1
0
Order By: Relevance
“…Wickstrøm et al [42] proposed a contrastive learning framework that enabled transfer learning clinical time series by exploiting a data augmentation scheme in which new samples were generated by mixing two data samples with a mixing component. For the task of Chinese spell-checking, Lin et al [43] proposed reverse contrastive learning which explicitly forced the model to minimize the distance in language representation space between similar sample pairs. In the context of MDD, we can anchor the transcription in order to generate the dissimilarity/similarity.…”
Section: Previous Methods For Mddmentioning
confidence: 99%
“…Wickstrøm et al [42] proposed a contrastive learning framework that enabled transfer learning clinical time series by exploiting a data augmentation scheme in which new samples were generated by mixing two data samples with a mixing component. For the task of Chinese spell-checking, Lin et al [43] proposed reverse contrastive learning which explicitly forced the model to minimize the distance in language representation space between similar sample pairs. In the context of MDD, we can anchor the transcription in order to generate the dissimilarity/similarity.…”
Section: Previous Methods For Mddmentioning
confidence: 99%
“…The synthetic data was used to train the seq2seq Transformer model and scored as first on the GEC track and second on the GEC+Fluency track (where, which shared task, the name, and date are required). Lin et al (2023) introduced a synthetic training data based on confusion sets for the Philippines Tagalog GEC system and gained competitive performance on the Tagalog corpus. In Arabic GEC, Solyman et al (2021) introduces an unsupervised method to construct a large confusion sets-based synthetic data, which was used to train the SCUT-AGEC model.…”
Section: Related Workmentioning
confidence: 99%
“…The length, width, surface, amenities, accessibility and surroundings of a greenway can affect the perception of access [55][56][57]. Factors such as sky openness, green visibility, visual complexity, skyline complexity and plant color richness can affect the perception of trail vision [58,59]. Essential green space features also include the presence of amenities [60] and aesthetic qualities [61].…”
Section: Linking Escape Motivation Trail Quality Perception and Resto...mentioning
confidence: 99%