2022
DOI: 10.1371/journal.pone.0270154
|View full text |Cite
|
Sign up to set email alerts
|

A hybrid Chinese word segmentation model for quality management-related texts based on transfer learning

Abstract: Text information mining is a key step to data-driven automatic/semi-automatic quality management (QM). For Chinese texts, a word segmentation algorithm is necessary for pre-processing since there are no explicit marks to define word boundaries. Because of intrinsic characteristics of QM-related texts, word segmentation algorithms for normal Chinese texts cannot be directly applied. Hence, based on the analysis of QM-related texts, we summarized six features, and proposed a hybrid Chinese word segmentation mode… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 40 publications
(45 reference statements)
0
0
0
Order By: Relevance
“…Hence, this paper adopts a manual approach to screen official websites to collect the After acquiring the original corpus, it is of great importance to firstly conduct data processing, i.e. text pre-processing, which includes word cut, removing stopwords, information cleaning and merging, etc., before carrying out the mining of the intrinsic information of the text [11]. In this paper, we employ word frequency analysis, keyword extraction, and LDA topic modelling to mine the intrinsic information of the text, extract key indicator variables, and refer to relevant references to construct a rather rigorous review system for pharmaceutical policies.…”
Section: Research Frameworkmentioning
confidence: 99%
“…Hence, this paper adopts a manual approach to screen official websites to collect the After acquiring the original corpus, it is of great importance to firstly conduct data processing, i.e. text pre-processing, which includes word cut, removing stopwords, information cleaning and merging, etc., before carrying out the mining of the intrinsic information of the text [11]. In this paper, we employ word frequency analysis, keyword extraction, and LDA topic modelling to mine the intrinsic information of the text, extract key indicator variables, and refer to relevant references to construct a rather rigorous review system for pharmaceutical policies.…”
Section: Research Frameworkmentioning
confidence: 99%