“…The former is mainly based on word‐based rule matching for a given constructed dictionary, such as positive maximum matching rules, reverse maximum matching rules (Luo et al., 2018) and bidirectional matching rules (Huang et al., 2015; Yunita et al., 2010). The latter is trained on annotated Chinese text to obtain different models: Hidden Markov models (HMMs) and Conditional Random Fields (CRFs), statistical machine learning models (Du et al., 2018; Huang et al., 2017; Liang et al., 2019; Y. Liu et al., 2014; Zhang & Li, 2016), deep learning models (Xu & Sun, 2016; Zhao et al., 2020), etc. Based on the trained model, the text of the unknown label is segmented.…”