Uyghur is one of the most populous and civilized groups with Turkic ethnicity and mainly located Xinjiang Uyghur Autonomous Region of China. Uyghur language belongs to the Karluk branch of the Turkic language family in Altaic language system, and holds agglutinative characteristics in morphological structure. Named Entity Recognition (NER) is an Information Extraction task that has become an essential part of Natural Language Processing (NLP) tasks, such as Machine Translation and Information Retrieval. In this paper, as a subtask of NER, the importance of Uyghur Named Entity Recognition (UPNR) task is demonstrated, the main characteristics of the Uyghur language are highlighted, and the aspects of standardization in annotating named entities are illustrated. Moreover, the approaches used in Uyghur NPNR field are explained and the features of common tools used in Uyghur NPNR are described. A brief review of the state of the art of Uyghur NPNR research is discussed, too. Finally, we present our conclusions. Throughout the presentation, illustrative examples are used for clarification.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.