Instruction Tuning on Large Language Models is an essential process for model to function well and achieve high performance in specific tasks. Accordingly, in mainstream languages such as English, instruction-based datasets are being constructed and made publicly available. In the case of Korean, publicly available models and datasets all rely on using the output of ChatGPT or translating datasets built in English. In this paper, We introduce KIT-19 as an instruction dataset for the development of LLM in Korean. KIT-19 is a dataset created in an instruction format, comprising 19 existing open-source datasets for Korean NLP tasks. In this paper, we train a Korean Pretrained LLM using KIT-19 to demonstrate its effectiveness. The experimental results show that the model trained on KIT-19 significantly outperforms existing Korean LLMs. Based on the its quality and empirical results, this paper proposes that KIT-19 has the potential to make a substantial contribution to the future improvement of Korean LLMs' performance.
Four species of Laboulbeniales collected from Java Island, Indonesia between August and September in 2006 are described. These species, which belong to the genus Coreomyces and were found on the family Corixidae of the order Hemiptera, were as follows; Coreomycus corixae Thaxter, Coreomyces micronectae Thaxter and Coreomyces orientalis Thaxter, which were found on Micronecta sedula Horvath, and Coreomyces recurvatus Thaxter, which was found on Micronecta sedula Horvath and Xenocorixa sp. C. corixae Thaxter and C. orientalis Thaxter were originally found on Micronecta, whereas C. recurvatus Thaxter was originally found on Xenocorixa. All species described herein are new to Java Island. The specimens were deposited in the Biological Herbarium, Division of Science Education, College of Education, Chosun University.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.