Henry Lucky scite author profile

The Sundanese language has over 32 million speakers worldwide, but the language has reaped little to no benefits from the recent advances in natural language understanding. Like other low-resource languages, the only alternative is to fine-tune existing multilingual models. In this paper, we pre-trained three monolingual Transformer-based language models on Sundanese data. When evaluated on a downstream text classification task, we found that most of our monolingual models outperformed larger multilingual models despite the smaller overall pre-training data. In the subsequent analyses, our models benefited strongly from the Sundanese pre-training corpus size and do not exhibit socially biased behavior. We released our models for other researchers and practitioners to use.

show abstract

Using Regression to Predict Number of Tourism in Indonesia based of Global COVID-19 Cases

Brilliandy

Lucky

Hartanto

et al. 2022

View full text Add to dashboard Cite

Towards a more general drug target interaction prediction model using transfer learning

Suhartono

Majiid

Handoyo

et al. 2023

Procedia Computer Science

View full text Add to dashboard Cite

Chatbot Application to Automate Services in FnB Business Using Seq2Seq LSTM

Wang

Putera

Lucky

et al. 2022

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Henry Lucky

Towards Classification of Personality Prediction Model: A Combination of BERT Word Embedding and MLSMOTE

Pre-trained transformer-based language models for Sundanese

Using Regression to Predict Number of Tourism in Indonesia based of Global COVID-19 Cases

Towards a more general drug target interaction prediction model using transfer learning

Chatbot Application to Automate Services in FnB Business Using Seq2Seq LSTM

Contact Info

Product

Resources

About