Haryo Akbarianto Wibowo scite author profile

This paper describes Kata.ai's submission for the Social Media Mining for Health (SMM4H) 2021 shared task. We participated in three tasks: classifying adverse drug effect, COVID-19 self-report, and COVID-19 symptoms. Our system is based on BERT model pre-trained on the domain-specific text. In addition, we perform data cleaning and augmentation, as well as hyperparameter optimization and model ensemble to further boost the BERT performance. We achieved the first rank in both classifying adverse drug effects and COVID-19 selfreport tasks.

show abstract

IndoCollex: A Testbed for Morphological Transformation of Indonesian Word Colloquialism

Wibowo¹,

Nityasya²,

Akyürek³

et al. 2021

View full text Add to dashboard Cite

Indonesian language is heavily riddled with colloquialism whether in written or spoken forms. In this paper, we identify a class of Indonesian colloquial words that have undergone morphological transformations from their standard forms, categorize their word formations, and propose a benchmark dataset of Indonesian Colloquial Lexicons (IndoCollex) consisting of informal words on Twitter expertly annotated with their standard forms and their word formation types/tags. We evaluate several models for character-level transduction to perform morphological word normalization on this testbed to understand their failure cases and provide baselines for future work. As IndoCollex catalogues word formation phenomena that are also present in the non-standard text of other languages, it can also provide an attractive testbed for methods tailored for cross-lingual word normalization and non-standard word formation.

show abstract

Costs to Consider in Adopting NLP for Your Business

Nityasya¹,

Wibowo²,

Prasojo³

et al. 2020

Preprint

View full text Add to dashboard Cite

Semi-Supervised Low-Resource Style Transfer of Indonesian Informal to Formal Language with Iterative Forward-Translation

Wibowo¹,

Prawiro²,

Ihsan³

et al. 2020

Preprint

View full text Add to dashboard Cite

Which Student is Best? A Comprehensive Knowledge Distillation Exam for Task-Specific BERT Models

Nityasya¹,

Wibowo²,

Chevi³

et al. 2022

Preprint

View full text Add to dashboard Cite

On “Scientific Debt” in NLP: A Case for More Rigour in Language Model Pre-Training Research

Nityasya¹,

Wibowo²,

Aji³

et al. 2023

View full text Add to dashboard Cite

Towards Product Attributes Extraction in Indonesian e-Commerce Platform

Rif’at¹,

Mahendra²,

Budi³

et al. 2018

CyS

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.