Salomon Kabongo scite author profile

Salomon Kabongo

5 Publications

95 Citation Statements Received

115 Citation Statements Given

How they've been cited: 119

How they cite others: 115

Affiliations

L3S Research Center, African Institute for Mathematical Sciences Ghana

Publications


Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages

Nekoto, Marivate, Matsila et al. 2020


Research in NLP lacks geographic diversity, and the question of how NLP can be scaled to low-resourced languages has not yet been adequately solved. "Low-resourced"-ness is a complex problem going beyond data availability and reflects systemic problems in society. As MT researchers cannot solve the problem of low-resourcedness alone, we propose participatory research as a means to involve all necessary agents required in the MT development process. We demonstrate the feasibility and scalability of participatory research with a case study on MT for African languages. Its implementation leads to a collection of novel translation datasets, MT benchmarks for over 30 languages, with human evaluations for a third of them, and enables participants without formal training to make a unique scientific contribution. Benchmarks, models, data, code, and evaluation results are released at https://github.com/masakhane-io/masakhane-mt.


Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages

Nekoto, Marivate, Matsila et al. 2020

Preprint


Automated Mining of Leaderboards for Empirical AI Research

Kabongo, D’Souza, Auer 2021


We present a large-scale empirical investigation of the zero-shot learning phenomena in a specific recognizing textual entailment (RTE) task category, i.e. the automated mining of leaderboards for Empirical AI Research. The prior reported state-of-the-art models for leaderboard extraction, formulated as an RTE task in a non-zero-shot setting, are promising, with reported performances above 90%. However, a central research question remains unexamined: did the models actually learn entailment? Thus, for the experiments in this paper, two prior reported state-of-the-art models are tested out-of-the-box for their ability to generalize, or their capacity for entailment, given leaderboard labels that were unseen during training. We hypothesize that if the models learned entailment, their zero-shot performances can be expected to be moderately high as well, or, concretely, better than chance. As a result of this work, a zero-shot labeled dataset is created via distant labeling, formulating the leaderboard extraction RTE task.


Masakhane -- Machine Translation For Africa

Orife, Kreutzer, Whitenack et al. 2020

Preprint


Automated Mining of Leaderboards for Empirical AI Research

Kabongo, D’Souza, Auer 2021

Preprint


With the rapid growth of research publications, empowering scientists to keep track of scientific progress is of paramount importance. In this regard, the Leaderboards facet of information organization provides an overview of the state of the art by aggregating empirical results from various studies addressing the same research challenge. Crowdsourcing efforts such as PapersWithCode, among others, are devoted to the construction of Leaderboards, predominantly for various subdomains in Artificial Intelligence. Leaderboards provide machine-readable scholarly knowledge that has proven to be directly useful for scientists to keep track of research progress. The construction of Leaderboards could be greatly expedited with automated text mining. This study presents a comprehensive approach for generating Leaderboards for knowledge-graph-based scholarly information organization. Specifically, we investigate the problem of automated Leaderboard construction using state-of-the-art transformer models, viz. BERT, SciBERT, and XLNet. Our analysis reveals an optimal approach that significantly outperforms existing baselines for the task, with evaluation scores above 90% in F1. This, in turn, offers new state-of-the-art results for Leaderboard extraction. As a result, a vast share of empirical AI research can be organized in next-generation digital libraries as knowledge graphs.


