Building Text and Speech Benchmark Datasets and Models for Low‐Resourced East African Languages: Experiences and Lessons

Nakatumba‐Nabende, Joyce; Babirye, Claire; Nabende, Peter; Tusubira, Jeremy Francis; Mukiibi, Jonathan; Wairagala, Eric Peter; Mutebi, Chodrine; Bateesa, Tobius Saul; Nahabwe, Alvin; Tusiime, Hewitt; Katumba, Andrew

doi:10.1002/ail2.92

Applied AI Letters

2024

DOI: 10.1002/ail2.92

|View full text |Cite

Building Text and Speech Benchmark Datasets and Models for Low‐Resourced East African Languages: Experiences and Lessons

Joyce Nakatumba‐Nabende,

Claire Babirye,

Peter Nabende

et al.

Abstract: Africa has over 2000 languages; however, those languages are not well represented in the existing natural language processing ecosystem. African languages lack essential digital resources to effectively engage in advancing language technologies. There is a need to generate high‐quality natural language processing resources for low‐resourced African languages. Obtaining high‐quality speech and text data is expensive and tedious because it can involve manual sourcing and verification of data sources. This paper … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2024

Publication Types

Select...

Article2

Relationship

Self Cite0

Independent2

Authors

Journals

Cited by 2 publications

References 51 publications

(69 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

Developing and Deploying End‐to‐End Machine Learning Systems for Social Impact: A Rubric and Practical Artificial Intelligence Case Studies From African Contexts

Bainomugisha,

Nakatumba‐Nabende

2024

Applied AI Letters

View full text Add to dashboard Cite

Artificial intelligence (AI) and machine learning have demonstrated the potential to provide solutions to societal challenges, for example, automated crop diagnostics for smallholder farmers, environmental pollution modelling and prediction for cities and machine translation systems for languages that enable information access and communication for segments of the population who are unable to speak or write official languages, among others. Despite the potential of AI, the practical and technical issues related to its development and deployment in the African context are the least documented and understood. The development and deployment of AI for social impact systems in the developing world present new intricacies and requirements emanating from the unique technology and social ecosystems in these settings. This paper provides a rubric for developing and deploying AI systems for social impact with a focus on the African context. The rubric is derived from the analysis of a series of selected real‐world case studies of AI applications in Africa. We assessed the selected AI case studies against the proposed rubric. The rubric and examples of AI applications presented in this paper are expected to contribute to the development and application of AI systems in other African contexts.

show abstract

Developing and Deploying End‐to‐End Machine Learning Systems for Social Impact: A Rubric and Practical Artificial Intelligence Case Studies From African Contexts

Bainomugisha,

Nakatumba‐Nabende

2024

Applied AI Letters

View full text Add to dashboard Cite

show abstract

Machine Learning Analysis of Radio Data to Uncover Community Perceptions on the Ebola Outbreak in Uganda

Nakatumba-Nabende,

Mukiibi,

Bateesa

et al. 2024

ACM J. Comput. Sustain. Soc.

View full text Add to dashboard Cite

Radio is vital for people, especially in rural areas, to share their concerns through interactive talk shows. Understanding public perceptions of pandemics is crucial because they influence people’s attitudes and health-seeking behaviours. This study used machine learning to analyze English and Luganda radio broadcast data to understand public perceptions and perspectives on the Ebola outbreak in Uganda. Our findings revealed three main speaker categories: media personalities, community guests and listeners, and government officials. The government made the most significant effort to educate the public about the Ebola outbreak. The analysis showed that the community was hesitant to use Ebola vaccines, believing that they had not been tested on other populations where the Ebola virus had originated. The community was also concerned about the effects of the lockdown measures imposed during the COVID-19 pandemic. The analysis of the radio broadcast data revealed differences in the timing and content of the conversations between male and female speakers. These experiences can inform population-specific policies for handling ongoing and future pandemics.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Building Text and Speech Benchmark Datasets and Models for Low‐Resourced East African Languages: Experiences and Lessons

Cited by 2 publications

References 51 publications

Developing and Deploying End‐to‐End Machine Learning Systems for Social Impact: A Rubric and Practical Artificial Intelligence Case Studies From African Contexts

Developing and Deploying End‐to‐End Machine Learning Systems for Social Impact: A Rubric and Practical Artificial Intelligence Case Studies From African Contexts

Machine Learning Analysis of Radio Data to Uncover Community Perceptions on the Ebola Outbreak in Uganda

Contact Info

Product

Resources

About