Alp Öktem scite author profile

et al. 2020

Research in NLP lacks geographic diversity, and the question of how NLP can be scaled to low-resourced languages has not yet been adequately solved. "Lowresourced"-ness is a complex problem going beyond data availability and reflects systemic problems in society. * ∀ to represent the whole Masakhane community.As MT researchers cannot solve the problem of low-resourcedness alone, we propose participatory research as a means to involve all necessary agents required in the MT development process. We demonstrate the feasibility and scalability of participatory research with a case study on MT for African languages. Its implementation leads to a collection of novel translation datasets, MT benchmarks for over 30 languages, with human evaluations for a third of them, and enables participants without formal training to make a unique scientific contribution. Benchmarks, models, data, code, and evaluation results are released at https://github. com/masakhane-io/masakhane-mt.

show abstract

Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages

Nekoto¹,

Marivate²,

Matsila³

et al. 2020

Preprint

View full text Add to dashboard Cite

Research in NLP lacks geographic diversity, and the question of how NLP can be scaled to low-resourced languages has not yet been adequately solved. "Lowresourced"-ness is a complex problem going beyond data availability and reflects systemic problems in society. * ∀ to represent the whole Masakhane community.As MT researchers cannot solve the problem of low-resourcedness alone, we propose participatory research as a means to involve all necessary agents required in the MT development process. We demonstrate the feasibility and scalability of participatory research with a case study on MT for African languages. Its implementation leads to a collection of novel translation datasets, MT benchmarks for over 30 languages, with human evaluations for a third of them, and enables participants without formal training to make a unique scientific contribution. Benchmarks, models, data, code, and evaluation results are released at https://github. com/masakhane-io/masakhane-mt.

show abstract

Prosodic Phrase Alignment for Machine Dubbing

Öktem¹,

Farrús²,

Bonafonte³

2019

View full text Add to dashboard Cite

Dubbing is a type of audiovisual translation where dialogues are translated and enacted so that they give the impression that the media is in the target language. It requires a careful alignment of dubbed recordings with the lip movements of performers in order to achieve visual coherence. In this paper, we deal with the specific problem of prosodic phrase synchronization within the framework of machine dubbing. Our methodology exploits the attention mechanism output in neural machine translation to find plausible phrasing for the translated dialogue lines and then uses them to condition their synthesis. Our initial work in this field records comparable speech rate ratio to professional dubbing translation, and improvement in terms of lip-syncing of long dialogue lines.

show abstract

Masakhane -- Machine Translation For Africa

Orife¹,

Kreutzer²,

Whitenack³

et al. 2020

Preprint

View full text Add to dashboard Cite

Prosodic Phrase Alignment for Machine Dubbing

Öktem¹,

Farrús²,

Bonafonte³

2019

Preprint

View full text Add to dashboard Cite

TICO-19: the Translation Initiative for COvid-19

Anastasopoulos¹,

Cattelan²,

Dou³

et al. 2020

View full text Add to dashboard Cite

The COVID-19 pandemic is the worst pandemic to strike the world in over a century. Crucial to stemming the tide of the SARS-CoV-2 virus is communicating to vulnerable populations the means by which they can protect themselves. To this end, the collaborators forming the Translation Initiative for COvid-19 (TICO-19) 1 have made test and development data available to AI and MT researchers in 35 different languages in order to foster the development of tools and resources for improving access to information about COVID-19 in these languages. In addition to 9 highresourced, "pivot" languages, the team is targeting 26 lesser resourced languages, in particular languages of Africa, South Asia and South-East Asia, whose populations may be the most vulnerable to the spread of the virus. The same data is translated into all of the languages represented, meaning that testing or development can be done for any pairing of languages in the set. Further, the team is converting the test and development data into translation memories (TMXs) that can be used by localizers from and to any of the languages. 2

show abstract

Attentional Parallel RNNs for Generating Punctuation in Transcribed Speech

Öktem

¹

,

Farrús

²

,

Wanner

³

2017

View full text Add to dashboard Cite

Abstract. Until very recently, the generation of punctuation marks for automatic speech recognition (ASR) output has been mostly done by looking at the syntactic structure of the recognized utterances. Prosodic cues such as breaks, speech rate, pitch intonation that influence placing of punctuation marks on speech transcripts have been seldom used. We propose a method that uses recurrent neural networks, taking prosodic and lexical information into account in order to predict punctuation marks for raw ASR output. Our experiments show that an attention mechanism over parallel sequences of prosodic cues aligned with transcribed speech improves accuracy of punctuation generation.

show abstract

TICO-19: the Translation Initiative for Covid-19

Anastasopoulos¹,

Cignetti²,

Dou³

et al. 2020

Preprint

View full text Add to dashboard Cite

The COVID-19 pandemic is the worst pandemic to strike the world in over a century. Crucial to stemming the tide of the SARS-CoV-2 virus is communicating to vulnerable populations the means by which they can protect themselves. To this end, the collaborators forming the Translation Initiative for COvid-19 (TICO-19) 1 have made test and development data available to AI and MT researchers in 35 different languages in order to foster the development of tools and resources for improving access to information about COVID-19 in these languages. In addition to 9 highresourced, "pivot" languages, the team is targeting 26 lesser resourced languages, in particular languages of Africa, South Asia and South-East Asia, whose populations may be the most vulnerable to the spread of the virus. The same data is translated into all of the languages represented, meaning that testing or development can be done for any pairing of languages in the set. Further, the team is converting the test and development data into translation memories (TMXs) that can be used by localizers from and to any of the languages. 223

show abstract

Alp Öktem

Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages

Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages

Prosodic Phrase Alignment for Machine Dubbing

Masakhane -- Machine Translation For Africa

Prosodic Phrase Alignment for Machine Dubbing

TICO-19: the Translation Initiative for COvid-19

Attentional Parallel RNNs for Generating Punctuation in Transcribed Speech

TICO-19: the Translation Initiative for Covid-19

Contact Info

Product

Resources

About