Françoise Bacquelaine scite author profile

Malgré les progrès de la traduction automatique neuronale, l'intelligence artificielle ne permet toujours pas à la machine de comprendre pour déjouer tous les pièges de la traduction, notamment ceux de l'ambiguïté lexicale, phraséologique, syntaxique et sémantique (Koehn 2020). Deux structures portugaises moyennement figées présentent les caractéristiques des « unités de construction préformées » (UCP) décrites par Schmale (2013). Elles relèvent donc de la phraséologie au sens large et doivent être traduites en bloc. Les principaux défis de traduction en bloc que lancent ces UCP binaires à la machine résultent, d'une part, de variables simples ou complexes, et, d'autre part, des propriétés syntaxiques de scission et d'inversion des éléments sur l'axe syntagmatique. Un échantillon de 168 occurrences de ces UCP en contexte phrastique a été prélevé sur un corpus journalistique portugais. Cet échantillon a été traduit en français par DeepL et Google Translate en 2019 et en 2021. Les traductions automatiques brutes ont été confrontées à un modèle de biotraduction établi à partir de corpus parallèles ou alignés portugais-français et analysées en fonction de deux critères généraux (non-littéralité et acceptabilité) et de quelques défis spécifiques à chaque UCP. Cette analyse permet d'évaluer l'évolution de ces deux systèmes de traduction automatique face à l'ambiguïté phraséologique et d'en tirer des conclusions quant à la possibilité d'extinction de la biotraduction et aux implications de ces outils performants sur la formation des futurs prestataires de services linguistiques. Mots-clefs traduction automatique neuronale ; post-édition ; levée d'ambiguïté ; unité de construction préformée ; portugais ; français

show abstract

O Choro do homem branco (1983) Prefácio (2002)

Bruckner¹,

Bacquelaine²

2022

View full text Add to dashboard Cite

Corpógrafo, Terminologie, Phraséologie

Bacquelaine¹

2015

OSLa

View full text Add to dashboard Cite

The Corpógrafo results from interdisciplinary collaboration between linguists andcomputer engineers under Belinda Maia’s direction. This user-friendly tool for building and using tailor-made corpora allows not only for terminology extractionand management, but also for any research based on monolingual, comparable or parallel corpora. This paper presents the Corpógrafo’s evolution from the first to the fourth version, and two experiences of its use in three languages (English, French and Portuguese). The first experience is in the field of Bluetooth technology terminology extraction and management. The second deals with four Portuguese structures containing the universal quantifier 'cada' and expressing progression, «dropper», proportion between two sets of events or entities and proportion between a set and a subset of events or entities. These experiences show the strengths, weaknesses and limits of the Corpógrafo.

show abstract

Terminologie néologique de la technologie Bluetooth (1999-2007)

Bacquelaine¹

2017

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.