Traitement de l'ambiguïté syntaxique et sémantique en ta neuronale: analyse de la traduction de l'anglais vers le français, l'espagnol et l'italien

Other Title(s)

Syntactic and semantic ambiguity processing in neural MT from English to French, Spanish and Italian

Author

Maniez, Francois

Source

Revue traduction et langues

Issue

Vol. 21, Issue 1 (31 Aug. 2022), pp.10-27, 18 p.

Publisher

University of Oran 2 Mohamed Ben Ahmad

Publication Date

2022-06-30

Country of Publication

Algeria

No. of Pages

18

Main Subjects

Languages

Topics

Abstract EN

Despite recent advances in artificial intelligence, human translators outperform MT for at least three types of tasks: identifying referents in anaphora (especially of the interphrastic kind), resolving semantic ambiguity (which is mainly due to polysemy or homonymy), and resolving syntactic ambiguity (especially with poorly inflected source languages such as English).

using the results obtained by two freely available online machine translation programs, Google translate and Deepl, we examine how these two types of ambiguity are processed in translation from English into French, Spanish and Italian.

our results show that the two programs perform well overall in resolving the simplest cases of syntactic ambiguity, with difficulties arising more frequently for noun phrases featuring atypical syntactic divisions and rarely used collocations.

MT output for ambiguous structures involving verb roots followed by the-ing morpheme (flying planes, growing pains) is studied, as well as syntactic structures in which two or more nouns are preceded by one or more adjectives.

MT handles relatively well the longest of those structures (ADJ ADJ N N N N), probably because their subsets are part of the bilingual or target language monolingual corpora that underlie MT systems.

structures involving head modification and coordination (ADJ N and N) are also known to pose problems for MT and human translators alike.

but since many of the most frequent n and n structures involve cohyponyms (men and women, brothers and sisters), antonyms (rights and duties, costs and benefits) or near-synonyms (aid and advice), their translation as a whole unit generally triggers the choice of correct syntact dependencies in translation.

structures in which the adjective only modifies the first noun (fresh air and exercise, social sciences and humanities) are much less frequent and are also probably translated as a whole unit.

structures involving premodification, coordination and post-modification may give rise to four distinct types of structures depending on whether long-range dependencies apply (detailed [knowledge and understanding] of the it industry, [ethnic group] and [place of birth], invaluable [context and [source of information], [close friend] and confidant] of Mr Jones.

structures in which both long-range dependencies apply (integrated prevention and control of pollution) are the ones which most frequently cause errors for MT.

semantic ambiguity cases have been processed with increasing success by MT, especially when collocates vary widely for the main two meanings of homonyms (a well-known example is the word pen).

processing polysemy (for instance the medical use of conditions in pre-existing conditions) is a bit more of a challenge for MT.

other cases involving concentration of several polysemic terms in the same sentence (changing the placement of beams relative to the staff involves changing the direction of the stems in the beam) also create difficulties for MT when the polysemic terms are used without any of their usual collocates (here in the specialised field of musical edition).

homonymy cases involving grammatical category changes (N-to-V or V-to-N conversion) seem to continue to pose the most difficulties to neural MT, despite increasing consideration of intra-and extraphrastic context.

potentially ambiguous word sequences (treatment increase in as the daily dose and duration of treatment increase), which were processed incorrectly before neuronal MT, are now correctly translated.

but word sequences in which one word belongs to a part of speech which is not the most commonly used one (e.g.

the noun remains in what remains can be considered) may cause occasional errors.

several examples that involve the verb founder are studied, and they frequently trigger translation of the noun in all three romance languages (or translations of the verbs find or found due to incorrect segmentation).

Abstract FRE

Malgré les progrès récents de l’intelligence artificielle, les traducteurs humains obtiennent de meilleurs résultats que la TA pour au moins trois types de tâches: l’identification des référents dans les cas d’anaphore (notamment interphrastique), la résolution de l’ambiguïté sémantique (principalement due à la polysémie ou à l’homonymie) et celle de l’ambiguïté syntaxique (notamment dans le cas de langues sources à flexion limitée comme l’Anglais).

en nous appuyant sur les résultats obtenus par deux programmes de traduction automatique gratuitement accessibles en ligne, google translate et deepl, nous examinons la façon dont sont traités ces deux types d’ambiguïté dans la traduction de l’Anglais vers le Français, l’espagnol et l’italien.

nos résultats font apparaître une bonne performance globale des deux logiciels utilisés pour la résolution des cas d’ambiguïté syntaxique les plus simples, les difficultés se présentant plus fréquemment dans le cas de groupes nominaux aux découpages syntaxiques atypiques et contenant des collocations peu usitées.

les cas d’homonymie impliquant un changement de catégorie grammaticale (conversion) semblent être ceux qui continuent de poser le plus de difficultés à la TA neuronale, en dépit d’une prise en compte croissante du contexte intra- et extraphrastique.

American Psychological Association (APA)

Maniez, Francois. 2022. Traitement de l'ambiguïté syntaxique et sémantique en ta neuronale: analyse de la traduction de l'anglais vers le français, l'espagnol et l'italien. Revue traduction et langues،Vol. 21, no. 1, pp.10-27.
https://search.emarefa.net/detail/BIM-1442016

Modern Language Association (MLA)

Maniez, Francois. Traitement de l'ambiguïté syntaxique et sémantique en ta neuronale: analyse de la traduction de l'anglais vers le français, l'espagnol et l'italien. Revue traduction et langues Vol. 21, no. 1 (Aug. 2022), pp.10-27.
https://search.emarefa.net/detail/BIM-1442016

American Medical Association (AMA)

Maniez, Francois. Traitement de l'ambiguïté syntaxique et sémantique en ta neuronale: analyse de la traduction de l'anglais vers le français, l'espagnol et l'italien. Revue traduction et langues. 2022. Vol. 21, no. 1, pp.10-27.
https://search.emarefa.net/detail/BIM-1442016

Data Type

Journal Articles

Language

French

Notes

Includes appendix: p. 27

Record ID

BIM-1442016