Towards Example-Based NMT with Multi-Levenshtein Transformers - Machine Learning and Information Access Accéder directement au contenu
Communication Dans Un Congrès Année : 2023

Towards Example-Based NMT with Multi-Levenshtein Transformers

Vers la traduction à base d'exemples avec des transformers de Levenshtein à entrées multiples

Josep Crego
  • Fonction : Auteur
  • PersonId : 1136018
François Yvon

Résumé

Retrieval-Augmented Machine Translation (RAMT) is attracting growing attention. This is because RAMT not only improves translation metrics, but is also assumed to implement some form of domain adaptation. In this contribution, we study another salient trait of RAMT, its ability to make translation decisions more transparent by allowing users to go back to examples that contributed to these decisions. For this, we propose a novel architecture aiming to increase this transparency. This model adapts a retrieval-augmented version of the Levenshtein Transformer and makes it amenable to simultaneously edit multiple fuzzy matches found in memory. We discuss how to perform training and inference in this model, based on multiway alignment algorithms and imitation learning. Our experiments show that editing several examples positively impacts translation scores, notably increasing the number of target spans that are copied from existing instances.
Fichier principal
Vignette du fichier
2023.emnlp-main.113.pdf (803.35 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
licence : CC BY - Paternité

Dates et versions

hal-04332427 , version 1 (08-12-2023)

Licence

Paternité - Partage selon les Conditions Initiales

Identifiants

  • HAL Id : hal-04332427 , version 1

Citer

Maxime Bouthors, Josep Crego, François Yvon. Towards Example-Based NMT with Multi-Levenshtein Transformers. Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Dec 2023, Singapour, Singapore. pp.1830-1846. ⟨hal-04332427⟩
22 Consultations
42 Téléchargements

Partager

Gmail Facebook X LinkedIn More