Counterfactual Data Augmentation for Neural Machine Translation

Posted on 2 June 2021

June, 2021

Abstract

We propose a data augmentation method for neural machine translation. It works by interpreting language models and phrasal alignment causally. Specifically, it creates augmented parallel translation corpora by generating (path-specific) counterfactual aligned phrases. We generate these by first sampling new source phrases from a masked language model and then sampling an aligned counterfactual target phrase, which is possible because a translation language model can be interpreted as a Gumbel-Max Structural Causal Model (Oberst and Sontag, 2019). Compared to previous work, our method takes both context and alignment into account to maintain the symmetry between source and target sequences. Experiments on IWSLT’15 English → Vietnamese, WMT’17 English → German, WMT’18 English → Turkish, and WMT’19 robust English → French show that the method can improve translation performance, backtranslation, and translation robustness.
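The counterfactual sampling step mentioned in the abstract can be illustrated on a toy categorical distribution. The sketch below is not the authors' code. It assumes one common construction for Gumbel-Max models: sample Gumbel noise that is consistent with the observed outcome (top-down sampling with truncated Gumbels), then reuse that noise under a changed distribution. The function names, the three-item "vocabulary", and the probabilities are all hypothetical, and every probability is assumed to be strictly positive.

```python
import math
import random


def sample_gumbel(rng, loc=0.0):
    """Draw one sample from Gumbel(loc, 1) by inverting the CDF."""
    u = rng.random()
    while u == 0.0:  # avoid log(0); rng.random() is in [0, 1)
        u = rng.random()
    return loc - math.log(-math.log(u))


def posterior_gumbel_noise(log_p, observed, rng):
    """Sample noise g with argmax_i(log_p[i] + g[i]) == observed.

    Top-down construction: the maximum perturbed logit is
    Gumbel(logsumexp(log_p)); every other perturbed logit is a
    Gumbel(log_p[i]) truncated to lie below that maximum.
    """
    m = max(log_p)
    lse = m + math.log(sum(math.exp(lp - m) for lp in log_p))
    top = sample_gumbel(rng, lse)
    noise = []
    for i, lp in enumerate(log_p):
        if i == observed:
            perturbed = top
        else:
            g = sample_gumbel(rng, lp)
            # Truncate g so that the result stays below `top`.
            perturbed = -math.log(math.exp(-top) + math.exp(-g))
        noise.append(perturbed - lp)
    return noise


def counterfactual_sample(log_p, observed, log_q, rng):
    """Given `observed` drawn under log_p, return the outcome that the
    same (posterior) noise would have produced under log_q."""
    noise = posterior_gumbel_noise(log_p, observed, rng)
    scores = [lq + g for lq, g in zip(log_q, noise)]
    return max(range(len(scores)), key=scores.__getitem__)


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical target-phrase distribution before and after the
    # source phrase is replaced (toy numbers, assumed for illustration).
    log_p = [math.log(x) for x in (0.6, 0.3, 0.1)]
    log_q = [math.log(x) for x in (0.1, 0.3, 0.6)]
    print(counterfactual_sample(log_p, observed=0, log_q=log_q, rng=rng))
```

A useful sanity check for this kind of construction is that, when the distribution is left unchanged (`log_q` equal to `log_p`), the counterfactual outcome equals the observed one, because the sampled noise was built to reproduce that observation.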

Attachment:

Counterfactual Data Augmentation for Neural Machine Translation.pdf

Resource Type:

Academic Paper

Tags:

Neural Machine Translation

NMT

Counterfactual Data Augmentation

Counterfactual Logic

Gumbel-Max Structural Causal Model
