HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

The University of Sydney's Machine Translation System for WMT19

Liang Ding; Dacheng Tao

The University of Sydney's Machine Translation System for WMT19

Abstract

This paper describes the University of Sydney's submission of the WMT 2019 shared news translation task. We participated in the Finnish$\rightarrow$English direction and got the best BLEU(33.0) score among all the participants. Our system is based on the self-attentional Transformer networks, into which we integrated the most recent effective strategies from academic research (e.g., BPE, back translation, multi-features data selection, data augmentation, greedy model ensemble, reranking, ConMBR system combination, and post-processing). Furthermore, we propose a novel augmentation method $Cycle Translation$ and a data mixture strategy $Big$/$Small$ parallel construction to entirely exploit the synthetic corpus. Extensive experiments show that adding the above techniques can make continuous improvements of the BLEU scores, and the best result outperforms the baseline (Transformer ensemble model trained with the original parallel corpus) by approximately 5.3 BLEU score, achieving the state-of-the-art performance.

Benchmarks

BenchmarkMethodologyMetrics
machine-translation-on-wmt-2018-finnishCT+B/S construction
BLEU: 26.5
machine-translation-on-wmt2016-finnishCT+B/S construction
BLEU: 32.4
machine-translation-on-wmt2017-finnishCT+B/S construction
BLEU: 35.5
machine-translation-on-wmt2019-finnishCT+B/S construction
BLEU: 34.1

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
The University of Sydney's Machine Translation System for WMT19 | Papers | HyperAI