HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

TNT-NLG, System 1: Using a statistical NLG to massively augment crowd-sourced data for neural generation

{Marilyn A. Walker Stephanie Lukin Shubhangi Tandon Shereen Oraby Lena Reed}

Abstract

Ever since the successful application of sequence to sequence learning for neural machine translation systems (Sutskever et al., 2014), interest has surged in its applicability towards language generation in other problem domains. In the area of natural language generation (NLG), there has been a great deal of interest in end-to-end (E2E) neural models that learn and generate natural language sentence realizations in one step. In this paper, we present TNT-NLG System 1, our first system submission to the E2E NLG Challenge, where we generate natural language (NL) realizations from meaning representations (MRs) in the restaurant domain by massively expanding the training dataset. We develop two models for this system, based on Dusek et al.’s (2016a) open source baseline model and context-aware neural language generator. Starting with the MR and NL pairs from the E2E generation challenge dataset, we explode the size of the training set using PERSONAGE (Mairesse and Walker, 2010), a statistical generator able to produce varied realizations from MRs, and use our expanded data as contextual input into our models. We present evaluation results using automated and human evaluation metrics, and describe directions for future work.

Benchmarks

BenchmarkMethodologyMetrics
data-to-text-generation-on-e2e-nlg-challengeSys1-Primary
BLEU: 65.61
CIDEr: 2.2183
METEOR: 45.17
NIST: 8.5105
ROUGE-L: 68.39

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
TNT-NLG, System 1: Using a statistical NLG to massively augment crowd-sourced data for neural generation | Papers | HyperAI