HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

FactSpotter: Evaluating the Factual Faithfulness of Graph-to-Text Generation

{Ioana Manolescu Oana Balalau Kun Zhang}

FactSpotter: Evaluating the Factual Faithfulness of Graph-to-Text Generation

Abstract

Graph-to-text (G2T) generation takes a graph as input and aims to generate a fluent and faith- ful textual representation of the information in the graph. The task has many applications, such as dialogue generation and question an- swering. In this work, we investigate to what extent the G2T generation problem is solved for previously studied datasets, and how pro- posed metrics perform when comparing generated texts. To help address their limitations, we propose a new metric that correctly identifies factual faithfulness, i.e., given a triple (subject, predicate, object), it decides if the triple is present in a generated text. We show that our metric FactSpotter achieves the highest correlation with human annotations on data correct- ness, data coverage, and relevance. In addition, FactSpotter can be used as a plug-in feature to improve the factual faithfulness of existing models. Finally, we investigate if existing G2T datasets are still challenging for state-of-the-art models. Our code is available online: https://github.com/guihuzhang/FactSpotter.

Benchmarks

BenchmarkMethodologyMetrics
data-to-text-generation-on-dartT5-B Baseline
BLEU: 48.47
BLEURT: 67.49
METEOR: 40.74
data-to-text-generation-on-webnlgT5-B Baseline
BLEU: 67.04
BLEURT: 73.22
METEOR: 48.35
data-to-text-generation-on-webnlgJointGT Baseline
BLEU: 67.08
BLEURT: 73.44
METEOR: 48.34

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
FactSpotter: Evaluating the Factual Faithfulness of Graph-to-Text Generation | Papers | HyperAI