
Imagine, Reason and Write: Visual Storytelling with Graph Knowledge and Relational Reasoning

Chunpu Xu, Min Yang, Chengming Li, Ying Shen, Xiang Ao, and Ruifeng Xu

Abstract

Visual storytelling is the task of creating a short story from a photo stream. Unlike image captions, stories contain not only factual descriptions but also imaginary concepts that do not appear in the images. In this paper, we propose a novel imagine-reason-write generation framework (IRW) for visual storytelling, inspired by the logic humans follow when writing a story. First, an imagine module learns the imaginative storyline explicitly, improving the coherence and reasonableness of the generated story. Second, a reason module exploits external knowledge (a commonsense knowledge base) and task-specific knowledge (a scene graph and an event graph) using a relational reasoning method based on the storyline. In this way, we can effectively capture the most informative commonsense and visual relationships among objects in the images, which enhances the diversity and informativeness of the generated story. Finally, we integrate the imaginary concepts and relational knowledge to generate a human-like story based on the original semantics of the images. Extensive experiments on a benchmark dataset (VIST) demonstrate that the proposed IRW framework significantly outperforms state-of-the-art methods across multiple evaluation metrics.
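The data flow between the three stages can be illustrated with a toy sketch. The abstract names the stages but not their interfaces, so every function name, type, and rule below is a hypothetical stand-in: the actual IRW modules are learned neural components, not dictionary lookups or templates.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

# Hypothetical representation of a knowledge triple: (head, relation, tail).
Triple = Tuple[str, str, str]


@dataclass
class StoryContext:
    image_concepts: List[str]  # concepts observed in the images
    storyline: List[str]       # imagined concepts not visible in the images
    knowledge: List[Triple]    # triples kept by the reasoning step


def imagine(image_concepts: List[str], associations: Dict[str, List[str]]) -> List[str]:
    """Toy stand-in for the imagine module: collect associated concepts
    that are not already visible in the images."""
    imagined: List[str] = []
    for concept in image_concepts:
        for extra in associations.get(concept, []):
            if extra not in image_concepts and extra not in imagined:
                imagined.append(extra)
    return imagined


def reason(storyline: List[str], graph: List[Triple]) -> List[Triple]:
    """Toy stand-in for the reason module: keep only triples whose head
    or tail is a storyline concept."""
    keep = set(storyline)
    return [t for t in graph if t[0] in keep or t[2] in keep]


def write(ctx: StoryContext) -> str:
    """Toy stand-in for the write module: fill a fixed template."""
    facts = "; ".join(f"{h} {r} {t}" for h, r, t in ctx.knowledge)
    return (f"Seen: {', '.join(ctx.image_concepts)}. "
            f"Imagined: {', '.join(ctx.storyline)}. Related: {facts}.")


concepts = ["cake", "candles"]
storyline = imagine(concepts, {"cake": ["birthday", "candles"], "candles": ["wish"]})
graph = [("birthday", "RelatedTo", "party"),
         ("dog", "IsA", "animal"),
         ("cake", "UsedFor", "wish")]
context = StoryContext(concepts, storyline, reason(storyline, graph))
story = write(context)
```

The point of the sketch is only the ordering: the storyline is produced first, it then selects which knowledge is relevant, and the writer sees the image concepts, the storyline, and the selected knowledge together.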

Benchmarks

Benchmark | Methodology | Metrics
visual-storytelling-on-vist | IRW | BLEU-1: 66.7, BLEU-2: 41.6, BLEU-3: 25.0, BLEU-4: 15.4, CIDEr: 11.0, METEOR: 35.6, ROUGE-L: 29.6
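For readers unfamiliar with the BLEU-n columns, the following is a minimal sketch of the conventional BLEU definition for a single candidate and a single reference: clipped n-gram precisions combined by a geometric mean and scaled by a brevity penalty. It assumes whitespace tokenization and no smoothing, and it is not the evaluation code behind the numbers above, which appear to be reported on a 0 to 100 scale.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def modified_precision(candidate, reference, n):
    """Clipped n-gram precision: each candidate n-gram is credited at most
    as many times as it occurs in the reference."""
    cand = ngrams(candidate, n)
    ref = ngrams(reference, n)
    total = sum(cand.values())
    if total == 0:
        return 0.0
    clipped = sum(min(count, ref[gram]) for gram, count in cand.items())
    return clipped / total


def bleu(candidate, reference, max_n):
    """BLEU-max_n with uniform weights and a brevity penalty (no smoothing)."""
    precisions = [modified_precision(candidate, reference, n)
                  for n in range(1, max_n + 1)]
    if min(precisions) == 0:
        return 0.0  # the geometric mean is zero if any precision is zero
    log_avg = sum(math.log(p) for p in precisions) / max_n
    c, r = len(candidate), len(reference)
    brevity_penalty = 1.0 if c > r else math.exp(1 - r / c)
    return brevity_penalty * math.exp(log_avg)


reference = "the cat sat on the mat".split()
candidate = "the the the cat".split()
p1 = modified_precision(candidate, reference, 1)
bleu1 = bleu(candidate, reference, 1)
bleu2 = bleu(candidate, reference, 2)
```

Because higher-order n-grams are harder to match, BLEU-n typically decreases as n grows, which is consistent with the falling values from BLEU-1 to BLEU-4 in the table.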
