HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Diverse and Relevant Visual Storytelling with Scene Graph Embeddings

{Bernt Schiele Vera Demberg Khushboo Mehra Asad Sayeed Rakshith Shetty Xudong Hong}

Diverse and Relevant Visual Storytelling with Scene Graph Embeddings

Abstract

A problem in automatically generated stories for image sequences is that they use overly generic vocabulary and phrase structure and fail to match the distributional characteristics of human-generated text. We address this problem by introducing explicit representations for objects and their relations by extracting scene graphs from the images. Utilizing an embedding of this scene graph enables our model to more explicitly reason over objects and their relations during story generation, compared to the global features from an object classifier used in previous work. We apply metrics that account for the diversity of words and phrases of generated stories as well as for reference to narratively-salient image features and show that our approach outperforms previous systems. Our experiments also indicate that our models obtain competitive results on reference-based metrics.

Benchmarks

BenchmarkMethodologyMetrics
visual-storytelling-on-vistSGEmb
BLEU-1: 62.2
BLEU-2: 38.7
BLEU-3: 23.5
BLEU-4: 14.8
CIDEr: 8.6
METEOR: 35.6
ROUGE-L: 30.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Diverse and Relevant Visual Storytelling with Scene Graph Embeddings | Papers | HyperAI