HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations

Leonard Salewski A. Sophia Koepke Hendrik P. A. Lensch Zeynep Akata

CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations

Abstract

Providing explanations in the context of Visual Question Answering (VQA) presents a fundamental problem in machine learning. To obtain detailed insights into the process of generating natural language explanations for VQA, we introduce the large-scale CLEVR-X dataset that extends the CLEVR dataset with natural language explanations. For each image-question pair in the CLEVR dataset, CLEVR-X contains multiple structured textual explanations which are derived from the original scene graphs. By construction, the CLEVR-X explanations are correct and describe the reasoning and visual information that is necessary to answer a given question. We conducted a user study to confirm that the ground-truth explanations in our proposed dataset are indeed complete and relevant. We present baseline results for generating natural language explanations in the context of VQA using two state-of-the-art frameworks on the CLEVR-X dataset. Furthermore, we provide a detailed analysis of the explanation generation quality for different question and answer types. Additionally, we study the influence of using different numbers of ground-truth explanations on the convergence of natural language generation (NLG) metrics. The CLEVR-X dataset is publicly available at \url{https://explainableml.github.io/CLEVR-X/}.

Code Repositories

explainableml/clevr-x
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
explanation-generation-on-clevr-xPJ-X
Acc: 63.0
B4: 87.4
C: 639.8
M: 58.9
RL: 93.4
explanation-generation-on-clevr-xFM
Acc: 80.3
B4: 78.8
C: 566.8
M: 52.5
RL: 85.8

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp