HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

RealCQA: Scientific Chart Question Answering as a Test-bed for First-Order Logic

Ahmed Saleem ; Jawade Bhavin ; Pandey Shubham ; Setlur Srirangaraj ; Govindaraju Venu

RealCQA: Scientific Chart Question Answering as a Test-bed for
  First-Order Logic

Abstract

We present a comprehensive study of chart visual question-answering(QA) task,to address the challenges faced in comprehending and extracting data from chartvisualizations within documents. Despite efforts to tackle this problem usingsynthetic charts, solutions are limited by the shortage of annotated real-worlddata. To fill this gap, we introduce a benchmark and dataset for chart visualQA on real-world charts, offering a systematic analysis of the task and a noveltaxonomy for template-based chart question creation. Our contribution includesthe introduction of a new answer type, 'list', with both ranked and unrankedvariations. Our study is conducted on a real-world chart dataset fromscientific literature, showcasing higher visual complexity compared to otherworks. Our focus is on template-based QA and how it can serve as a standard forevaluating the first-order logic capabilities of models. The results of ourexperiments, conducted on a real-world out-of-distribution dataset, provide arobust evaluation of large-scale pre-trained models and advance the field ofchart visual QA and formal logic verification for neural networks in general.

Code Repositories

cse-ai-lab/RealCQA
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
chart-question-answering-on-realcqavlt5 - 11th ep FineTune
1:1 Accuracy: 0.310618012706403
chart-question-answering-on-realcqacrct- 11th ep FineTune
1:1 Accuracy: 0.239897973990427

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp