HyperAIHyperAI

VisualOverload Scene Image Understanding Dataset

Date

15 days ago

Size

601.3 MB

Publish URL

huggingface.co

License

CC BY-SA 4.0

VisualOverload is a scene image understanding evaluation dataset that aims to examine the model's visual understanding and reasoning ability of details in complex scenes without relying on external knowledge.

This dataset contains 2,720 question-answer pairs, consisting of public-domain, high-resolution paintings that often feature multiple characters, actions, subplots, and complex backgrounds. The questions are manually designed to comprehensively test the model's scene understanding. This dataset is suitable for visual question answering research, detailed image understanding and reasoning, and evaluation of complex scenes with multiple characters and elements.

Dataset Example
VisualOverload.torrent
Seeding 1Downloading 0Completed 1Total Downloads 10
  • VisualOverload/
    • README.md
      1.31 KB
    • README.txt
      2.62 KB
      • data/
        • VisualOverload.zip
          601.3 MB