Date

6 months ago

Size

369.86 MB

Organization

Paper URL

2508.10433

License

Non-Commercial

*This dataset supports online use.Click here to jump.

We-Math2.0-Standard is a standard dataset for visual mathematical reasoning released by Beijing University of Posts and Telecommunications, Tencent and Tsinghua University in 2025. The related paper results are "WE-MATH 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning", aims to provide a diagnosable, explainable and comparable evaluation basis.

This dataset builds a unified label space around 1,819 precisely defined knowledge principles, explicitly annotating each question with the principle and rigorously curating it, thereby achieving broad and balanced coverage overall, particularly strengthening mathematical subfields and question types that were previously underrepresented. The dataset adopts a dual expansion design:

First, multiple images per question are used to test the integration and alignment of multi-source visual evidence;
Second, multi-questions per image are used to test multi-principle transfer and conceptual flexibility in the same visual context.

Each example consists of an image and a text stem, and is accompanied by annotations of the knowledge principles and standard answers that the question relies on.

We-Mathv2-Standard.torrent

Seeding 1Downloading 0Completed 55Total Downloads 153

We-Mathv2-Standard/
- README.md
  1.82 KB
- README.txt
  3.65 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at support@hyper.ai for prompt review and removal.

Related Datasets

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Use this Dataset

Discuss on Discord

Date

6 months ago

Size

369.86 MB

Organization

Paper URL

2508.10433

License

Non-Commercial

*This dataset supports online use.Click here to jump.

First, multiple images per question are used to test the integration and alignment of multi-source visual evidence;
Second, multi-questions per image are used to test multi-principle transfer and conceptual flexibility in the same visual context.

Each example consists of an image and a text stem, and is accompanied by annotations of the knowledge principles and standard answers that the question relies on.

We-Mathv2-Standard.torrent

Seeding 1Downloading 0Completed 55Total Downloads 153

We-Mathv2-Standard/
- README.md
  1.82 KB
- README.txt
  3.65 KB

Related Datasets

Nemotron-Math-v2 Mathematical Inference Dataset

a month ago

MUVR Multimodal Uncropped Video Retrieval Benchmark

2 months ago

IF-Bench Infrared Image Understanding Benchmark Dataset

2 months ago

GroundingME Complex Scene Understanding Evaluation Dataset

a month ago

VERA Voice Reasoning Evaluation Dataset

3 months ago

2.37 GB59

Nemotron-Math-Proofs-v1 Mathematical Formal Proofs Dataset

a month ago

VenusBench-GD Cross-Platform Interface Understanding Dataset

a month ago

UNO-Bench full-modal Evaluation Benchmark Dataset

3 months ago

9.71 GB69

PhysToolBench Physics Tool Task Dataset

2 months ago

1.56 GB58

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

We-Math2.0-Standard Visual Mathematical Reasoning Benchmark Dataset

*This dataset supports online use.Click here to jump.

Build AI with AI

HyperAI Newsletters

Command Palette

We-Math2.0-Standard Visual Mathematical Reasoning Benchmark Dataset

*This dataset supports online use.Click here to jump.

Related Datasets

Nemotron-Math-v2 Mathematical Inference Dataset

MUVR Multimodal Uncropped Video Retrieval Benchmark

IF-Bench Infrared Image Understanding Benchmark Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

VERA Voice Reasoning Evaluation Dataset

Nemotron-Math-Proofs-v1 Mathematical Formal Proofs Dataset

VenusBench-GD Cross-Platform Interface Understanding Dataset

UNO-Bench full-modal Evaluation Benchmark Dataset

PhysToolBench Physics Tool Task Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

We-Math2.0-Standard Visual Mathematical Reasoning Benchmark Dataset

*This dataset supports online use.Click here to jump.

Related Datasets

Nemotron-Math-v2 Mathematical Inference Dataset

MUVR Multimodal Uncropped Video Retrieval Benchmark

IF-Bench Infrared Image Understanding Benchmark Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

VERA Voice Reasoning Evaluation Dataset

Nemotron-Math-Proofs-v1 Mathematical Formal Proofs Dataset

VenusBench-GD Cross-Platform Interface Understanding Dataset

UNO-Bench full-modal Evaluation Benchmark Dataset

PhysToolBench Physics Tool Task Dataset

Build AI with AI

HyperAI Newsletters

Related Datasets

Nemotron-Math-v2 Mathematical Inference Dataset

MUVR Multimodal Uncropped Video Retrieval Benchmark

IF-Bench Infrared Image Understanding Benchmark Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

VERA Voice Reasoning Evaluation Dataset

Nemotron-Math-Proofs-v1 Mathematical Formal Proofs Dataset

VenusBench-GD Cross-Platform Interface Understanding Dataset

UNO-Bench full-modal Evaluation Benchmark Dataset

PhysToolBench Physics Tool Task Dataset

Related Datasets

Nemotron-Math-v2 Mathematical Inference Dataset

MUVR Multimodal Uncropped Video Retrieval Benchmark

IF-Bench Infrared Image Understanding Benchmark Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

VERA Voice Reasoning Evaluation Dataset

Nemotron-Math-Proofs-v1 Mathematical Formal Proofs Dataset

VenusBench-GD Cross-Platform Interface Understanding Dataset

UNO-Bench full-modal Evaluation Benchmark Dataset

PhysToolBench Physics Tool Task Dataset