HyperAI

OpenThoughts3-1.2M Reasoning Dataset

Date

18 days ago

Size

23.45 GB

Publish URL

huggingface.co

Categories

OpenThoughts3-1.2M is an open source reasoning dataset released by Open Thoughts in 2025. It is the third iteration of the OpenThoughts dataset series. The related paper results are:OpenThoughts: Data Recipes for Reasoning Models".

The dataset contains 850,000 math problems, 250,000 coding problems, and 100,000 science problems, and the annotations are completed using the QwQ-32B model.

Dataset Framework

OpenThoughts3-1.2M.torrent
Seeding 1Downloading 0Completed 2Total Downloads 3
  • OpenThoughts3-1.2M/
    • README.md
      1.14 KB
    • README.txt
      2.27 KB
      • data/
        • OpenThoughts3-1.2M.zip
          23.45 GB