OpenThoughts3-1.2M Reasoning Dataset
Date
18 days ago
Size
23.45 GB
Publish URL
OpenThoughts3-1.2M is an open source reasoning dataset released by Open Thoughts in 2025. It is the third iteration of the OpenThoughts dataset series. The related paper results are:OpenThoughts: Data Recipes for Reasoning Models".
The dataset contains 850,000 math problems, 250,000 coding problems, and 100,000 science problems, and the annotations are completed using the QwQ-32B model.

Dataset Framework
OpenThoughts3-1.2M.torrent
Seeding 1Downloading 0Completed 2Total Downloads 3