HyperAI

Mixture-of-Thoughts Reasoning Dataset

Date

a month ago

Size

5.05 GB

Publish URL

huggingface.co

Categories

Mixture-of-Thoughts is a multi-domain reasoning dataset that integrates high-quality reasoning tracks from three major fields: mathematics, programming, and science. It aims to train large language models (LLMs) to perform reasoning step by step. Each sample in this dataset contains messages Fields store the reasoning process in the form of multiple rounds of dialogue (such as: question → thinking steps → answer), supporting the model's ability to learn step-by-step reasoning.

Dataset structure:

  • Mathematics: 93.7k math problem reasoning traces
  • Programming: 83.1k reasoning tracks for competitive programming problems in Python and C++
  • Science: 173k Reasoning tracks for scientific questions
Mixture-of-Thoughts.torrent
Seeding 1Downloading 0Completed 7Total Downloads 17
  • Mixture-of-Thoughts/
    • README.md
      1.29 KB
    • README.txt
      2.58 KB
      • data/
        • Mixture-of-Thoughts.zip
          5.05 GB