HyperAI

Douban Conversation Corpus

Date

2 years ago

Size

683.06 MB

Organization

Beijing University of Aeronautics and Astronautics

Publish URL

github.com

License

其他

This dataset includes a training dataset, a development set, and a test set for a retrieval-based chatbot. The test data contains 1,000 conversation contexts, and for each context, the researchers created 10 responses as candidates. The researchers recruited three annotators to judge whether the candidates responded appropriately to the meeting, and a correct response means that the response can naturally reply to the message given the context. Each pair received three labels, and the majority of the labels were considered the final decision.

Douban.torrent
Seeding 2Downloading 0Completed 199Total Downloads 466
  • Douban/
    • README.md
      1.29 KB
    • README.txt
      2.57 KB
      • data/
        • README.md
          3.28 KB
        • dev.txt
          32.43 MB
        • test.txt
          39.42 MB
        • train.txt
          683.06 MB