HyperAI超神经

End To End Dialogue Modelling On Multiwoz 2 0

评估指标

BLEU
MultiWOZ (Inform)
MultiWOZ (Success)

评测结果

各个模型在此基准测试上的表现结果

比较表格
模型名称BLEUMultiWOZ (Inform)MultiWOZ (Success)
galaxy-a-generative-pre-trained-model-for20.594.485.3
pretraining-the-noisy-channel-model-for-task20.686.976.2
task-oriented-dialog-systems-that-consider18.676.360.4
soloist-few-shot-task-oriented-dialog-with-a16.585.572.9
augpt-dialogue-with-pre-trained-language17.290.275.5
a-simple-language-model-for-task-oriented15.084.470.1