HyperAI超神经

Question Answering On Social Iqa

评估指标

Accuracy

评测结果

各个模型在此基准测试上的表现结果

比较表格
模型名称Accuracy
llama-open-and-efficient-foundation-language-150.4
llama-open-and-efficient-foundation-language-148.9
unifiedqa-crossing-format-boundaries-with-a79.8
training-compute-optimal-large-language51.3
llama-open-and-efficient-foundation-language-152.3
two-is-better-than-many-binary-classification80.2
roberta-a-robustly-optimized-bert-pretraining76.7
two-is-better-than-many-binary-classification79.9
mixlora-enhancing-large-language-models-fine78.8
task-compass-scaling-multi-task-pre-training82.2
mixture-of-subspaces-in-low-rank-adaptation81.0
socialiqa-commonsense-reasoning-about-social33.3
scaling-language-models-methods-analysis-150.6
socialiqa-commonsense-reasoning-about-social63.1
task-compass-scaling-multi-task-pre-training79.6
socialiqa-commonsense-reasoning-about-social64.5
socialiqa-commonsense-reasoning-about-social63
mixlora-enhancing-large-language-models-fine82.5
llama-open-and-efficient-foundation-language-150.4
task-compass-scaling-multi-task-pre-training81.7
textbooks-are-all-you-need-ii-phi-1-552.6
mixlora-enhancing-large-language-models-fine78
textbooks-are-all-you-need-ii-phi-1-553.0
unicorn-on-rainbow-a-universal-commonsense83.2