HyperAI超神经

Bias Detection On Rt Inod Bias

Evaluation Metrics

Best-of

Evaluation Results

Performance of each model on this benchmark

Comparison Table
Model Name                                       Best-of
benchmarking-llama2-mistral-gemma-and-gpt-for    0.36
benchmarking-llama2-mistral-gemma-and-gpt-for    0.34
benchmarking-llama2-mistral-gemma-and-gpt-for    0.41
benchmarking-llama2-mistral-gemma-and-gpt-for    0.41
benchmarking-llama2-mistral-gemma-and-gpt-for    0.5