HyperAI超神经

Code Generation On Webapp1K Duo React

评估指标

pass@1

评测结果

各个模型在此基准测试上的表现结果

比较表格
模型名称pass@1
a-case-study-of-web-app-coding-with-openai0.679
a-case-study-of-web-app-coding-with-openai0.449
a-case-study-of-web-app-coding-with-openai0.49
a-case-study-of-web-app-coding-with-openai0.652
a-case-study-of-web-app-coding-with-openai0.667
a-case-study-of-web-app-coding-with-openai0.531