HyperAI

Calm

Metrics

0-shot cot
0-shot icl
1-shot icl
3-shot icl
average
basic
cn
doubt
ef
en
ignore
llm_model
manual cot
model_url
organization
parameters
release_date
robustness
std
updated_time

Results

Performance results of various models on this benchmark

Model Name
0-shot cot
0-shot icl
1-shot icl
3-shot icl
average
basic
cn
doubt
ef
en
ignore
llm_model
manual cot
model_url
organization
parameters
release_date
robustness
std
updated_time
Paper TitleRepository
API54.551.954.160.056.854.452.654.652.158.453.9GPT-475.4https://openai.com/product/gpt-4OpenAIN/A2023/3/1483.79.92024/5/1--
0 of 1 row(s) selected.