HaluEval
Metrics
dialogue
general
llm_model
model_url
organization
parameters
qa
release_date
summarization
updated_time
Results
Performance results of various models on this benchmark
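| | dialogue | general | llm_model | model_url | organization | parameters | qa | release_date | summarization | updated_time | Paper Title | Code |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| API | 72.40 | 79.44 | ChatGPT | https://chatgpt.com/ | OpenAI | N/A | 62.59 | 2022.11.30 | 58.53 | 2023.10.23 | - | |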