Visual Instruction Following On Llava Bench
评估指标
avg score
评测结果
各个模型在此基准测试上的表现结果
比较表格
模型名称 | avg score |
---|---|
sharegpt4v-improving-large-multi-modal-models | 79.9 |
cumo-scaling-multimodal-llm-with-co-upcycled | 85.7 |
improved-baselines-with-visual-instruction | 70.7 |
instructblip-towards-general-purpose-vision | 58.2 |
blip-2-bootstrapping-language-image-pre | 38.1 |
sharegpt4v-improving-large-multi-modal-models | 72.6 |
instructblip-towards-general-purpose-vision | 60.9 |
improved-baselines-with-visual-instruction | 63.4 |