HyperAI超神经

Visual Instruction Following On Llava Bench

评估指标

avg score

评测结果

各个模型在此基准测试上的表现结果

比较表格
模型名称avg score
sharegpt4v-improving-large-multi-modal-models79.9
cumo-scaling-multimodal-llm-with-co-upcycled85.7
improved-baselines-with-visual-instruction70.7
instructblip-towards-general-purpose-vision58.2
blip-2-bootstrapping-language-image-pre38.1
sharegpt4v-improving-large-multi-modal-models72.6
instructblip-towards-general-purpose-vision60.9
improved-baselines-with-visual-instruction63.4