HyperAI超神经

Image Classification on ColonINST-v1 (Unseen)

Evaluation Metrics

Accuracy

Evaluation Results

Performance of each model on this benchmark

| Model Name | Accuracy | Paper Title |
| --- | --- | --- |
| Bunny-v1.0-3B (w/ LoRA, w/ extra data) | 79.50 | Efficient Multimodal Learning from Data-centric Perspective |
| LLaVA-Med-v1.5 (w/ LoRA, w/o extra data) | 79.24 | LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day |
| MobileVLM-1.7B (w/o LoRA, w/ extra data) | 78.75 | MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices |
| MiniGPT-v2 (w/ LoRA, w/ extra data) | 76.82 | MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning |
| LLaVA-Med-v1.0 (w/o LoRA, w/o extra data) | 78.04 | LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day |
| LLaVA-v1.5 (w/ LoRA, w/o extra data) | 79.10 | Improved Baselines with Visual Instruction Tuning |
| ColonGPT (w/ LoRA, w/o extra data) | 83.24 | Frontiers in Intelligent Colonoscopy |
| MGM-2B (w/o LoRA, w/ extra data) | 78.69 | Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models |
| MGM-2B (w/o LoRA, w/o extra data) | 78.99 | Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models |
| LLaVA-Med-v1.0 (w/o LoRA, w/ extra data) | 77.38 | LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day |
| LLaVA-v1 (w/ LoRA, w/ extra data) | 42.17 | Visual Instruction Tuning |
| Bunny-v1.0-3B (w/ LoRA, w/o extra data) | 75.50 | Efficient Multimodal Learning from Data-centric Perspective |
| LLaVA-v1 (w/ LoRA, w/o extra data) | 72.08 | Visual Instruction Tuning |
| LLaVA-v1.5 (w/ LoRA, w/ extra data) | 80.89 | Improved Baselines with Visual Instruction Tuning |
| MiniGPT-v2 (w/ LoRA, w/o extra data) | 77.93 | MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning |
| LLaVA-Med-v1.5 (w/ LoRA, w/ extra data) | 66.51 | LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day |
| MobileVLM-1.7B (w/ LoRA, w/ extra data) | 80.44 | MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices |