HyperAI超神经

Vcgbench Diverse On Videoinstruct

评估指标

Consistency
Contextual Understanding
Correctness of Information
Dense Captioning
Detail Orientation
Reasoning
Spatial Understanding
Temporal Understanding
mean

评测结果

各个模型在此基准测试上的表现结果

比较表格
模型名称ConsistencyContextual UnderstandingCorrectness of InformationDense CaptioningDetail OrientationReasoningSpatial UnderstandingTemporal Understandingmean
one-for-all-video-conversation-is-feasible2.272.592.201.032.623.622.351.292.19
videogpt-integrating-image-and-video-encoders2.592.812.461.382.733.632.801.782.47
chat-univi-unified-visual-representation2.362.662.291.332.563.592.361.562.29
mvbench-a-comprehensive-multi-modal-video2.272.512.131.262.423.132.431.662.20
vtimellm-empower-llm-to-grasp-video-moments2.352.482.161.132.413.452.291.462.17
video-chatgpt-towards-detailed-video2.062.462.070.892.423.602.251.392.08