HyperAI超神经

Zero-Shot Video Retrieval on MSVD

Evaluation Metrics

text-to-video R@1
text-to-video R@10
text-to-video R@5
video-to-text R@1
video-to-text R@10
video-to-text R@5
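
R@K (Recall at K) is the percentage of queries whose correct match appears among the top K retrieved candidates. The sketch below illustrates that definition in a minimal form; it is not the evaluation code behind this leaderboard. The function name `recall_at_k`, the toy similarity matrix, and the assumption that query i has exactly one correct candidate (candidate i) are all illustrative. Real MSVD evaluation pairs each video with several captions, so actual protocols may differ.

```python
def recall_at_k(similarity, k):
    """Percentage of queries whose correct candidate ranks in the top k.

    similarity[i][j] is the score between query i and candidate j.
    Assumption: the correct candidate for query i is candidate i.
    """
    hits = 0
    for i, row in enumerate(similarity):
        # Zero-based rank = number of candidates scoring strictly higher.
        rank = sum(1 for score in row if score > row[i])
        if rank < k:
            hits += 1
    return 100.0 * hits / len(similarity)


# Toy 3x3 text-to-video similarity matrix (made-up numbers).
sim = [
    [0.9, 0.1, 0.3],  # query 0: correct video ranked 1st
    [0.8, 0.5, 0.2],  # query 1: correct video ranked 2nd
    [0.4, 0.6, 0.1],  # query 2: correct video ranked 3rd
]

t2v_r1 = recall_at_k(sim, 1)
t2v_r5 = recall_at_k(sim, 5)

# Video-to-text swaps the roles of queries and candidates,
# which corresponds to transposing the matrix.
sim_v2t = [list(col) for col in zip(*sim)]
v2t_r1 = recall_at_k(sim_v2t, 1)
```

Higher values are better for every metric listed above, and for a given model R@1 ≤ R@5 ≤ R@10 always holds, because a larger K can only add hits.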

Evaluation Results

Performance of each model on this benchmark

Comparison Table
| Model | text-to-video R@1 | text-to-video R@10 | text-to-video R@5 | video-to-text R@1 | video-to-text R@10 | video-to-text R@5 |
| --- | --- | --- | --- | --- | --- | --- |
| internvideo2-scaling-video-foundation-models | 58.1 | 88.4 | 83.0 | 83.3 | 96.9 | 94.3 |
| vid-tldr-training-free-token-merging-for | 50.0 | 85.5 | 77.6 | 75.7 | 95.1 | 90.0 |
| clip4clip-an-empirical-study-of-clip-for-end | 38.5 | 76.8 | 66.9 | - | - | - |
| noise-estimation-using-density-estimation-for | 13.66 | 47.74 | 35.7 | - | - | - |
| miles-visual-bert-pre-training-with-injected | 44.4 | 87.0 | 76.2 | - | - | - |
| howtocaption-prompting-llms-to-transform | 44.5 | 82.1 | 73.3 | - | - | - |
| bridgeformer-bridging-video-text-retrieval | 43.6 | 84.9 | 74.9 | - | - | - |
| languagebind-extending-video-language | 53.9 | 87.8 | 80.4 | 72.0 | 96.3 | 91.4 |
| howtocaption-prompting-llms-to-transform | 54.8 | 87.2 | 80.9 | - | - | - |
| internvideo2-scaling-video-foundation-models | 59.3 | 89.6 | 84.4 | 83.1 | 97.0 | 94.2 |
| unmasked-teacher-towards-training-efficient | 49.0 | 84.7 | 76.9 | 74.5 | 92.8 | 89.7 |
| lat-latent-translation-with-cycle-consistency | 36.9 | 81.0 | 68.6 | 34.4 | 79.2 | 69.0 |
| languagebind-extending-video-language | 54.1 | 88.1 | 81.1 | 69.7 | 97.9 | 91.8 |
| internvideo-general-video-foundation-models | 43.4 | - | - | 67.6 | - | - |