HyperAI超神经

Image-to-Text Retrieval on COCO

Evaluation Metrics

Recall@1
Recall@10
Recall@5
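
For image-to-text retrieval, Recall@K is commonly computed as the percentage of query images for which at least one ground-truth caption appears among the K highest-scoring captions. The following is a minimal sketch under that assumption, not the benchmark's official evaluation script. The function name `recall_at_k`, the similarity-matrix layout, and the toy data are all hypothetical and chosen only for illustration.

```python
import numpy as np


def recall_at_k(similarity, gt_captions, ks=(1, 5, 10)):
    """Recall@K (in percent) for image-to-text retrieval.

    similarity:  array of shape (num_images, num_captions); higher = better match.
    gt_captions: one set of ground-truth caption indices per image
                 (COCO images typically have five captions each).
    """
    # Rank all captions for each image from most to least similar.
    ranking = np.argsort(-similarity, axis=1)
    results = {}
    for k in ks:
        hits = 0
        for img_idx, relevant in enumerate(gt_captions):
            top_k = ranking[img_idx, :k]
            # An image counts as a hit if any ground-truth caption is in its top K.
            if any(int(c) in relevant for c in top_k):
                hits += 1
        results[k] = 100.0 * hits / len(gt_captions)
    return results


# Toy data (hypothetical): 3 images, 6 captions, 2 ground-truth captions per image.
toy_sim = np.array([
    [0.90, 0.10, 0.20, 0.30, 0.00, 0.05],
    [0.80, 0.10, 0.20, 0.70, 0.00, 0.05],
    [0.90, 0.80, 0.70, 0.60, 0.10, 0.20],
])
toy_gt = [{0, 1}, {2, 3}, {4, 5}]
scores = recall_at_k(toy_sim, toy_gt, ks=(1, 2, 5))
```

Because a hit at rank K is also a hit at any larger K, Recall@1 ≤ Recall@5 ≤ Recall@10 always holds for a single model, which is a quick sanity check when reading the rows of a results table.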

Evaluation Results

Performance of each model on this benchmark

Comparison Table
Model name                                    Recall@1  Recall@10  Recall@5
blip-2-bootstrapping-language-image-pre       83.5      98.0       96.0
blip-2-bootstrapping-language-image-pre       85.4      98.5       97.0
deep-visual-semantic-alignments-for           -         74.8       -
one-peace-exploring-one-general               84.1      98.3       96.3
unicoder-vl-a-universal-encoder-for-vision    -         97.2       -
sigmoid-loss-for-language-image-pre-training  70.6      -          -
oscar-object-semantics-aligned-pre-training   -         99.8       -
flava-a-foundational-language-and-vision      42.74     -          76.76
learning-relation-alignment-for-calibrated    67.78     94.48      89.7
learning-transferable-visual-models-from      58.4      88.1       81.5