Visual Question Answering on TextVQA Test 1
Evaluation Metric
overall
Evaluation Results
Performance of each model on this benchmark
Model Name | overall | Paper Title | Repository |
---|---|---|---|
TAG | 53.69 | TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation | |
CRN (Single Model) | 40.96 | - | - |
PaLI | 73.1 | PaLI: A Jointly-Scaled Multilingual Language-Image Model | |
SAM (Single Model) | 44.8 | - | - |
CIG | 40.77 | - | - |
SMA single model | 45.51 | - | - |
Shuai | 39.95 | - | - |
mmgnn | 32.46 | - | - |
ssbaseline | 45.66 | - | - |
colab_buaa | 44.73 | - | - |
M4C | 40.46 | - | - |
TAP | 53.97 | - | - |