Conversational Web Navigation on WebLINX
Metrics
- Element (IoU)
- Intent Match
- Overall score
- Text (F1)
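The two matching-style metrics can be sketched as follows. This is a minimal illustration, assuming Element (IoU) is the intersection-over-union of the predicted and reference element bounding boxes and Text (F1) is token-level overlap between predicted and reference text, as is typical for benchmarks of this kind; the exact WebLINX scoring code may differ in details such as tokenization.

```python
from collections import Counter

def element_iou(box_a, box_b):
    """Intersection-over-union of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union else 0.0

def text_f1(pred, gold):
    """Token-level F1: harmonic mean of precision and recall over shared tokens."""
    pred_toks, gold_toks = pred.split(), gold.split()
    common = Counter(pred_toks) & Counter(gold_toks)
    overlap = sum(common.values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_toks)
    recall = overlap / len(gold_toks)
    return 2 * precision * recall / (precision + recall)
```

For example, two 10x10 boxes offset by 5 pixels overlap in a 5x5 region, giving an IoU of 25/175 (about 0.14); a prediction of "click the blue button" against the reference "click the button" scores a token F1 of 6/7 (about 0.86).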
Results
Performance of various models on this benchmark.
Model Name | Element (IoU) | Intent Match | Overall score | Text (F1) |
---|---|---|---|---|
Llama-2-13B | 22.82 | 81.91 | 25.21 | 26.60 |
S-LLaMA-2.7B | 22.60 | 84.00 | 25.02 | 27.17 |
Llama-2-7B | 22.26 | 82.64 | 24.57 | 26.50 |
Flan-T5-3B | 20.31 | 81.14 | 23.77 | 25.75 |
S-LLaMA-1.3B | 20.54 | 83.32 | 23.73 | 25.85 |
GPT-3.5F | 18.64 | 77.56 | 21.22 | 22.39 |
MindAct-3B | 16.50 | 79.89 | 20.94 | 23.16 |
Fuyu-8B | 15.70 | 80.07 | 19.97 | 22.30 |
Flan-T5-780M | 15.36 | 80.02 | 17.27 | 14.05 |
Pix2Act-1.3B | 8.28 | 81.80 | 16.88 | 25.21 |
MindAct-780M | 13.39 | 75.87 | 15.13 | 13.58 |
Flan-T5-250M | 14.86 | 79.69 | 14.99 | 9.21 |
MindAct-250M | 12.05 | 74.25 | 12.63 | 7.67 |
Pix2Act-282M | 6.20 | 79.71 | 12.51 | 16.40 |
GPT-4T (Zero-Shot) | 10.85 | 41.66 | 10.72 | 6.75 |
GPT-4V (Zero-Shot) | 10.91 | 42.36 | 10.45 | 6.21 |
GPT-3.5T (Zero-Shot) | 8.62 | 42.77 | 8.51 | 3.45 |
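The ranking by Overall score can be reproduced from the raw rows with a short script. This is a standalone sketch: the rows are embedded inline rather than scraped from the page, and the column order follows the table above (model, Element IoU, Intent Match, Overall score, Text F1).

```python
# Raw leaderboard rows, pipe-delimited, in the columns used by the table above.
ROWS = """\
GPT-3.5T (Zero-Shot) | 8.62 | 42.77 | 8.51 | 3.45
S-LLaMA-1.3B | 20.54 | 83.32 | 23.73 | 25.85
Pix2Act-1.3B | 8.28 | 81.80 | 16.88 | 25.21
MindAct-3B | 16.50 | 79.89 | 20.94 | 23.16
Fuyu-8B | 15.70 | 80.07 | 19.97 | 22.30
Llama-2-13B | 22.82 | 81.91 | 25.21 | 26.60
GPT-3.5F | 18.64 | 77.56 | 21.22 | 22.39
MindAct-780M | 13.39 | 75.87 | 15.13 | 13.58
Flan-T5-780M | 15.36 | 80.02 | 17.27 | 14.05
MindAct-250M | 12.05 | 74.25 | 12.63 | 7.67
Pix2Act-282M | 6.20 | 79.71 | 12.51 | 16.40
S-LLaMA-2.7B | 22.60 | 84.00 | 25.02 | 27.17
GPT-4T (Zero-Shot) | 10.85 | 41.66 | 10.72 | 6.75
Flan-T5-250M | 14.86 | 79.69 | 14.99 | 9.21
Flan-T5-3B | 20.31 | 81.14 | 23.77 | 25.75
GPT-4V (Zero-Shot) | 10.91 | 42.36 | 10.45 | 6.21
Llama-2-7B | 22.26 | 82.64 | 24.57 | 26.50"""

def parse(row):
    """Split one pipe-delimited row into (name, iou, intent, overall, f1)."""
    name, iou, intent, overall, f1 = (c.strip() for c in row.split("|"))
    return name, float(iou), float(intent), float(overall), float(f1)

# Sort descending on the Overall score column (index 3 of each parsed tuple).
ranked = sorted((parse(r) for r in ROWS.splitlines()),
                key=lambda r: r[3], reverse=True)

for name, _, _, overall, _ in ranked[:3]:
    print(f"{name}: {overall}")  # top entry is Llama-2-13B at 25.21
```

Note that the ranking flips depending on the key: sorting on Intent Match instead would put S-LLaMA-2.7B (84.00) first, which is why the leaderboard reports a separate Overall score.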