VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks

Jing Yu Koh · Robert Lo · Lawrence Jang · Vikram Duvvur · Ming Lim · Po-Yu Huang · Graham Neubig · Shuyan Zhou · Ruslan Salakhutdinov · Daniel Fried

Abstract
