GUI grounding, which maps natural-language instructions to actionable UI elements, is a core capability of GUI agents. Prior works largely treats instructions as a static proxy for user intent, ...
Abstract: Many organizations rely on software systems to perform their core business operations. These systems often require modernization to accommodate new requirements and demands over time. Visual ...
We are happy to release MMBench-GUI, a hierarchical, multi-platform benchmark framework and toolbox, to evaluate GUI agents. MMBench-GUI is comprising four evaluation levels: GUI Content Understanding ...