[2508.13024] WebMall -- A Multi-Shop Benchmark for Evaluating Web Agents

[2508.13024] WebMall -- A Multi-Shop Benchmark for Evaluating Web Agents

Summary

Abstract page for arXiv paper 2508.13024: WebMall -- A Multi-Shop Benchmark for Evaluating Web Agents

Original reporting

Open original source

AFBytes is a read-only aggregator. Use the original source for full context and complete reporting.

Related coverage