WebArbiter: Reasoning Process Reward Model for Web Agents. Models, training data, and WebPRMBench. ICLR 2026.
Yao Zhang
ZYao720
AI & ML interests
None yet
Recent Activity
published a dataset 1 day ago
ZYao720/WEBPRMBENCH updated a collection 1 day ago
WebArbiter updated a collection 1 day ago
WebArbiter