Generate code with AI for HTML, React, Streamlit, and more
Tracks performance of LLMs, VLMs, and agents on web navigation tasks