Benchmark AI agents on multi‑hop, multi‑source enterprise tasks
Configurable Generalist Agent, leader in AppWorld Benchmark