AgentLongBench: A Controllable Long Benchmark For Long-Contexts Agents via Environment Rollouts Paper • 2601.20730 • Published 16 days ago • 19