From Scale to Speed: Adaptive Test-Time Scaling for Image Editing Paper • 2603.00141 • Published 11 days ago • 130
MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios Paper • 2602.22638 • Published 9 days ago • 104
Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models Paper • 2601.20354 • Published Jan 28 • 111