OmniForcing: Unleashing Real-time Joint Audio-Visual Generation Paper • 2603.11647 • Published 28 days ago • 31
Can Vision-Language Models Solve the Shell Game? Paper • 2603.08436 • Published about 1 month ago • 39
BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing? Paper • 2603.03194 • Published Mar 3 • 57
SWE-World: Building Software Engineering Agents in Docker-Free Environments Paper • 2602.03419 • Published Feb 3 • 41
SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training Paper • 2602.03411 • Published Feb 3 • 39
IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction Paper • 2511.07327 • Published Nov 10, 2025 • 80
ReForm Collection ReForm: Reflective Autoformalization with Prospective Bounded Sequence Optimization • 4 items • Updated Oct 29, 2025 • 4
ReForm: Reflective Autoformalization with Prospective Bounded Sequence Optimization Paper • 2510.24592 • Published Oct 28, 2025 • 17
WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents Paper • 2509.13309 • Published Sep 16, 2025 • 67