ISO-Bench: Can Coding Agents Optimize Real-World Inference Workloads? Paper • 2602.19594 • Published 6 days ago
Why LLMs Aren't Scientists Yet: Lessons from Four Autonomous Research Attempts Paper • 2601.03315 • Published Jan 6