Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data Paper • 2602.21320 • Published Feb 24 • 12
Learning on the Job: Test-Time Curricula for Targeted Reinforcement Learning Paper • 2510.04786 • Published Oct 6, 2025 • 3
Test-Time Curricula for Targeted RL (Qwen3-4B-Instruct-2507) Collection 8 items • Updated Oct 3, 2025