The ToolRL model trained for tool use through GRPO
Cheng Qian
chengq9
AI & ML interests
Agent, Tool Learning
Recent Activity
upvoted a paper 4 days ago
Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data upvoted a collection 27 days ago
AgentDoG upvoted a paper 3 months ago
JustRL: Scaling a 1.5B LLM with a Simple RL Recipe