arxiv:2508.15763
Zhouqi Hua
ZhouqiHUA
AI & ML interests
reasoning LLM
Recent Activity
upvoted a paper 17 days ago
DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning upvoted a collection 22 days ago
RL and Agents liked
a model 24 days ago
internlm/Intern-S1-Pro Organizations
None yet