Zhouqi Hua
ZhouqiHUA
AI & ML interests
reasoning LLM
Recent Activity
upvoted a paper 26 days ago
DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning upvoted a collection about 1 month ago
RL and Agents liked
a model about 1 month ago
internlm/Intern-S1-Pro Organizations
None yet