QZX
zexuanqiu22
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
Search-R2: Enhancing Search-Integrated Reasoning via Actor-Refiner Collaboration upvoted a paper 4 months ago
Low-probability Tokens Sustain Exploration in Reinforcement Learning
with Verifiable Reward upvoted a paper 5 months ago
MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal
Mathematical ReasoningOrganizations
None yet