arxiv:2602.14234
Chenxiao Zhao
ChenShawn
AI & ML interests
Reinforcement learning
Recent Activity
authored
a paper
about 11 hours ago
REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents
authored
a paper
about 11 hours ago
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training
authored
a paper
about 11 hours ago
DeepEyesV2: Toward Agentic Multimodal Model
Organizations
None yet