Shenzhi Wang
shenzhi-wang
AI & ML interests
Large Language Model, Reinforcement Learning, and AI Agents
Recent Activity
authored a paper 10 days ago
Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models authored a paper 10 days ago
HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning submitted a paper 11 days ago
HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning