hua
zhihua95
AI & ML interests
None yet
Recent Activity
upvoted a paper about 2 months ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization upvoted a paper over 1 year ago
Towards Achieving Human Parity on End-to-end Simultaneous Speech
Translation via LLM Agent Organizations
None yet