Yu Zeng
YuZeng260
AI & ML interests
VLMs, LLMs, RL, Agent, Reasoning
Recent Activity
upvoted
a
paper
33 minutes ago
Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model
authored
a paper
2 days ago
Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models