Liming Wu
Limiww
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 6 hours ago
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation
upvoted
a
paper
4 months ago
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding
liked
a model
5 months ago
inclusionAI/LLaDA-MoE-7B-A1B-Base
Organizations
None yet