arxiv:2501.12959
Xueyan Niu
niuxueyan
ยท
AI & ML interests
None yet
Recent Activity
upvoted a paper about 2 months ago
On the Non-decoupling of Supervised Fine-tuning and Reinforcement Learning in Post-training submitted
a paper
about 2 months ago
On the Non-decoupling of Supervised Fine-tuning and Reinforcement Learning in Post-training updated
a model about 2 months ago
niuxueyan/qwen3-0.6b-rl-sft-ckpt Organizations
None yet