arxiv:2511.15299
Jialong Sun
Pillow-1
AI & ML interests
None yet
Recent Activity
upvoted a paper about 5 hours ago
Can LLMs Learn to Reason Robustly under Noisy Supervision? upvoted a paper 5 days ago
A Survey of On-Policy Distillation for Large Language Models upvoted a paper 6 days ago
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization