YijuGuo
AI & ML interests
LLM Alignment
Recent Activity
upvoted a paper 10 days ago
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation upvoted a paper 14 days ago
AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep Research authored
a paper
19 days ago
Controllable Preference Optimization: Toward Controllable
Multi-Objective Alignment