Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text Paper โข 2601.22975 โข Published Jan 30 โข 109
Expected Harm: Rethinking Safety Evaluation of (Mis)Aligned LLMs Paper โข 2602.01600 โข Published Feb 2 โข 21