FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published 23 days ago • 330
PRBench: End-to-end Paper Reproduction in Physics Research Paper • 2603.27646 • Published 14 days ago • 29
Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems Paper • 2602.08847 • Published Feb 9 • 29
InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery Paper • 2602.08990 • Published Feb 9 • 77
Spec-o3: A Tool-Augmented Vision-Language Agent for Rare Celestial Object Candidate Vetting via Automated Spectral Inspection Paper • 2601.06498 • Published Jan 10 • 1
Spec-o3 Collection A Tool-Augmented Vision-Language Agent for Rare Celestial Object Candidate Vetting via Automated Spectral Inspection • 5 items • Updated Jan 13 • 1