DavidAU/Llama3.3-8B-Instruct-Thinking-Claude-4.5-Opus-High-Reasoning Text Generation • 8B • Updated Jan 6 • 562 • 100
Reinforcement Learning for Self-Improving Agent with Skill Library Paper • 2512.17102 • Published Dec 18, 2025 • 36
view article Article How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day Dec 8, 2025 • 53
view article Article Building for an Open Future - our new partnership with Google Cloud Nov 13, 2025 • 48
view article Article The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix Nov 3, 2025 • 63
AgentFold: Long-Horizon Web Agents with Proactive Context Management Paper • 2510.24699 • Published Oct 28, 2025 • 71
view article Article Back to The Future: Evaluating AI Agents on Predicting Future Events +5 Jul 17, 2025 • 51