In-Context Reinforcement Learning for Tool Use in Large Language Models Paper • 2603.08068 • Published 11 days ago • 39
Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs Paper • 2603.09095 • Published 10 days ago • 28
AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios Paper • 2602.23166 • Published 22 days ago • 44
DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval Paper • 2603.04743 • Published 15 days ago • 51
Heterogeneous Agent Collaborative Reinforcement Learning Paper • 2603.02604 • Published 17 days ago • 185
Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling Paper • 2603.04791 • Published 15 days ago • 16
Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory Paper • 2603.04257 • Published 16 days ago • 19
ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning Paper • 2602.21534 • Published 23 days ago • 23
PyVision-RL: Forging Open Agentic Vision Models via RL Paper • 2602.20739 • Published 24 days ago • 31
EgoPush: Learning End-to-End Egocentric Multi-Object Rearrangement for Mobile Robots Paper • 2602.18071 • Published 28 days ago • 22
GLM-5: from Vibe Coding to Agentic Engineering Paper • 2602.15763 • Published about 1 month ago • 115
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 28 days ago • 488