CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published 3 days ago • 224
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published 12 days ago • 288
Gen-Searcher: Reinforcing Agentic Search for Image Generation Paper • 2603.28767 • Published 2 days ago • 50
TAPS: Task Aware Proposal Distributions for Speculative Sampling Paper • 2603.27027 • Published 5 days ago • 133
Calibri: Enhancing Diffusion Transformers via Parameter-Efficient Calibration Paper • 2603.24800 • Published 7 days ago • 65
RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models Paper • 2603.25502 • Published 6 days ago • 55
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale Paper • 2603.25040 • Published 7 days ago • 124
PixelSmile: Toward Fine-Grained Facial Expression Editing Paper • 2603.25728 • Published 6 days ago • 116
EVA: Efficient Reinforcement Learning for End-to-End Video Agent Paper • 2603.22918 • Published 9 days ago • 42
CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents Paper • 2603.24440 • Published 7 days ago • 92
When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning Paper • 2603.21289 • Published 10 days ago • 34
SIMART: Decomposing Monolithic Meshes into Sim-ready Articulated Assets via MLLM Paper • 2603.23386 • Published 8 days ago • 40
From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents Paper • 2603.22386 • Published 9 days ago • 54