MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants Paper • 2603.09652 • Published 3 days ago • 12
MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants Paper • 2603.09652 • Published 3 days ago • 12
Don't Just Fine-tune the Agent, Tune the Environment Paper • 2510.10197 • Published Oct 11, 2025 • 30
Don't Just Fine-tune the Agent, Tune the Environment Paper • 2510.10197 • Published Oct 11, 2025 • 30
Recon-Act: A Self-Evolving Multi-Agent Browser-Use System via Web Reconnaissance, Tool Generation, and Task Execution Paper • 2509.21072 • Published Sep 25, 2025 • 15
Recon-Act: A Self-Evolving Multi-Agent Browser-Use System via Web Reconnaissance, Tool Generation, and Task Execution Paper • 2509.21072 • Published Sep 25, 2025 • 15
Recon-Act: A Self-Evolving Multi-Agent Browser-Use System via Web Reconnaissance, Tool Generation, and Task Execution Paper • 2509.21072 • Published Sep 25, 2025 • 15 • 2
AWorld: Orchestrating the Training Recipe for Agentic AI Paper • 2508.20404 • Published Aug 28, 2025 • 38
AWorld: Orchestrating the Training Recipe for Agentic AI Paper • 2508.20404 • Published Aug 28, 2025 • 38
AWorld: Orchestrating the Training Recipe for Agentic AI Paper • 2508.20404 • Published Aug 28, 2025 • 38 • 2
AWorld: Dynamic Multi-Agent System with Stable Maneuvering for Robust GAIA Problem Solving Paper • 2508.09889 • Published Aug 13, 2025 • 32
AWorld: Dynamic Multi-Agent System with Stable Maneuvering for Robust GAIA Problem Solving Paper • 2508.09889 • Published Aug 13, 2025 • 32 • 2
view reply hi, we will release the paper soon. Or, you can see the prompt here: https://github.com/inclusionAI/AWorld/blob/main/examples/gaia/mcp_collections/intelligence/guard.py
FunReason: Enhancing Large Language Models' Function Calling via Self-Refinement Multiscale Loss and Automated Data Refinement Paper • 2505.20192 • Published May 26, 2025 • 2
CSR:Achieving 1 Bit Key-Value Cache via Sparse Representation Paper • 2412.11741 • Published Dec 16, 2024