Calibrate-Then-Act: Cost-Aware Exploration in LLM Agents Paper • 2602.16699 • Published 8 days ago • 14
ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models Paper • 2505.13444 • Published May 19, 2025 • 17 • 3
MiniCheck & LLM-AggreFact Collection MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents • 7 items • Updated May 17, 2025 • 4