HJH2CMD/reasoning_evaluation_outcome-Claude-4.5-Haiku-Qwen3-8B-Think Viewer • Updated 1 day ago • 40.5k • 4
Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning Paper • 2601.06943 • Published Jan 11 • 214