ChatGPT's One-year Anniversary: Are Open-Source Large Language Models Catching up? Paper • 2311.16989 • Published Nov 28, 2023
Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought Framework Paper • 2305.03268 • Published May 5, 2023 • 3
Retrieving Multimodal Information for Augmented Generation: A Survey Paper • 2303.10868 • Published Mar 20, 2023
How Much are LLMs Contaminated? A Comprehensive Survey and the LLMSanitize Library Paper • 2404.00699 • Published Mar 31, 2024
Can We Further Elicit Reasoning in LLMs? Critic-Guided Planning with Retrieval-Augmentation for Solving Challenging Tasks Paper • 2410.01428 • Published Oct 2, 2024 • 1
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs Paper • 2504.00993 • Published Apr 1, 2025 • 3
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe Paper • 2511.16334 • Published Nov 20, 2025 • 94
LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling Paper • 2511.20785 • Published Nov 25, 2025 • 188
MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification Paper • 2603.15726 • Published 6 days ago • 175
Learning Latent Proxies for Controllable Single-Image Relighting Paper • 2603.15555 • Published 6 days ago • 8
MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier Paper • 2603.03756 • Published 19 days ago • 89
VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning Paper • 2601.22069 • Published Jan 29 • 7
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation Paper • 2601.09688 • Published Jan 14 • 127
A Comprehensive Study on Visual Token Redundancy for Discrete Diffusion-based Multimodal Large Language Models Paper • 2511.15098 • Published Nov 19, 2025