Enabling Versatile Controls for Video Diffusion Models Paper • 2503.16983 • Published Mar 21, 2025 • 15
PP-DocBee: Improving Multimodal Document Understanding Through a Bag of Tricks Paper • 2503.04065 • Published Mar 6, 2025
baidu/ERNIE-4.5-VL-28B-A3B-Thinking Image-Text-to-Text • 30B • Updated 4 days ago • 784 • 522
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey Paper • 2509.02547 • Published Sep 2, 2025 • 232