SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning Paper • 2603.22057 • Published 1 day ago • 38
Vision-aligned Latent Reasoning for Multi-modal Large Language Model Paper • 2602.04476 • Published Feb 4 • 14