3ViewSense: Spatial and Mental Perspective Reasoning from Orthographic Views in Vision-Language Models Paper • 2603.07751 • Published 10 days ago • 11
LexSemBridge: Fine-Grained Dense Representation Enhancement through Token-Aware Embedding Augmentation Paper • 2508.17858 • Published Aug 25, 2025 • 10