ViT-AdaLA: Adapting Vision Transformers with Linear Attention Paper • 2603.16063 • Published 1 day ago • 2
SNCE: Geometry-Aware Supervision for Scalable Discrete Image Generation Paper • 2603.15150 • Published 2 days ago
Lavida-O: Elastic Large Masked Diffusion Models for Unified Multimodal Understanding and Generation Paper • 2509.19244 • Published Sep 23, 2025 • 12