Submitted by CNcreator0331 76 LongAnimation: Long Animation Generation with Dynamic Global-Local Memory · 4 authors 227 10
Submitted by Yifan-Zhong 39 A Survey on Vision-Language-Action Models: An Action Tokenization Perspective · 14 authors 459 1
Submitted by zhuoyang20 23 Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation · 7 authors 90 1
Submitted by yukangcao 18 FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model · 4 authors 85 1
Submitted by SiyouLi 15 μ^2Tokenizer: Differentiable Multi-Scale Multi-Modal Tokenizer for Radiology Report Generation Alpachino 275 1
Submitted by multimodalart 11 ARIG: Autoregressive Interactive Head Generation for Real-time Conversations · 5 authors 1
Submitted by shash42 9 Answer Matching Outperforms Multiple Choice for Language Model Evaluation · 5 authors 16 2
Submitted by jslee525 5 STR-Match: Matching SpatioTemporal Relevance Score for Training-Free Video Editing · 3 authors 7 1