stablegravity's Collections checkitoutlater
MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance
Paper
• 2412.05355
• Published
• 8
SwiftEdit: Lightning Fast Text-Guided Image Editing via One-Step Diffusion
Paper
• 2412.04301
• Published
• 40
PanoDreamer: 3D Panorama Synthesis from a Single Image
Paper
• 2412.04827
• Published
• 10
Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation
Paper
• 2412.06781
• Published
• 23
From Elements to Design: A Layered Approach for Automatic Graphic Design Composition
Paper
• 2412.19712
• Published
• 15
MetaChain: A Fully-Automated and Zero-Code Framework for LLM Agents
Paper
• 2502.05957
• Published
• 15
Expect the Unexpected: FailSafe Long Context QA for Finance
Paper
• 2502.06329
• Published
• 133
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features
Paper
• 2502.14786
• Published
• 158
X-Dancer: Expressive Music to Human Dance Video Generation
Paper
• 2502.17414
• Published
• 14
MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice
Paper
• 2503.05978
• Published
• 36
Motion Anything: Any to Motion Generation
Paper
• 2503.06955
• Published
• 35
InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
Paper
• 2503.16418
• Published
• 36
Paper
• 2503.14378
• Published
• 61
LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis
Paper
• 2503.21749
• Published
• 26
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes
Paper
• 2503.23461
• Published
• 94
MoCha: Towards Movie-Grade Talking Character Synthesis
Paper
• 2503.23307
• Published
• 139
UniTEX: Universal High Fidelity Generative Texturing for 3D Shapes
Paper
• 2505.23253
• Published
• 4
How Animals Dance (When You're Not Looking)
Paper
• 2505.23738
• Published
• 3
Sherlock: Self-Correcting Reasoning in Vision-Language Models
Paper
• 2505.22651
• Published
• 48
Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers
Paper
• 2505.21497
• Published
• 109
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data
Paper
• 2505.18445
• Published
• 63
EgoX: Egocentric Video Generation from a Single Exocentric Video
Paper
• 2512.08269
• Published
• 119
PersonaLive! Expressive Portrait Image Animation for Live Streaming
Paper
• 2512.11253
• Published
• 38
OmniPSD: Layered PSD Generation with Diffusion Transformer
Paper
• 2512.09247
• Published
• 48
Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization
Paper
• 2601.05432
• Published
• 167
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation
Paper
• 2512.23576
• Published
• 65