RealMaster: Lifting Rendered Scenes into Photorealistic Video • Paper • 2603.23462 • Published 7 days ago • 31
MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding • Paper • 2603.22458 • Published 8 days ago • 131
The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics • Paper • 2603.14375 • Published 17 days ago • 18
MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data • Paper • 2603.25319 • Published 6 days ago • 30
Calibri: Enhancing Diffusion Transformers via Parameter-Efficient Calibration • Paper • 2603.24800 • Published 6 days ago • 62
RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models • Paper • 2603.25502 • Published 6 days ago • 54
PixelSmile: Toward Fine-Grained Facial Expression Editing • Paper • 2603.25728 • Published 5 days ago • 115
prithivMLmods/Gliese-Qwen3.5-9B-Abliterated-Caption • Image-Text-to-Text • 9B • Updated 21 days ago • 14.4k • 39