MoCha: Towards Movie-Grade Talking Character Synthesis Paper • 2503.23307 • Published Mar 30, 2025 • 141
Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality • Mar 4, 2025 • 78
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 25 items • Updated 29 days ago • 578
Text-to-Image History Collection How Text-to-Image evolved on HF and inspired the Community • 54 items • Updated 16 days ago • 16
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions Paper • 2402.17485 • Published Feb 27, 2024 • 194
MobileDiffusion: Subsecond Text-to-Image Generation on Mobile Devices Paper • 2311.16567 • Published Nov 28, 2023 • 20