Open-Models
updated
Text Generation
• 120B • Updated • 4.65M
• • 4.59k
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning
Paper
• 2512.20605
• Published • 62
Nested Browser-Use Learning for Agentic Information Seeking
Paper
• 2512.23647
• Published • 19
TimeBill: Time-Budgeted Inference for Large Language Models
Paper
• 2512.21859
• Published • 25
ResembleAI/chatterbox-turbo
Text-to-Speech
• Updated • 626
mHC: Manifold-Constrained Hyper-Connections
Paper
• 2512.24880
• Published • 318
GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models
Paper
• 2512.15560
• Published • 25
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone
Paper
• 2512.22615
• Published • 50
Text-to-3D
• Updated • 514
• 385
Image-to-Video
• Updated • 1.3M
• • 1.64k
LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR
Paper
• 2601.14251
• Published • 26
DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation
Paper
• 2601.22153
• Published • 74
tencent/Youtu-VL-4B-Instruct
Image-Text-to-Text
• 5B • Updated • 373
• 153
Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation Generation
Paper
• 2601.21406
• Published • 5
Reinforcement Learning via Self-Distillation
Paper
• 2601.20802
• Published • 42
DeepSeek-OCR 2: Visual Causal Flow
Paper
• 2601.20552
• Published • 65
Image-to-Text
• Updated • 3.03M
• • 1.39k
unsloth/Qwen3-Coder-Next-FP8-Dynamic
Text Generation
• 80B • Updated • 66k
• 37
Text Generation
• 80B • Updated • 1.21M
• • 1.15k
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning
Paper
• 2602.12099
• Published • 60
Image-to-Video
• Updated • 796k
• 692
Updated • 176
• 110