namezz/lvm-rel-a-qwen2.5-3b-instruct-b-qwen2.5-1.5b-instruct Text Generation • 3B • Updated about 7 hours ago
namezz/lvm-rel-a-qwen2.5-3b-instruct-b-qwen2.5-1.5b-instruct Text Generation • 3B • Updated about 7 hours ago
namezz/lvm-rel-a-qwen2.5-3b-instruct-b-qwen2.5-3b-instruct Text Generation • 3B • Updated about 18 hours ago
namezz/lvm-rel-a-qwen2.5-3b-instruct-b-qwen2.5-3b-instruct Text Generation • 3B • Updated about 18 hours ago
ThinkRouter: Efficient Reasoning via Routing Thinking between Latent and Discrete Spaces Paper • 2602.11683 • Published 8 days ago • 7
namezz/cold-start-qwen-8b-base-inittag-keepthink-lr1e-5-gpu4-bs2-ga8-ep2-wr0.1-cut12000 308k • Updated Dec 12, 2025