loaiabdalslam/Ouroboros-1MContext-Gemma-270m Text Generation • 0.3B • Updated 5 days ago • 307 • 8
HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing Paper • 2602.03560 • Published 14 days ago • 43
SamsungSAILMontreal/Qwen3-30B-A3B-Instruct-2507-REAM Text Generation • 23B • Updated 25 days ago • 59 • 7
utter-project/EuroMoE-2.6B-A0.6B-Instruct-2512 Text Generation • 3B • Updated 2 days ago • 226 • 6
mistralai/Voxtral-Mini-4B-Realtime-2602 Automatic Speech Recognition • 4B • Updated about 10 hours ago • 6.24k • 539