Running • 5 • TurboQuant on Consumer GPUs: 100K Context on RTX 3090, 64K on RTX 4070 • Extend LLM context to 100K tokens on consumer GPUs
ibm-granite/granite-docling-258M Image-Text-to-Text • 0.3B • Updated Sep 23, 2025 • 63.8k • 1.15k
Qwen/Qwen3-Coder-30B-A3B-Instruct Text Generation • 31B • Updated Dec 3, 2025 • 2.16M • 1.02k
Running on Zero • Agents • Featured • 260 • SmolDocling • Convert images and queries into structured document text