khazarai/Qwen3-4B-Qwen3.6-plus-Reasoning-Distilled Text Generation • 4B • Updated about 6 hours ago • 1.09k • 3
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 707
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 84 items • Updated 1 day ago • 518
view post Post 1995 What do you think of my LLM Chat app so far? Here are some of the features already included (and more are coming):- Chat with AI models – Local inference via Ollama- Reasoning support – View model thinking process (DeepSeek-R1, Qwen-QwQ, etc.)- Vision models – Analyze images with llava, bakllava, moondream- Image generation – Local GGUF models with GPU acceleration (CUDA)- Fullscreen images – Click generated images to view in fullscreen- Image attachments – File picker or clipboard paste (Ctrl+V)- DeepSearch – Web search with tool use- Inference Stats – Token counts, speed, duration (like Ollama verbose)- Regenerate – Re-run any AI response- Copy – One-click copy AI responses See translation 5 replies · 👀 4 4 + Reply