-
Whisper Realtime Transcription (Gradio UI)
π4Transcribe audio in realtime - Gradio UI version
-
DeepSeek R1 Distill Qwen 1.5B Demo Q8
π₯9DeepSeek R1 Distill Qwen 1.5B Demo GGUF(Q8) Fully in CPU
-
Chain of Draft: Thinking Faster by Writing Less
Paper β’ 2502.18600 β’ Published β’ 50 -
Llama-4-Maverick-17B Research
π88Llama-4-Maverick-17B + Real Time Deep Research
Matricardi Fabio
FM-1976
AI & ML interests
control system engineering, AI, LLM with python. ThePoorGPUguy on substack
Recent Activity
liked a model 3 days ago
nvidia/nemotron-speech-streaming-en-0.6b liked a model 16 days ago
codelion/SmolLM2-70M liked a dataset 16 days ago
cerealt/open-image-preferences-v1-binarizedOrganizations
None yet
PAPERS
-
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Paper β’ 2412.13663 β’ Published β’ 162 -
A Survey of Small Language Models
Paper β’ 2410.20011 β’ Published β’ 46 -
No More Adam: Learning Rate Scaling at Initialization is All You Need
Paper β’ 2412.11768 β’ Published β’ 43 -
Chain of Draft: Thinking Faster by Writing Less
Paper β’ 2502.18600 β’ Published β’ 50
SMALL-TINY
A Collection of Small native Models
-
vicgalle/gpt2-alpaca-gpt4
Text Generation β’ 0.1B β’ Updated β’ 995 β’ 25 -
andreaskoepf/pythia-1.4b-gpt4all-pretrain
Text Generation β’ Updated β’ 18 β’ 7 -
EleutherAI/pythia-1b
Text Generation β’ 1B β’ Updated β’ 43.1k β’ 44 -
EleutherAI/pythia-410m-deduped
Text Generation β’ 0.5B β’ Updated β’ 32.1k β’ 20
Image Creation
Good and working HF spaces to create images with Diffusion models
- Runtime errorFeatured2k
Stable Diffusion 3.5 Large
π2kGenerate images with SD3.5
- Running on ZeroFeatured9.41k
FLUX.1 [dev]
π₯9.41kGenerate images from text prompts
- Running on ZeroFeatured5.05k
FLUX.1 [Schnell]
π5.05kGenerate images from text prompts with FLUX.1 Schnell
- Running on Zero1.79k
DALLE 3 XL v2
π₯1.79kGenerate highβquality images from text prompts
Playgrounds
GRADIO examples
- Runtime error4
Whisper Realtime Transcription (Gradio UI)
π4Transcribe audio in realtime - Gradio UI version
- Running9
DeepSeek R1 Distill Qwen 1.5B Demo Q8
π₯9DeepSeek R1 Distill Qwen 1.5B Demo GGUF(Q8) Fully in CPU
-
Chain of Draft: Thinking Faster by Writing Less
Paper β’ 2502.18600 β’ Published β’ 50 - Running88
Llama-4-Maverick-17B Research
π88Llama-4-Maverick-17B + Real Time Deep Research
Image Creation
Good and working HF spaces to create images with Diffusion models
- Runtime errorFeatured2k
Stable Diffusion 3.5 Large
π2kGenerate images with SD3.5
- Running on ZeroFeatured9.41k
FLUX.1 [dev]
π₯9.41kGenerate images from text prompts
- Running on ZeroFeatured5.05k
FLUX.1 [Schnell]
π5.05kGenerate images from text prompts with FLUX.1 Schnell
- Running on Zero1.79k
DALLE 3 XL v2
π₯1.79kGenerate highβquality images from text prompts
PAPERS
-
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Paper β’ 2412.13663 β’ Published β’ 162 -
A Survey of Small Language Models
Paper β’ 2410.20011 β’ Published β’ 46 -
No More Adam: Learning Rate Scaling at Initialization is All You Need
Paper β’ 2412.11768 β’ Published β’ 43 -
Chain of Draft: Thinking Faster by Writing Less
Paper β’ 2502.18600 β’ Published β’ 50
Playgrounds
SMALL-TINY
A Collection of Small native Models
-
vicgalle/gpt2-alpaca-gpt4
Text Generation β’ 0.1B β’ Updated β’ 995 β’ 25 -
andreaskoepf/pythia-1.4b-gpt4all-pretrain
Text Generation β’ Updated β’ 18 β’ 7 -
EleutherAI/pythia-1b
Text Generation β’ 1B β’ Updated β’ 43.1k β’ 44 -
EleutherAI/pythia-410m-deduped
Text Generation β’ 0.5B β’ Updated β’ 32.1k β’ 20