Curated models for AI infrastructure, LLM deployment, and edge computing. Optimized for NVIDIA DGX Spark and Docker Swarm clusters.
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 1.16M • • 2.01k -
sentence-transformers/all-MiniLM-L6-v2
Sentence Similarity • 22.7M • Updated • 201M • • 4.71k -
BAAI/bge-large-en-v1.5
Feature Extraction • 0.3B • Updated • 10.3M • • 650 -
meta-llama/Llama-3.3-70B-Instruct
Text Generation • 71B • Updated • 496k • • 2.73k