Just published: Nano-vLLM meets Inference Endpoints
I show how to bind Nano-vLLM (supporting Qwen3-0.6B) to a web service and deploy it easily on Hugging Face Inference Endpoints. Minimalist engine, maximum fun!
https://huggingface.co/blog/angt/nano-vllm-meets-inference-endpoints
Specification-induced correlations: Evaluate gender pronoun predictions in text using BERT models