Running Featured QED-Nano: Teaching a Tiny Model to Prove Hard Theorems • 41 • Who needs 1T parameters? Olympiad proofs with a 4B model
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper • 2510.14528 • Published Oct 16, 2025 • 118
Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA • May 24, 2023 • 175
Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular • Dec 18, 2025 • 120
Research & Long-Form Blog Posts Collection • In-depth technical articles and research pieces published by Hugging Face • 11 items • Updated 7 days ago • 21
Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper • 2512.20848 • Published Dec 23, 2025 • 38
Post NVIDIA releases Nemotron 3 Nano, a new 30B hybrid reasoning model! Has a 1M context window & best-in-class performance for SWE-Bench, reasoning & chat. Run the MoE model locally with 24GB RAM. GGUF: unsloth/Nemotron-3-Nano-30B-A3B-GGUF. Step-by-step guide: https://docs.unsloth.ai/models/nemotron-3
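A minimal sketch of one way to fetch and run the linked GGUF locally with the `hf` CLI and llama.cpp; the quantization filename is an assumption, so check the Unsloth guide above for the exact variant that fits your 24GB budget.

```shell
# Download only a 4-bit quant of the Nemotron 3 Nano GGUF (filename pattern is an assumption)
hf download unsloth/Nemotron-3-Nano-30B-A3B-GGUF \
  --include "*Q4_K_M*" \
  --local-dir ./nemotron-3-nano

# Run it with llama.cpp's CLI; adjust the path to whatever file the download produced
llama-cli -m ./nemotron-3-nano/Nemotron-3-Nano-30B-A3B-Q4_K_M.gguf \
  -p "Explain mixture-of-experts in one sentence." -n 128
```

The `--include` filter keeps the download to a single quantization instead of the full multi-file repo.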
Article Transformers v5: Simple model definitions powering the AI ecosystem • Dec 1, 2025 • 301
Article Train 400x faster Static Embedding Models with Sentence Transformers • Jan 15, 2025 • 224
Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face • Jul 29, 2025 • 214
Running The Ultra-Scale Playbook • 3.7k • The ultimate guide to training LLMs on large GPU clusters
Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models Paper • 2410.10733 • Published Oct 14, 2024 • 9
Article Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨ • Jul 25, 2025 • 84