12 20

Henry

danzh0

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

Qwen/Qwen3.5-397B-A17B-GPTQ-Int4

upvoted a paper about 2 months ago

When Models Manipulate Manifolds: The Geometry of a Counting Task

liked a model 2 months ago

unsloth/Qwen3-Coder-Next-GGUF

View all activity

Organizations

None yet

liked a model about 1 month ago

Qwen/Qwen3.5-397B-A17B-GPTQ-Int4

Image-Text-to-Text • Updated Mar 3 • 26.7k • 21

upvoted a paper about 2 months ago

When Models Manipulate Manifolds: The Geometry of a Counting Task

Paper • 2601.04480 • Published Jan 8 • 4

liked 5 models 2 months ago

upvoted an article 2 months ago

Article

Open Responses: What you need to know

Jan 15

•

111

upvoted an article 3 months ago

Article

Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models

Jan 6

•

liked 3 models 3 months ago

zai-org/GLM-4.7-Flash

Text Generation • 31B • Updated Jan 29 • 925k • • 1.66k

meituan-longcat/LongCat-Flash-Thinking-2601

Text Generation • 562B • Updated Jan 23 • 3.43k • 108

unsloth/Qwen3-Next-80B-A3B-Instruct-GGUF

Text Generation • 80B • Updated Jan 14 • 11.7k • 174

upvoted an article 3 months ago

Article

Deriving the PPO Loss from First Principles

Dec 25, 2025

•

liked a model 3 months ago

MiniMaxAI/MiniMax-M2.1

Text Generation • 229B • Updated Feb 13 • 34k • • 1.27k

upvoted an article 3 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Sep 11, 2025

•

186

upvoted an article 4 months ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Dec 18, 2025

•

124

liked 3 models 4 months ago

apple/Sharp

Image-to-3D • Updated Dec 18, 2025 • 12.6k • 367

janhq/Jan-v2-VL-high-gguf

Image-Text-to-Text • 8B • Updated Nov 26, 2025 • 118k • 37

apple/starflow

Updated Jan 29 • 282

upvoted an article 4 months ago

Article

Introducing swift-huggingface: The Complete Swift Client for Hugging Face

Dec 5, 2025

•

Henry

AI & ML interests

Recent Activity

Organizations

danzh0's activity

Open Responses: What you need to know

Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models

Deriving the PPO Loss from First Principles

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Introducing swift-huggingface: The Complete Swift Client for Hugging Face