MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization Paper • 2602.03537 • Published 26 days ago • 3
DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers Paper • 2602.02016 • Published 27 days ago • 12
Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation Paper • 2601.22813 • Published about 1 month ago • 57
tencent/HunyuanImage-3.0-Instruct-Distil Image-to-Image • 83B • Updated 26 days ago • 943 • 50
WUSH: Near-Optimal Adaptive Transforms for LLM Quantization Paper • 2512.00956 • Published Nov 30, 2025 • 23
TiDAR: Think in Diffusion, Talk in Autoregression Paper • 2511.08923 • Published Nov 12, 2025 • 128
A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons Article • Published Feb 4, 2025 • 30
The Smol Training Playbook 📚 Space • The secrets to building world-class LLMs • 3.02k