35 16 6

Andrei Panferov

BlackSamorez

BlackSamorez

AI & ML interests

NLP

Recent Activity

updated a model 3 days ago

daslab-testing/Apertus-1.7B-it800000-SFT

published a model 3 days ago

daslab-testing/Apertus-1.7B-it800000-SFT

updated a model 16 days ago

daslab-testing/Apertus-1.7B-it800000

View all activity

Organizations

updated a model 3 days ago

daslab-testing/Apertus-1.7B-it800000-SFT

2B • Updated 3 days ago • 76

published a model 3 days ago

daslab-testing/Apertus-1.7B-it800000-SFT

2B • Updated 3 days ago • 76

updated a model 16 days ago

daslab-testing/Apertus-1.7B-it800000

2B • Updated 16 days ago • 11

published a model 16 days ago

daslab-testing/Apertus-1.7B-it800000

2B • Updated 16 days ago • 11

upvoted a paper 28 days ago

DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers

Paper • 2602.02016 • Published Feb 2 • 12

upvoted a paper about 1 month ago

SLIME: Stabilized Likelihood Implicit Margin Enforcement for Preference Optimization

Paper • 2602.02383 • Published Feb 2 • 29

commented a paper about 1 month ago

Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation

Paper • 2601.22813 • Published Jan 30 • 57 •

authored a paper about 1 month ago

Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation

Paper • 2601.22813 • Published Jan 30 • 57

upvoted a paper about 1 month ago

Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation

Paper • 2601.22813 • Published Jan 30 • 57

submitted a paper to Daily Papers about 1 month ago

Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation

Paper • 2601.22813 • Published Jan 30 • 57

updated a model 2 months ago

daslab-testing/Apertus-1.7B-it360000-SFT

2B • Updated Jan 1 • 3 • 1

published a model 2 months ago

daslab-testing/Apertus-1.7B-it360000-SFT

2B • Updated Jan 1 • 3 • 1

upvoted a paper 3 months ago

WUSH: Near-Optimal Adaptive Transforms for LLM Quantization

Paper • 2512.00956 • Published Nov 30, 2025 • 23

New activity in ISTA-DASLab/Meta-Llama-3.1-70B-Instruct-AQLM-PV-2Bit-1x16 3 months ago

VLLM error

#2 opened over 1 year ago by

mlinmg

upvoted a paper 4 months ago

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Paper • 2511.13254 • Published Nov 17, 2025 • 136

updated 5 models 4 months ago

Andrei Panferov

AI & ML interests

Recent Activity

Organizations

BlackSamorez's activity

VLLM error