Pruned version of Nemotron-3-Nano-20B-A3B to run with full context on an RTX 5080
#12 opened about 2 months ago
by
pirola
Install & run this model easily using llmpm
#11 opened about 2 months ago
by
sarthak-saxena
quantization script (not QAD)
1
#9 opened 3 months ago
by
cudaoom
AttributeError: 'NemotronHConfig' object has no attribute 'rms_norm_eps'
👍 1
#6 opened 3 months ago
by
spakment
Genuinely very impressive model
1
#5 opened 3 months ago
by
szilard995
Efficiency of NVFP4 vs FP16/8
➕ 3
#4 opened 3 months ago
by
Michalea
Tool use crashes the model
6
#3 opened 3 months ago
by
ruben-bibsyst