Prompt template for nemotron model
#60 opened 15 days ago
by
falcon25051997
What a hell? Useless for story
1
#58 opened about 1 month ago
by
kellysan
math-oai.yaml file for aime eval
#56 opened about 1 month ago
by
Michalea
Install & run this model easily using llmpm
#55 opened about 2 months ago
by
sarthak-saxena
Update nano_v3_reasoning_parser.py
#54 opened 2 months ago
by
HyzeAI
Add GPQA evaluation result
#43 opened 3 months ago
by
burtenshaw
what is the implementation of the bench "AIME25 (with tools)"?
1
#42 opened 3 months ago
by
YF-T
[Research] Adaptive-K Routing Validation: 33% Compute Savings on Nemotron 3 Nano
❤️ 3
#41 opened 3 months ago
by
Gabrobals
Correct `get_decoder`/`set_decoder`
3
#40 opened 3 months ago
by
kylemylonakisprotopia
Is this model going to be seriously considered? Seeking Official Channels to Contact the Model’s Developers or an Active Community
1
#36 opened 4 months ago
by
j3st3r666
Inquiry about Nemotron 3 Nano technical report training details
1
#34 opened 4 months ago
by
andresnowak
Failure in basic question, is it any good at programming
👀 1
9
#31 opened 4 months ago
by
engrtipusultan
Recommended parameters?
1
#30 opened 4 months ago
by
leonsarmiento
Fix streaming output when enable_thinking is disabled
1
#29 opened 4 months ago
by
Kwindla
No, Bad Logic (((
2
#28 opened 4 months ago
by
BuBaLoM
vLLM implementation for reasoning budget
2
#27 opened 4 months ago
by
lssj14
Tool calling issue: got "True" as a String instead of a valid JSON format such as true (the primitive, unquoted value)
3
#25 opened 4 months ago
by
j3st3r666
Recommended way of fine-tuning?
2
#17 opened 5 months ago
by
devon-kindo
Unexpected... "Performance"?
👍 2
9
#15 opened 5 months ago
by
ponzles
doesn't do kv caching on transformers
➕ 2
2
#14 opened 5 months ago
by
adaface-neurips
Does not work with dgx spark
🔥 1
6
#13 opened 5 months ago
by
sotaaa
Actual context length
6
#12 opened 5 months ago
by
yuchsiao
I really hope this model works
1
#8 opened 5 months ago
by
BVEsun
Simple minesweeper game is failing.
1
#7 opened 5 months ago
by
robert1968
Good model but it is very flawed in recalling input
6
#5 opened 5 months ago
by
cmp-nct
Problem working with long text
5
#4 opened 5 months ago
by
Kosh69
Tool calling with reasoning parsing broken
11
#3 opened 5 months ago
by
nephepritou