Update README.md

#4
by YizheZ - opened
Files changed (1)
  1. README.md +5 -0
README.md CHANGED
@@ -15,6 +15,11 @@ This model is an example of the **Simple Self-Distillation (SimpleSD)** method t
  - **Self-distillation sampling:** temperature=1.1, top_p=0.95, top_k=20
  - **Evaluation sampling:** temperature=0.7, top_p=0.95, top_k=20
  
+ paper: https://arxiv.org/abs/2604.01193
+
+ code: https://github.com/apple/ml-ssd
+
+
  ## Notes
  - These are research checkpoints for reproducibility.
  - They are not optimized Qwen releases.