Update README.md
Browse files
README.md
CHANGED
|
@@ -27,7 +27,7 @@ Theta-35 is the advanced reasoning model in the Theta series by SVECTOR. Compare
|
|
| 27 |
- Number of Parameters (Non-Embedding): 33B
|
| 28 |
- Number of Layers: 64
|
| 29 |
- Number of Attention Heads (GQA): 40 for Q and 8 for KV
|
| 30 |
-
- Context Length: Full
|
| 31 |
- Sliding Window: 32,768 tokens
|
| 32 |
|
| 33 |
**Note:** For the best experience, please review the [usage guidelines](#usage-guidelines) before deploying Theta models.
|
|
|
|
| 27 |
- Number of Parameters (Non-Embedding): 33B
|
| 28 |
- Number of Layers: 64
|
| 29 |
- Number of Attention Heads (GQA): 40 for Q and 8 for KV
|
| 30 |
+
- Context Length: Full 32k tokens
|
| 31 |
- Sliding Window: 32,768 tokens
|
| 32 |
|
| 33 |
**Note:** For the best experience, please review the [usage guidelines](#usage-guidelines) before deploying Theta models.
|