36 GB

Ctrl+K

1 contributor

History: 11 commits

XuebinWang

Update models and readme with accuracy number and disclaimer (#7)

5cfde48 verified 3 months ago

.gitattributes

1.57 kB
Initial commit to be used in vllm PR#27334 (#1) 6 months ago
LICENSE

11.4 kB
Update README and upload original files (#6) 6 months ago
README.md

7.22 kB
Update models and readme with accuracy number and disclaimer (#7) 3 months ago
USAGE_POLICY

200 Bytes
Update README and upload original files (#6) 6 months ago
chat_template.jinja

16.7 kB
Initial commit to be used in vllm PR#27334 (#1) 6 months ago
config.json

11.5 kB
Update models and readme with accuracy number and disclaimer (#7) 3 months ago
generation_config.json

172 Bytes
Initial commit to be used in vllm PR#27334 (#1) 6 months ago
model-00001-of-00003.safetensors

5 GB
xet

Update models and readme with accuracy number and disclaimer (#7) 3 months ago
model-00001-of-00005.safetensors

5 GB
xet

Change to FP8 customized attention quantization, and update README (#4) 6 months ago
model-00002-of-00003.safetensors

4.99 GB
xet

Update models and readme with accuracy number and disclaimer (#7) 3 months ago
model-00002-of-00005.safetensors

5 GB
xet

Change to FP8 customized attention quantization, and update README (#4) 6 months ago
model-00003-of-00003.safetensors

3.77 GB
xet

Update models and readme with accuracy number and disclaimer (#7) 3 months ago
model-00003-of-00005.safetensors

4.99 GB
xet

Change to FP8 customized attention quantization, and update README (#4) 6 months ago
model-00004-of-00005.safetensors

4.99 GB
xet

Change to FP8 customized attention quantization, and update README (#4) 6 months ago
model-00005-of-00005.safetensors

2.18 GB
xet

Change to FP8 customized attention quantization, and update README (#4) 6 months ago
model.safetensors.index.json

607 kB
Update models and readme with accuracy number and disclaimer (#7) 3 months ago
special_tokens_map.json

323 Bytes
Initial commit to be used in vllm PR#27334 (#1) 6 months ago
tokenizer.json

27.9 MB
xet

Initial commit to be used in vllm PR#27334 (#1) 6 months ago
tokenizer_config.json

4.22 kB
Initial commit to be used in vllm PR#27334 (#1) 6 months ago