OpenTransformer/llama.cpp-prismml
arxiv: 2302.13971, 2005.14165, 2203.02155
main
llama.cpp-prismml/ggml/src/ggml-cpu/arch (1.76 MB)
1 contributor
History: 2 commits
OpenTransformer: perf: optimized AVX2 kernel + COM6-inspired matmul dispatch (0.2 -> 3.43 t/s) [8f4b822, verified, 9 days ago]
arm        Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)            9 days ago
loongarch  Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)            9 days ago
powerpc    Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)            9 days ago
riscv      Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)            9 days ago
s390       Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)            9 days ago
wasm       Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)            9 days ago
x86        perf: optimized AVX2 kernel + COM6-inspired matmul dispatch (0.2 -> 3.43 t/s)   9 days ago