OpenTransformer/llama.cpp-prismml
llama.cpp-prismml/ggml/src/ggml-cuda/diagmask.cuh
OpenTransformer: Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp), commit 03ba2cd (verified), 27 days ago
#include "common.cuh"

#define CUDA_DIAG_MASK_INF_BLOCK_SIZE 32

void ggml_cuda_op_diag_mask_inf(ggml_backend_cuda_context & ctx, ggml_tensor * dst);