GLM5 - Kernel
#4
by rahul7star - opened
Hey guys does this make sense why we use Flash Attention
https://huggingface.co/rahul7star/LLM-Brain/blob/main/Kernels-GLM5.md
Hey guys does this make sense why we use Flash Attention
https://huggingface.co/rahul7star/LLM-Brain/blob/main/Kernels-GLM5.md