guankoala's picture

1 3

guankoala

guankoala

·

purekoala

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing

liked a Space about 1 year ago

nanotron/ultrascale-playbook

liked a model over 1 year ago

medxiaorudan/CodeLlama_CPP_FineTuned

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing

Paper • 2602.03560 • Published Feb 3 • 47

liked a Space about 1 year ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

liked 2 models over 1 year ago

medxiaorudan/CodeLlama_CPP_FineTuned

Updated Jan 24, 2024 • 4 • 1

ajibawa-2023/Code-Llama-3-8B

Text Generation • 8B • Updated May 8, 2024 • 103 • 31