https://github.com/jzhang38/LongMamba
Zhang Peiyuan
PY007
AI & ML interests
None yet
Organizations
EasyContext
https://github.com/jzhang38/EasyContext
-
PY007/slimpajama_llama_tokenized_upsample_4096_chunk_1M
Viewer • Updated • 5.04k • 87 • 2 -
PY007/slimpajama_llama_tokenized_upsample_4096_chunk_256K
Viewer • Updated • 3.94k • 99 • 1 -
PY007/EasyContext-1M-Llama-2-7B
Text Generation • 7B • Updated • 13 • 4 -
PY007/slimpajama_mistral_tokenized_upsample_4096_chunk_128K
Viewer • Updated • 37.9k • 130
LongMamba
https://github.com/jzhang38/LongMamba
EasyContext
https://github.com/jzhang38/EasyContext
-
PY007/slimpajama_llama_tokenized_upsample_4096_chunk_1M
Viewer • Updated • 5.04k • 87 • 2 -
PY007/slimpajama_llama_tokenized_upsample_4096_chunk_256K
Viewer • Updated • 3.94k • 99 • 1 -
PY007/EasyContext-1M-Llama-2-7B
Text Generation • 7B • Updated • 13 • 4 -
PY007/slimpajama_mistral_tokenized_upsample_4096_chunk_128K
Viewer • Updated • 37.9k • 130
models 5
PY007/slimpajama_LLAMA3_tokenized_chunk_512K_debug
Updated
PY007/vicuna-7b-v1.5
Text Generation • 7B • Updated • 3
PY007/EasyContext-256K-danube2-1.8b
Text Generation • 2B • Updated • 6 • 5
PY007/EasyContext-1M-Llama-2-7B
Text Generation • 7B • Updated • 13 • 4
PY007/LongMamba_16384_bs128_step400
Updated • 9 • 5
datasets 27
PY007/Attn-QAT
Viewer • Updated • 3 • 217
PY007/bf16_videos
Viewer • Updated • 3 • 170
PY007/nvfp4_videos
Viewer • Updated • 3 • 214
PY007/sage3_videos
Viewer • Updated • 3 • 214
PY007/crush-smol
Viewer • Updated • 4 • 19
PY007/slimpajama_Qwen2_tokenized_upsample_4096_chunk_256K
Viewer • Updated • 6.79k • 73
PY007/slimpajama_Yi1.5_tokenized_upsample_4096_chunk_256K
Viewer • Updated • 7.48k • 77
PY007/slimpajama_llama2_tokenized_upsample_4096_chunk_256K
Viewer • Updated • 7.79k • 81
PY007/slimpajama_LLAMA3_tokenized_upsample_4096_chunk_256K
Viewer • Updated • 6.64k • 56
PY007/wild_chat_llama3_template_tokenized_merged_1M
Viewer • Updated • 1.27k • 39