GLM-4-9B
2024/08/12: The code in this repository has been updated and now uses transformers>=4.44.0. Please update your dependencies promptly.
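GLM-4-9B is the open-source version of GLM-4, the latest generation of pre-trained models released by Zhipu AI. In evaluations on datasets covering semantics, mathematics, reasoning, code, and knowledge, both GLM-4-9B and its human-preference-aligned version GLM-4-9B-Chat show strong performance, surpassing Llama-3-8B. In addition to multi-turn dialogue, GLM-4-9B-Chat offers advanced features such as web browsing, code execution, custom tool calling (Function Call), and long-text reasoning (supporting up to 128K context). This generation adds multilingual support, covering 26 languages including Japanese, Korean, and German. We have also released GLM-4-9B-Chat-1M, which supports a 1M context length (about 2 million Chinese characters), and GLM-4V-9B, a multimodal model based on GLM-4-9B. GLM-4V-9B supports bilingual (Chinese and English) multi-turn dialogue at a high resolution of 1120 * 1120. In multimodal evaluations covering overall Chinese and English ability, perception and reasoning, text recognition, and chart understanding, GLM-4V-9B surpasses GPT-4-turbo-2024-04-09, Gemini 1.0 Pro, Qwen-VL-Max, and Claude 3 Opus.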
We evaluated the GLM-4-9B base model on several typical tasks. The results are as follows:
| Model | MMLU | C-Eval | GPQA | GSM8K | MATH | HumanEval |
|---|---|---|---|---|---|---|
| Llama-3-8B | 66.6 | 51.2 | - | 45.8 | - | - |
| Llama-3-8B-Instruct | 68.4 | 51.3 | 34.2 | 79.6 | 30.0 | 62.2 |
| ChatGLM3-6B-Base | 61.4 | 69.0 | - | 72.3 | 25.7 | - |
| GLM-4-9B | 74.7 | 77.1 | 34.3 | 84.0 | 30.4 | 70.1 |
For more inference code and dependency information, please visit our GitHub.
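The update note at the top of this page requires transformers>=4.44.0. As a minimal sketch (not part of the official repository), the check below compares the locally installed version against that minimum using only the Python standard library. The helper names `release_tuple`, `meets_minimum`, and `check_transformers` are hypothetical, and the parsing is a simplification: it only looks at the leading numeric components, so a pre-release such as `4.44.0rc1` is treated like `4.44.0`.

```python
from importlib import metadata

# Minimum version taken from the update note in this README.
MINIMUM_TRANSFORMERS = "4.44.0"


def release_tuple(version: str) -> tuple:
    """Turn '4.44.0' or '4.45.0.dev0' into a tuple of its leading integers."""
    parts = []
    for piece in version.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break  # stop at the first non-numeric component, e.g. 'dev0'
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed: str, minimum: str = MINIMUM_TRANSFORMERS) -> bool:
    """Return True when the installed release is at least the minimum."""
    a, b = release_tuple(installed), release_tuple(minimum)
    width = max(len(a), len(b))
    # Pad with zeros so that '5.0' and '5.0.0' compare as equal.
    a = a + (0,) * (width - len(a))
    b = b + (0,) * (width - len(b))
    return a >= b


def check_transformers() -> str:
    """Describe whether the local transformers install satisfies the requirement."""
    try:
        installed = metadata.version("transformers")
    except metadata.PackageNotFoundError:
        return "transformers is not installed"
    if meets_minimum(installed):
        return f"transformers {installed} is new enough"
    return f"transformers {installed} is older than {MINIMUM_TRANSFORMERS}"


if __name__ == "__main__":
    print(check_transformers())
```

Comparing integer tuples rather than raw strings matters here: as strings, "4.9.0" would sort after "4.44.0", which is the wrong answer.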
This repository contains the base version of GLM-4-9B, which supports a context length of 8K.
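To make the 8K limit concrete, here is a small sketch of how a caller might budget tokens before generation. It rests on two assumptions that this page does not confirm: that "8K" means 8192 tokens, and that the limit covers the prompt and the generated tokens together. The function names are hypothetical, and the token ids are plain integers standing in for tokenizer output.

```python
# Assumption: "8K" is read as 8 * 1024 = 8192 tokens.
CONTEXT_LENGTH = 8 * 1024


def max_new_tokens_allowed(prompt_tokens: int,
                           context_length: int = CONTEXT_LENGTH) -> int:
    """Tokens left for generation after the prompt, never negative."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    return max(context_length - prompt_tokens, 0)


def truncate_prompt(token_ids: list,
                    reserve_for_output: int,
                    context_length: int = CONTEXT_LENGTH) -> list:
    """Keep the most recent tokens so prompt plus reserved output fits."""
    budget = context_length - reserve_for_output
    if budget <= 0:
        raise ValueError("reserve_for_output leaves no room for the prompt")
    # Negative slicing keeps the tail; shorter prompts are returned unchanged.
    return token_ids[-budget:]


if __name__ == "__main__":
    prompt = list(range(10_000))           # stand-in for tokenizer output
    trimmed = truncate_prompt(prompt, reserve_for_output=192)
    print(len(trimmed), max_new_tokens_allowed(len(trimmed)))
```

Truncating from the left keeps the most recent text, which is one common choice for a base model continuing a document; other strategies (such as keeping the beginning) may suit other tasks better.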
License
Use of the GLM-4 model weights must comply with the LICENSE.
Citation
If you find our work helpful, please consider citing the following paper.
@misc{glm2024chatglm,
title={ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools},
author={Team GLM and Aohan Zeng and Bin Xu and Bowen Wang and Chenhui Zhang and Da Yin and Diego Rojas and Guanyu Feng and Hanlin Zhao and Hanyu Lai and Hao Yu and Hongning Wang and Jiadai Sun and Jiajie Zhang and Jiale Cheng and Jiayi Gui and Jie Tang and Jing Zhang and Juanzi Li and Lei Zhao and Lindong Wu and Lucen Zhong and Mingdao Liu and Minlie Huang and Peng Zhang and Qinkai Zheng and Rui Lu and Shuaiqi Duan and Shudan Zhang and Shulin Cao and Shuxun Yang and Weng Lam Tam and Wenyi Zhao and Xiao Liu and Xiao Xia and Xiaohan Zhang and Xiaotao Gu and Xin Lv and Xinghan Liu and Xinyi Liu and Xinyue Yang and Xixuan Song and Xunkai Zhang and Yifan An and Yifan Xu and Yilin Niu and Yuantao Yang and Yueyan Li and Yushi Bai and Yuxiao Dong and Zehan Qi and Zhaoyu Wang and Zhen Yang and Zhengxiao Du and Zhenyu Hou and Zihan Wang},
year={2024},
eprint={2406.12793},
archivePrefix={arXiv},
primaryClass={cs.CL}
}