- tokyotech-llm/GPT-OSS-Swallow-20B-RL-v0.1
  Text Generation • 21B • Updated • 965 • 9
- tokyotech-llm/GPT-OSS-Swallow-120B-RL-v0.1
  Text Generation • 117B • Updated • 571 • 5
- tokyotech-llm/GPT-OSS-Swallow-20B-SFT-v0.1
  Text Generation • 21B • Updated • 1.8k • 4
- tokyotech-llm/GPT-OSS-Swallow-120B-SFT-v0.1
  Text Generation • 117B • Updated • 2.98k • 2
Organization Card
Swallow LLM
Research and development of large language models, conducted mainly by members of the Okazaki Laboratory and the Yokota Laboratory at the Institute of Science Tokyo (formerly Tokyo Institute of Technology).
- From Okazaki Laboratory, Institute of Science Tokyo, the following members:
  - Naoaki Okazaki
  - Sakae Mizuki
  - Youmi Ma
  - Sangwhan Moon
  - Koki Maeda
  - Masanari Ohi
  - Hinari Shimada
  - Taihei Shiotani
  - Koshiro Saito
  - Tatsuya Ichinose
  - Naoya Matsushita
  - Sora Miyamoto
  - Nguyen Tien Dung
  - Yuta Katayama
- From YOKOTA Laboratory, Institute of Science Tokyo, the following members:
  - Rio Yokota
  - Kazuki Fujii
  - Taishi Nakamura
  - Takumi Okamoto
  - Ishida Shigeki
  - Masaki Kawamura
  - Yukito Tajima
- From Artificial Intelligence Research Center, AIST, Japan, the following members:
- tokyotech-llm/Qwen3-Swallow-8B-RL-v0.2
  Text Generation • 8B • Updated • 716 • 1
- tokyotech-llm/Qwen3-Swallow-30B-A3B-RL-v0.2
  Text Generation • 31B • Updated • 517 • 3
- tokyotech-llm/Qwen3-Swallow-32B-RL-v0.2
  Text Generation • 33B • Updated • 357 • 1
- tokyotech-llm/Qwen3-Swallow-8B-SFT-v0.2
  Text Generation • 8B • Updated • 4.72k • 3
models (135)
- tokyotech-llm/Qwen3-Swallow-32B-RL-v0.2-GPTQ-INT4
  Text Generation • 33B • Updated • 170
- tokyotech-llm/Qwen3-Swallow-30B-A3B-RL-v0.2-AWQ-INT4
  Text Generation • 31B • Updated • 213
- tokyotech-llm/Qwen3-Swallow-30B-A3B-RL-v0.2-GPTQ-INT4
  Text Generation • 31B • Updated • 208
- tokyotech-llm/Qwen3-Swallow-8B-RL-v0.2-GPTQ-INT4
  Text Generation • 8B • Updated • 404
- tokyotech-llm/Qwen3-Swallow-32B-RL-v0.2-AWQ-INT4
  Text Generation • 33B • Updated • 189
- tokyotech-llm/GPT-OSS-Swallow-120B-SFT-v0.1
  Text Generation • 117B • Updated • 2.98k • 2
- tokyotech-llm/GPT-OSS-Swallow-20B-RL-v0.1
  Text Generation • 21B • Updated • 965 • 9
- tokyotech-llm/GPT-OSS-Swallow-120B-RL-v0.1
  Text Generation • 117B • Updated • 571 • 5
- tokyotech-llm/GPT-OSS-Swallow-20B-SFT-v0.1
  Text Generation • 21B • Updated • 1.8k • 4
- tokyotech-llm/Qwen3-Swallow-32B-CPT-v0.2
  Text Generation • 33B • Updated • 158 • 1
datasets (19)
- tokyotech-llm/Swallow-Nemotron-Post-Training-Dataset-v1
  Viewer • Updated • 8.84M • 127 • 2
- tokyotech-llm/lmsys-chat-1m-synth
  Updated • 844 • 18
- tokyotech-llm/s1-test-time-scaling-synth-public
  Viewer • Updated • 59k • 11
- tokyotech-llm/swallow-code-v2
  Viewer • Updated • 147M • 174k • 31
- tokyotech-llm/swallow-math-v2
  Viewer • Updated • 17.4M • 5.47k • 26
- tokyotech-llm/swallow_english_mt_bench
  Viewer • Updated • 80 • 193
- tokyotech-llm/MMLU-ProX-English
  Updated • 287
- tokyotech-llm/MMLU-Pro-English
  Updated • 506
- tokyotech-llm/MMLU-ProX-Japanese
  Updated • 523
- tokyotech-llm/JEMHopQA
  Viewer • Updated • 3.78k • 232