Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2224.2
TFLOPS
24
19
30
Loser Cheems
JingzeShi
Follow
21world's profile picture
kroeke's profile picture
Patsmithjoe's profile picture
43 followers
·
22 following
https://github.com/LoserCheems
LoserCheems
AI & ML interests
I like training small languge models.
Recent Activity
authored
a paper
8 days ago
Towards Automated Kernel Generation in the Era of LLMs
authored
a paper
8 days ago
OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale
upvoted
a
paper
9 days ago
OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale
View all activity
Organizations
JingzeShi
's models
7
Sort:Â Recently updated
JingzeShi/OpenSeek-1.4B-A0.4B-KTO
Text Generation
•
1B
•
Updated
Sep 9, 2025
•
4
JingzeShi/OpenSeek-1.4B-A0.4B
Text Generation
•
1B
•
Updated
Aug 24, 2025
•
3
JingzeShi/Doge-20M
Text Generation
•
37.6M
•
Updated
Jul 5, 2025
•
1
JingzeShi/Doge-320M-Reason-checkpoint
0.4B
•
Updated
May 15, 2025
•
2
JingzeShi/Doge-320M-Reason-Distill
Text Generation
•
0.3B
•
Updated
Mar 29, 2025
•
1
JingzeShi/Doge-120M-MoE
0.1B
•
Updated
Mar 20, 2025
•
1
JingzeShi/Mixtral-7B-v0.1
Text Generation
•
Updated
Mar 4, 2025
•
2