wujinzhu

kimjohn

jinzhuer

AI & ML interests

Large Language Model, Natural Language Processing, Computer Vision

Organizations

None yet

Recent Activity

authored a paper about 8 hours ago

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published 1 day ago • 29

New activity in xai-org/grok-2 6 months ago

Parameter Scale of Grok-2: 270B Total, 115B Activated

🤗 👍 22

#24 opened 6 months ago by

kimjohn

What do we know about the architecture so far?

👍 3

#6 opened 6 months ago by

amgadhasan

New activity in BytedTsinghua-SIA/DAPO-Math-17k 8 months ago

Question about the size of the dataset

👀 ➕ 2

#3 opened 10 months ago by

jessezhaoxizhang

upvoted a paper 9 months ago

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

Paper • 2506.04734 • Published Jun 5, 2025 • 21

commented a paper 10 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18, 2025 • 139

New activity in qihoo360/TinyR1-32B-Preview 12 months ago

What kind of model merge method do you use?

#17 opened 12 months ago by

jiuerbujie

Repeated Thinking Tags in Output Generation

#2 opened 12 months ago by

xldistance

Output repeating

👀 1

#1 opened 12 months ago by

chriswritescode

Output repeats when using Chatbox, and only the second thinking tag appears

#14 opened 12 months ago by

mrguo

Dataset

#13 opened 12 months ago by

PSM24

TypeError argument 'tokens': 'NoneType' object cannot be converted to 'PyString'

#4 opened 12 months ago by

youyc22

upvoted a collection 12 months ago

360Zhinao2

Collection

360Zhinao2 language model, include both base and chat model • 7 items • Updated Oct 15, 2025 • 2