3 5 7

Jian Hu

jianh-nvidia

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago

Jackrong/Qwopus3.5-27B-v3

liked a dataset 9 days ago

ServiceNow/VideoCUA

liked a model 9 days ago

Qwen/Qwen3.5-35B-A3B-FP8

View all activity

Organizations

liked a model 5 days ago

Jackrong/Qwopus3.5-27B-v3

Image-Text-to-Text • 27B • Updated 3 days ago • 11.9k • 154

liked a dataset 9 days ago

ServiceNow/VideoCUA

Updated 9 days ago • 1.76k • 29

liked a model 9 days ago

Qwen/Qwen3.5-35B-A3B-FP8

Image-Text-to-Text • 36B • Updated Feb 26 • 2.06M • 136

liked a model 12 days ago

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

Image-Text-to-Text • 28B • Updated 3 days ago • 561k • 2.51k

New activity in Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled 12 days ago

How do you hack the cot responses?

🤯 1

#51 opened 12 days ago by

jianh-nvidia

upvoted a paper 14 days ago

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

Paper • 2603.18815 • Published 21 days ago • 14

liked a model 28 days ago

TeichAI/Qwen3.5-27B-Claude-Opus-4.6-Distill

Image-Text-to-Text • 27B • Updated Mar 4 • 1.01k • 40

liked a dataset about 1 month ago

HuggingFaceTB/smollm-corpus

Viewer • Updated Sep 6, 2024 • 237M • 42.4k • 448

upvoted a paper 2 months ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 110

upvoted a paper 3 months ago

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

Paper • 2601.09708 • Published Jan 14 • 54

liked a dataset 3 months ago

billxbf/aimo_hard_bilingual

Viewer • Updated Mar 1, 2025 • 3.56k • 6 • 1

upvoted a paper 3 months ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 230

upvoted a paper 4 months ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 106

New activity in nvidia/Nemotron-Research-Reasoning-Qwen-1.5B 5 months ago

Update README.md

#9 opened 5 months ago by

jianh-nvidia

Update README.md

#8 opened 5 months ago by

jianh-nvidia

Update README.md

#8 opened 5 months ago by

jianh-nvidia

Jian Hu

AI & ML interests

Recent Activity

Organizations

jianh-nvidia's activity

How do you hack the cot responses?

Update README.md

Update README.md

Update README.md