Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
2
sagnik mukherjee
sagnikM
Follow
AI & ML interests
None yet
Recent Activity
updated
a model
12 days ago
sagnikM/grpo_adam_small_beta
published
a model
12 days ago
sagnikM/grpo_adam_small_beta
upvoted
a
paper
2 months ago
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text
View all activity
Organizations
None yet
sagnikM
's models
16
Sort: Recently updated
sagnikM/grpo_adam_small_beta
Text Generation
•
2B
•
Updated
12 days ago
•
329
sagnikM/grpo_rmsprop_qwen3-8b_3k_seqlen
8B
•
Updated
Jan 27
sagnikM/grpo_rmsprop_llama3p1_8b_3k_seqlen_1e-7
Text Generation
•
8B
•
Updated
Jan 26
sagnikM/grpo_rmsprop_qwen3_1p7b_3k_seqlen_1e-5
Text Generation
•
2B
•
Updated
Jan 26
•
4
sagnikM/grpo_rmsprop_qwen3_1p7b_3k_seqlen_1e-6
Text Generation
•
2B
•
Updated
Jan 26
•
5
sagnikM/grpo_sgd_qwen3-8b_3k_seqlen_momentum_0p9_1e-2
Text Generation
•
8B
•
Updated
Jan 17
sagnikM/grpo_sgd_llama3p1_8b_3k-seqlen_momentum_0p9_1e-3
Text Generation
•
8B
•
Updated
Jan 15
•
1
sagnikM/grpo_sgd_qwen3_1p7b_3k-seqlen_momentum_0p9_1e-2
Text Generation
•
2B
•
Updated
Jan 15
•
2
sagnikM/grpo_sgd_qwen3_1p7b_3k-seqlen_momentum_0p9_1e-1
Text Generation
•
2B
•
Updated
Jan 15
•
2
sagnikM/grpo_sgd_qwen3-8b_3k_seqlen
Text Generation
•
8B
•
Updated
Dec 25, 2025
•
1
sagnikM/grpo_adam_qwen3-8b_3k_seqlen
Text Generation
•
8B
•
Updated
Dec 25, 2025
•
1
sagnikM/grpo_adam_llama3.1_8b_instruct_adam
Text Generation
•
8B
•
Updated
Dec 22, 2025
•
1
sagnikM/ppo_sgd_qwen3_1.7b_1e-2_critic_adamW
Text Generation
•
2B
•
Updated
Dec 22, 2025
•
2
sagnikM/ppo_sgd_qwen3_1.7b_1e-2
Text Generation
•
2B
•
Updated
Dec 22, 2025
•
1
sagnikM/ppo_adam_qwen3_1.7b
Text Generation
•
2B
•
Updated
Dec 22, 2025
•
2
sagnikM/bert-finetuned-ner
Updated
Apr 12, 2022