Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
2
sagnik mukherjee
sagnikM
Follow
AI & ML interests
None yet
Recent Activity
updated
a model
12 days ago
sagnikM/grpo_adam_small_beta
published
a model
12 days ago
sagnikM/grpo_adam_small_beta
upvoted
a
paper
2 months ago
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text
View all activity
Organizations
None yet
Papers
2
arxiv:
2505.11711
arxiv:
2410.19054
models
16
Sort: Recently updated
sagnikM/grpo_adam_small_beta
Text Generation
•
2B
•
Updated
12 days ago
•
329
sagnikM/grpo_rmsprop_qwen3-8b_3k_seqlen
8B
•
Updated
Jan 27
sagnikM/grpo_rmsprop_llama3p1_8b_3k_seqlen_1e-7
Text Generation
•
8B
•
Updated
Jan 26
sagnikM/grpo_rmsprop_qwen3_1p7b_3k_seqlen_1e-5
Text Generation
•
2B
•
Updated
Jan 26
•
4
sagnikM/grpo_rmsprop_qwen3_1p7b_3k_seqlen_1e-6
Text Generation
•
2B
•
Updated
Jan 26
•
5
sagnikM/grpo_sgd_qwen3-8b_3k_seqlen_momentum_0p9_1e-2
Text Generation
•
8B
•
Updated
Jan 17
sagnikM/grpo_sgd_llama3p1_8b_3k-seqlen_momentum_0p9_1e-3
Text Generation
•
8B
•
Updated
Jan 15
•
1
sagnikM/grpo_sgd_qwen3_1p7b_3k-seqlen_momentum_0p9_1e-2
Text Generation
•
2B
•
Updated
Jan 15
•
2
sagnikM/grpo_sgd_qwen3_1p7b_3k-seqlen_momentum_0p9_1e-1
Text Generation
•
2B
•
Updated
Jan 15
•
2
sagnikM/grpo_sgd_qwen3-8b_3k_seqlen
Text Generation
•
8B
•
Updated
Dec 25, 2025
•
1
View 16 models
datasets
4
Sort: Recently updated
sagnikM/your_huggingface_dataset_dir
Viewer
•
Updated
Apr 26, 2025
•
4
sagnikM/dataset-mix-cached
Updated
Apr 13, 2025
•
2
sagnikM/eurus-dpo-format
Viewer
•
Updated
Apr 10, 2025
•
115k
•
4
sagnikM/dpo-on-prime
Viewer
•
Updated
Apr 10, 2025
•
300k
•
5