4 34 80

Mann Patel

manncodes

AI & ML interests

NLP, Mech Interp, Reasoning, MLSystems

Recent Activity

liked a dataset 14 days ago

facebook/principia-bench

liked a dataset 14 days ago

facebook/principia-collection

liked a dataset about 1 month ago

wenjiema02/ProofBench

View all activity

Organizations

None yet

liked 2 datasets 14 days ago

facebook/principia-bench

Viewer • Updated Dec 18, 2025 • 2.24k • 196 • 19

facebook/principia-collection

Viewer • Updated Dec 19, 2025 • 554k • 311 • 44

liked a dataset about 1 month ago

wenjiema02/ProofBench

Viewer • Updated Oct 14, 2025 • 899 • 114 • 7

New activity in how2everything/how2bench about 2 months ago

Query about the License

#3 opened about 2 months ago by

manncodes

upvoted a collection about 2 months ago

How2Everything data

Collection

Data release for "How2Everything: Mining the Web for How-To Procedures to Evaluate and Improve LLMs" • 5 items • Updated Mar 2 • 2

liked a model about 2 months ago

osmosis-ai/Osmosis-Structure-0.6B

0.6B • Updated Jun 13, 2025 • 848 • 412

upvoted 5 papers 3 months ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 229

Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models

Paper • 2512.13607 • Published Dec 15, 2025 • 37

liked a model 3 months ago

XiaomiMiMo/MiMo-V2-Flash

Text Generation • 310B • Updated Feb 27 • 49.1k • • 694

upvoted an article 4 months ago

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

Dec 9, 2025

•

upvoted a collection 4 months ago

Nemotron-Post-Training-v3

Collection

Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 28 items • Updated 2 days ago • 108

liked a model 4 months ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

Text Generation • 32B • Updated 19 days ago • 1.17M • • 330

New activity in allenai/Dolci-Think-RL-32B 4 months ago

decoding the coding ground truths

#1 opened 4 months ago by

manncodes

upvoted a collection 4 months ago

Tiny-A2D

Collection

Small diffusion language models adapted from AR models • 4 items • Updated Dec 6, 2025 • 18

upvoted a paper 4 months ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 106

liked 2 datasets 4 months ago

Anthropic/AnthropicInterviewer

Viewer • Updated Jan 6 • 1.25k • 942 • 364

nvidia/AceReason-Math

Viewer • Updated Jun 18, 2025 • 49.6k • 837 • 49

Mann Patel

AI & ML interests

Recent Activity

Organizations

manncodes's activity

Query about the License

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

decoding the coding ground truths