Pratyush Ranjan Tiwari PRO
pratyushrt
AI & ML interests
Reinforcement Learning, Privacy, Post-training LLMs, SLMs
Recent Activity
liked a Space about 21 hours ago: HuggingFaceTB/smol-training-playbook
updated a Space 4 months ago: eternisai/README
authored a paper 5 months ago: Hard Examples Are All You Need: Maximizing GRPO Post-Training Under Annotation Budgets