Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Taeho Hwang's picture
2 9 2

Taeho Hwang

doubleyyh
21world's profile picture mjkmain's profile picture daniel0098's profile picture
ยท
  • ThisIsHwang

AI & ML interests

None yet

Recent Activity

reacted to sergiopaniego's post with ๐Ÿš€ 2 days ago
TRL v0.27.0 is out!! ๐Ÿฅณ It includes GDPO, the latest variant of GRPO for multi-reward RL โœจ GDPO decouples reward normalization to avoid reward collapse and improve per-reward convergence โ€” developed by @sliuau @SimonX et al. Explore the paper: https://huggingface.co/papers/2601.05242 Explore the full set of changes here: https://github.com/huggingface/trl/releases/tag/v0.27.0
liked a Space 13 days ago
SamsungResearch/TRUEBench
upvoted a paper 3 months ago
Adaptive Multi-Agent Response Refinement in Conversational Systems
View all activity

Organizations

KAIST AI's profile picture kormo's profile picture

doubleyyh 's datasets

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs