Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Taeho Hwang's picture
2 9 2

Taeho Hwang

doubleyyh
starsuzi's profile picture mjkmain's profile picture 21world's profile picture
·
  • ThisIsHwang

AI & ML interests

None yet

Recent Activity

reacted to sergiopaniego's post with 🚀 about 5 hours ago
TRL v0.27.0 is out!! 🥳 It includes GDPO, the latest variant of GRPO for multi-reward RL ✨ GDPO decouples reward normalization to avoid reward collapse and improve per-reward convergence — developed by @sliuau @SimonX et al. Explore the paper: https://huggingface.co/papers/2601.05242 Explore the full set of changes here: https://github.com/huggingface/trl/releases/tag/v0.27.0
liked a Space 11 days ago
SamsungResearch/TRUEBench
upvoted a paper 3 months ago
Adaptive Multi-Agent Response Refinement in Conversational Systems
View all activity

Organizations

KAIST AI's profile picture kormo's profile picture

Papers 3

arxiv:2502.05609
arxiv:2412.12559
arxiv:2404.13948

models 4

doubleyyh/email-tuned-qwen2-lora

Text Generation • Updated Dec 26, 2024

doubleyyh/mixed-bge-m3-email

Sentence Similarity • 0.6B • Updated Dec 25, 2024

doubleyyh/exit-gemma-2b

Updated Dec 21, 2024 • 1

doubleyyh/exit-gemma-7b

Updated Dec 21, 2024

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs