Hwang yechan PRO
SoonOk
AI & ML interests
AI&ML&ReinforcementLearning&DeepRL &DeepLearning
Recent Activity
upvoted
a
paper
4 days ago
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
updated
a model
29 days ago
SoonOk/AuxKTO