suchen

suc16

AI & ML interests

LLM

Organizations

None yet

upvoted an article about 1 year ago
Article

Proximal Policy Optimization (PPO)

Aug 5, 2022 • 79
upvoted a paper about 1 year ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4, 2025 • 104
upvoted a collection about 1 year ago

Cosmos

Collection
⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/nvidia-cosmos-2 • 14 items • Updated 4 days ago • 300
upvoted a collection over 1 year ago

BGE

Collection
31 items • Updated Feb 4 • 150
upvoted 5 papers over 2 years ago

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Paper • 2308.07926 • Published Aug 15, 2023 • 29

DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales

Paper • 2308.01320 • Published Aug 2, 2023 • 46

Challenges and Applications of Large Language Models

Paper • 2307.10169 • Published Jul 19, 2023 • 51

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 250

Secrets of RLHF in Large Language Models Part I: PPO

Paper • 2307.04964 • Published Jul 11, 2023 • 30