In a Training Loop 🔄

R PRO

juiceb0xc0de

JuiceB0xC0de

AI & ML interests

destroying heuristic determination in 4 dimensions to flood the engines with diversity and a lot of swear words

Recent Activity

reacted to danielhanchen's post with 🔥 1 day ago

We collaborated with NVIDIA to teach you about Reinforcement Learning and RL environments. 💚
Learn:
• Why RL environments matter + how to build them
• When RL is better than SFT
• GRPO and RL best practices
• How verifiable rewards and RLVR work
Blog: https://unsloth.ai/blog/rl-environments

liked a model 1 day ago

juiceb0xc0de/bella-bartender-8b-llama3.1

liked a model 2 days ago

tiiuae/Falcon-H1-7B-Instruct

New activity in juiceb0xc0de/bella-bartender-8b-llama3.1 4 days ago

quantize error

#1 opened 6 days ago by
