R PRO
juiceb0xc0de
AI & ML interests
destroying heuristic determination in 4 dimensions to flood the engines with diversity and a lot of swear words
Recent Activity
reacted
to
danielhanchen's
post with ๐ฅ 1 day ago
We collaborated with NVIDIA to teach you about Reinforcement Learning and RL environments. ๐ Learn:
โข Why RL environments matter + how to build them
โข When RL is better than SFT
โข GRPO and RL best practices
โข How verifiable rewards and RLVR work
Blog: https://unsloth.ai/blog/rl-environments liked
a model 1 day ago
juiceb0xc0de/bella-bartender-8b-llama3.1 liked
a model 2 days ago
tiiuae/Falcon-H1-7B-Instruct