This is a collection of models that I've trained on data collected through conversations with frontier models GPT, Claude, Perplexity and myself.
R PRO
juiceb0xc0de
AI & ML interests
destroying heuristic determination in 4 dimensions to flood the engines with diversity and a lot of swear words
Recent Activity
reacted
to
danielhanchen's
post with š„ about 17 hours ago
We collaborated with NVIDIA to teach you about Reinforcement Learning and RL environments. š Learn:
⢠Why RL environments matter + how to build them
⢠When RL is better than SFT
⢠GRPO and RL best practices
⢠How verifiable rewards and RLVR work
Blog: https://unsloth.ai/blog/rl-environments liked
a model about 23 hours ago
juiceb0xc0de/bella-bartender-8b-llama3.1 liked
a model 1 day ago
tiiuae/Falcon-H1-7B-Instruct