Uploaded model
- Developed by: lucaelin
- License: apache-2.0
- Finetuned from model : lucaelin/llama-3.2-3b-instruct-cn-grpo-16bit
This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.
- Downloads last month
- 9
Hardware compatibility
Log In to add your hardware
4-bit
5-bit
8-bit
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support
Model tree for lucaelin/llama-3.2-3b-instruct-cn-grpo-gguf
Base model
lucaelin/llama-3.2-3b-instruct-cn-grpo-16bit