Uploaded finetuned model

  • Developed by: Entity-27th
  • License: apache-2.0
  • Finetuned from model : unsloth/Qwen3.5-9B
  • Hardware: AMD Instinct MI300X x 1

This qwen3_5 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Stellar Pro is a variant of Qwen3.5-9B, PEFT'd and distilled with gemini-3.1-pro-hard-high-reasoning dataset. Trained on a single MI300X GPU, Stellar Pro is designed to enhance the base model's reasoning capabilities via distillation from Gemini 3.1 Pro.

Downloads last month
82
Safetensors
Model size
10B params
Tensor type
BF16
·
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Entity-27th/Stellar-Pro-9B

Finetuned
Qwen/Qwen3.5-9B
Finetuned
unsloth/Qwen3.5-9B
Finetuned
(2)
this model
Quantizations
1 model

Dataset used to train Entity-27th/Stellar-Pro-9B